Robert Kirk

Cited by

	All	Since 2019
Citations	587	587
h-index	6	6
i10-index	6	6

280

140

210

20212022202320245 119 194 263

Co-authors

Edward GrefenstetteDirector of Research, Google DeepMind | Honorary Professor, UCLVerified email at google.com
Tim RocktäschelProfessor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMindVerified email at cs.ucl.ac.uk
Eric HambroAnthropicVerified email at anthropic.com
David Scott KruegerUniversity Assistant Professor, University of CambridgeVerified email at cam.ac.uk
Amy ZhangAssistant Professor of Electrical and Computer Engineering at University of Texas at AustinVerified email at austin.utexas.edu
Minqi JiangResearch Scientist at Google DeepMindVerified email at ucl.ac.uk
Usman AnwarUniversity of CambridgeVerified email at cam.ac.uk
Vitaly KurinResearch Scientist at Isomorphic LabsVerified email at isomorphiclabs.com
Mikayel SamvelyanMeta AI, UCLVerified email at meta.com
Fabio PetroniSamaya AIVerified email at samaya.ai
Heinrich KüttlerxAIVerified email at math.lmu.de
Jack Parker-HolderGoogle DeepMind, UCLVerified email at google.com
Hidenori TanakaGroup Leader, CBS-NTT Program in "Physics of Intelligence", Harvard UniversityVerified email at fas.harvard.edu
Robert DickUniversity of Michigan, StrydVerified email at rpdmail.dyndns.org
Ekdeep Singh LubanaHarvard / NTTVerified email at fas.harvard.edu
Samyak JainUndergrad at Indian Institute of Technology(BHU),VaranasiVerified email at itbhu.ac.in
Christoforos NalmpantisPostdoctoral Researcher, Fundamental AI Research at MetaVerified email at fb.com
Jelena LuketinaOxford UniversityVerified email at cs.ox.ac.uk
Ishita MedirattaMeta FAIRVerified email at meta.com
Thomas CosteNoah's Ark Lab & University of CambridgeVerified email at cam.ac.uk

Robert Kirk

PhD Student, University College London

Verified email at ucl.ac.uk - Homepage

AI Alignment AI Safety Language Models Fine-tuning Generalisation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning R Kirk, A Zhang, E Grefenstette, T Rocktäschel Journal of Artificial Intelligence Research 76, 201-264, 2023	341	2023
MiniHack the Planet: A Sandbox for Open-ended Reinforcement Learning Research M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ... NeurIPS 2021 Datasets and Benchmarks Track, 2021	85	2021
Understanding the Effects of RLHF on LLM Generalisation and Diversity R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ... ICLR 2024, 2023	51	2023
Reward Model Ensembles Help Mitigate Overoptimization T Coste, U Anwar, R Kirk, D Krueger ICLR 2024, 2023	48	2023
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks S Jain, R Kirk, ES Lubana, RP Dick, H Tanaka, E Grefenstette, ... ICLR 2024, 2023	26	2023
Insights from the neurips 2021 nethack challenge E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ... NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022	18	2022
A study of off-policy learning in environments with procedural content generation A Ehrenberg, R Kirk, M Jiang, E Grefenstette, T Rocktäschel ICLR Workshop on Agent Learning in Open-Endedness, 2022	6	2022
Generalization to new sequential decision making tasks with in-context learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu ICML 2024, 2023	5	2023
Graph backup: Data efficient backup exploiting markovian transitions Z Jiang, T Zhang, R Kirk, T Rocktäschel, E Grefenstette arXiv preprint arXiv:2205.15824, 2022	4*	2022
Analyzing the Generalization and Reliability of Steering Vectors D Tan, D Chanin, A Lynch, D Kanoulas, B Paige, A Garriga-Alonso, R Kirk arXiv preprint arXiv:2407.12404, 2024	1	2024
Leading the Pack: N-player Opponent Shaping A Souly, T Willi, A Khan, R Kirk, C Lu, E Grefenstette, T Rocktäschel arXiv preprint arXiv:2312.12564, 2023	1	2023
Domain Generalization for Robust Model-Based Offline Reinforcement Learning A Clark, SA Siddiqui, R Kirk, U Anwar, S Chung, D Krueger arXiv preprint arXiv:2211.14827, 2022	1	2022
What Mechanisms Does Knowledge Distillation Distill? C Wu, ES Lubana, BK Mlodozeniec, R Kirk, D Krueger Proceedings of UniReps: the First Workshop on Unifying Representations in …, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–13

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors