Thomas Mesnard

Cited by

	All	Since 2019
Citations	1732	1532
h-index	14	14
i10-index	14	14

820

410

205

615

201520162017201820192020202120222023202416 31 71 74 84 119 134 138 234 820

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Rémi MunosGoogle DeepMindVerified email at inria.fr
Bilal PiotGoogle DeepmindVerified email at google.com
Will DabneyDeepMindVerified email at google.com
Theophane WeberResearch Scientist at DeepMindVerified email at google.com
Doina PrecupDeepMind and McGill UniversityVerified email at cs.mcgill.ca
Eric MoulinesProfesseur, Ecole Polytechnique, Membre de l'Académie des SciencesVerified email at polytechnique.edu
Armand JoulinGoogle DeepMindVerified email at google.com
Laurent SifreGoogle DeepMindVerified email at polytechnique.edu
Demis HassabisDeepMind
Jeff DeanGoogle Chief Scientist, Google Research and Google DeepMindVerified email at google.com
koray kavukcuogluDeepMindVerified email at kavukcuoglu.org
Clement FarabetEx Research Scientist, New York UniversityVerified email at nyu.edu
Oriol VinyalsResearch Scientist at Google DeepMindVerified email at google.com
Noah FiedelGoogleVerified email at engineeralum.berkeley.edu

Thomas Mesnard

Research Scientist at Google DeepMind

Verified email at google.com

LLM Reinforcement Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Towards biologically plausible deep learning Y Bengio, DH Lee, J Bornschein, T Mesnard, Z Lin arXiv preprint arXiv:1502.04156, 2015	436	2015
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	371	2024
Rlaif: Scaling reinforcement learning from human feedback with ai feedback H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ... arXiv preprint arXiv:2309.00267, 2023	294	2023
An objective function for STDP Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu arXiv preprint arXiv:1509.05936 5 (6.2), 6.3, 2015	184*	2015
Hindsight credit assignment A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ... Advances in neural information processing systems 32, 2019	97	2019
Counterfactual credit assignment in model-free reinforcement learning T Mesnard, T Weber, F Viola, S Thakoor, A Saade, A Harutyunyan, ... arXiv preprint arXiv:2011.09464, 2020	69	2020
Nash learning from human feedback R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ... arXiv preprint arXiv:2312.00886, 2023	52	2023
Generalization of equilibrium propagation to vector field dynamics B Scellier, A Goyal, J Binas, T Mesnard, Y Bengio arXiv preprint arXiv:1808.04873, 2018	48*	2018
Direct language model alignment from online ai feedback S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ... arXiv preprint arXiv:2402.04792, 2024	44	2024
Geometric entropic exploration ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ... arXiv preprint arXiv:2101.02055, 2021	40	2021
Towards deep learning with spiking neurons in energy based models with contrastive hebbian plasticity T Mesnard, W Gerstner, J Brea arXiv preprint arXiv:1612.03214, 2016	27	2016
Charline Le Lan, Christopher A G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...	18	2024
Gemma 2: Improving open language models at a practical size G Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ... arXiv preprint arXiv:2408.00118, 2024	17	2024
Curiosity in hindsight: Intrinsic exploration in stochastic environments D Jarrett, C Tallec, F Altché, T Mesnard, R Munos, M Valko	15	2023
Ghost units yield biologically plausible backprop in deep neural networks T Mesnard, G Vignoud, J Sacramento, W Senn, Y Bengio arXiv preprint arXiv:1911.08585, 2019	7	2019
A survey of temporal credit assignment in deep reinforcement learning E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni arXiv preprint arXiv:2312.01072, 2023	5	2023
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024	3	2024
Quantile credit assignment T Mesnard, W Chen, A Saade, Y Tang, M Rowland, T Weber, C Lyle, ... International Conference on Machine Learning, 24517-24531, 2023	3	2023
Activation alignment: exploring the use of approximate activity gradients in multilayer networks T Mesnard, B Richards 2018 Conference on Cognitive Computational Neuroscience, Brentwood …, 2018	1	2018
Connectionist Temporal Classification: Labelling Unsegmented Sequences with Recurrent Neural Networks A AUVOLAT, T MESNARD	1	2006

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors