Systematic generalization: what is required and can it be learned? D Bahdanau, S Murty, M Noukhovitch, TH Nguyen, H de Vries, A Courville arXiv preprint arXiv:1811.12889, 2018 | 238* | 2018 |
Pretraining representations for data-efficient reinforcement learning M Schwarzer, N Rajkumar, M Noukhovitch, A Anand, L Charlin, RD Hjelm, ... Advances in Neural Information Processing Systems 34, 12686-12699, 2021 | 140 | 2021 |
Emergent communication under competition M Noukhovitch, T LaCroix, A Lazaridou, A Courville arXiv preprint arXiv:2101.10276, 2021 | 31 | 2021 |
Commonsense mining as knowledge base completion? A study on the impact of novelty S Jastrzębski, D Bahdanau, S Hosseini, M Noukhovitch, Y Bengio, ... arXiv preprint arXiv:1804.09259, 2018 | 29 | 2018 |
Simplicial embeddings in self-supervised learning and downstream classification S Lavoie, C Tsirigotis, M Schwarzer, A Vani, M Noukhovitch, K Kawaguchi, ... arXiv preprint arXiv:2204.00616, 2022 | 13 | 2022 |
Oríon: Experiment version control for efficient hyperparameter optimization C Tsirigotis, X Bouthillier, F Corneau-Tremblay, P Henderson, R Askari, ... | 13* | 2018 |
Language model alignment with elastic reset M Noukhovitch, S Lavoie, F Strub, AC Courville Advances in Neural Information Processing Systems 36, 2024 | 10 | 2024 |
The N+ Implementation Details of RLHF with PPO: A Case Study on TL; DR Summarization S Huang, M Noukhovitch, A Hosseini, K Rasul, W Wang, L Tunstall arXiv preprint arXiv:2403.17031, 2024 | 2 | 2024 |
EMERGENCE OF COMMUNICATION WITH SELFISH AGENTS M NOUKHOVITCH, T LACROIX, A LAZARIDOU, A COURVILLE LANGUAGE of, 314, 2020 | | 2020 |
In-Context Learning, Can It Break Safety? S Xhonneux, D Dobre, M Noukhovitch, J Tang, G Gidel, D Sridhar ICML 2024 Next Generation of AI Safety Workshop, 0 | | |
Learning Multi-Agent Communication with Contrastive Learning YL Lo, B Sengupta, JN Foerster, M Noukhovitch The Twelfth International Conference on Learning Representations, 0 | | |
Countering Language Drift with KL Regularization M Noukhovitch, S Lavoie, IH Laradji, D Kiela, F Strub, A Courville | | |
Selfish Emergent Communication M Noukhovitch, T LaCroix, A Courville | | |