Christopher Olah

Cited by

	All	Since 2019
Citations	79990	67695
h-index	41	40
i10-index	57	54

13000

6500

3250

9750

201620172018201920202021202220232024838 3326 7512 10208 11510 12413 11681 12424 9437

Co-authors

Dario AmodeiCEO and Co-Founder at AnthropicVerified email at anthropic.com
Jacob SteinhardtStanford UniversityVerified email at cs.stanford.edu
John SchulmanAnthropicVerified email at anthropic.com
Vincent DumoulinResearch ScientistVerified email at google.com
Andrew DaiGoogle DeepMindVerified email at google.com
Quoc V. LeResearch Scientist, GoogleVerified email at stanford.edu
Greg CorradoGoogle ResearchVerified email at google.com

Christopher Olah

Anthropic

Verified email at google.com - Homepage

Machine Learning Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
TensorFlow: Large-scale machine learning on heterogeneous systems M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ...	56941*	2015
Conditional image synthesis with auxiliary classifier gans A Odena, C Olah, J Shlens International conference on machine learning, 2642-2651, 2017	4132	2017
Understanding LSTM Networks C Olah colah.github.io, 2015	3067*	2015
Concrete problems in AI safety D Amodei, C Olah, J Steinhardt, P Christiano, J Schulman, D Mané arXiv preprint arXiv:1606.06565, 2016	2902	2016
Deconvolution and Checkerboard Artifacts A Odena, V Dumoulin, C Olah Distill, 2016	1875	2016
Feature visualization C Olah, A Mordvintsev, L Schubert Distill 2 (11), e7, 2017	1441*	2017
Training a helpful and harmless assistant with reinforcement learning from human feedback Y Bai, A Jones, K Ndousse, A Askell, A Chen, N DasSarma, D Drain, ... arXiv preprint arXiv:2204.05862, 2022	1099	2022
Inceptionism: Going deeper into neural networks A Mordvintsev, C Olah, M Tyka Google research blog 20 (14), 5, 2015	1041*	2015
Constitutional ai: Harmlessness from ai feedback Y Bai, S Kadavath, S Kundu, A Askell, J Kernion, A Jones, A Chen, ... arXiv preprint arXiv:2212.08073, 2022	875	2022
The building blocks of interpretability C Olah, A Satyanarayan, I Johnson, S Carter, L Schubert, K Ye, ... Distill 3 (3), e10, 2018	860*	2018
Document embedding with paragraph vectors AM Dai arXiv preprint arXiv:1507.07998, 2015	577	2015
A mathematical framework for transformer circuits N Elhage, N Nanda, C Olsson, T Henighan, N Joseph, B Mann, A Askell, ... Transformer Circuits Thread 1 (1), 12, 2021	413*	2021
In-context learning and induction heads C Olsson, N Elhage, N Nanda, N Joseph, N DasSarma, T Henighan, ... arXiv preprint arXiv:2209.11895, 2022	385*	2022
Red teaming language models to reduce harms: Methods, scaling behaviors, and lessons learned D Ganguli, L Lovitt, J Kernion, A Askell, Y Bai, S Kadavath, B Mann, ... arXiv preprint arXiv:2209.07858, 2022	341	2022
Zoom in: An introduction to circuits C Olah, N Cammarata, L Schubert, G Goh, M Petrov, S Carter Distill 5 (3), e00024. 001, 2020	327	2020
Multimodal neurons in artificial neural networks G Goh, N Cammarata, C Voss, S Carter, M Petrov, L Schubert, A Radford, ... Distill 6 (3), e30, 2021	326	2021
A general language assistant as a laboratory for alignment A Askell, Y Bai, A Chen, D Drain, D Ganguli, T Henighan, A Jones, ... arXiv preprint arXiv:2112.00861, 2021	303	2021
Activation atlas S Carter, Z Armstrong, L Schubert, I Johnson, C Olah Distill 4 (3), e15, 2019	263*	2019
Predictability and surprise in large generative models D Ganguli, D Hernandez, L Lovitt, A Askell, Y Bai, A Chen, T Conerly, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	243	2022
Toy models of superposition N Elhage, T Hume, C Olsson, N Schiefer, T Henighan, S Kravec, ... arXiv preprint arXiv:2209.10652, 2022	204	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors