Michal Valko
Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind
Verified email at meta.com
Cited by 12170
Research interests: fine-tuning LLMs, RL with human feedback, deep reinforcement learning