Jonathan D. Chang
Ph.D. Student, Cornell University
Verified email at cornell.edu
Title
Cited by
Year
Mitigating covariate shift in imitation learning via offline data with partial coverage
J Chang, M Uehara, D Sreenivas, R Kidambi, W Sun
Advances in Neural Information Processing Systems 34, 965-979, 2021
Cited by: 89, Year: 2021
MobILE: Model-based imitation learning from observation alone
R Kidambi, J Chang, W Sun
Advances in Neural Information Processing Systems 34, 28598-28611, 2021
Cited by: 42, Year: 2021
Learning to generate better than your LLM
JD Chang, K Brantley, R Ramamurthy, D Misra, W Sun
arXiv preprint arXiv:2306.11816, 2023
Cited by: 25, Year: 2023
Dataset reset policy optimization for RLHF
JD Chang, W Zhan, O Oertell, K Brantley, D Misra, JD Lee, W Sun
arXiv preprint arXiv:2404.08495, 2024
Cited by: 16, Year: 2024
Learning deep parameterized skills from demonstration for re-targetable visuomotor control
J Chang, N Kumar, S Hastings, A Gokaslan, D Romeres, D Jha, ...
arXiv preprint arXiv:1910.10628, 2019
Cited by: 15, Year: 2019
Learning Bellman complete representations for offline policy evaluation
J Chang, K Wang, N Kallus, W Sun
International Conference on Machine Learning, 2938-2971, 2022
Cited by: 10, Year: 2022
REBEL: Reinforcement learning via regressing relative rewards
Z Gao, JD Chang, W Zhan, O Oertell, G Swamy, K Brantley, T Joachims, ...
arXiv preprint arXiv:2404.16767, 2024
Cited by: 8, Year: 2024
Using unsupervised clustering to identify pregnancy co-morbidities
J Chang, IN Sarkar
AMIA Summits on Translational Science Proceedings 2019, 305, 2019
Cited by: 7, Year: 2019
Using self organizing maps to compare sepsis patients from the neonatal and adult intensive care unit
B Goddard, J Chang, IN Sarkar
AMIA Summits on Translational Science Proceedings 2019, 127, 2019
Cited by: 3, Year: 2019
Policy-Gradient Training of Language Models for Ranking
G Gao, JD Chang, C Cardie, K Brantley, T Joachims
arXiv preprint arXiv:2310.04407, 2023
Cited by: 2, Year: 2023
Critique-out-Loud Reward Models
Z Ankner, M Paul, B Cui, JD Chang, P Ammanabrolu
arXiv preprint arXiv:2408.11791, 2024
Cited by: 1, Year: 2024
RL for consistency models: Faster reward guided text-to-image generation
O Oertell, JD Chang, Y Zhang, K Brantley, W Sun
arXiv preprint arXiv:2404.03673, 2024
Cited by: 1, Year: 2024
Adversarial Imitation Learning via Boosting
J Chang, D Sreenivas, Y Huang, K Brantley, W Sun
International Conference on Learning Representations, 2024
Cited by: 1, Year: 2024
Mitigating covariate shift in imitation learning via offline data without great coverage
JD Chang, M Uehara, D Sreenivas, R Kidambi, W Sun
arXiv preprint arXiv:2106.03207, 2021
Year: 2021
Articles 1–14