A corpus for multilingual document classification in eight languages H Schwenk, X Li arXiv preprint arXiv:1805.09821, 2018 | 158 | 2018 |
Adaptive sparse transformer for multilingual translation H Gong, X Li, D Genzel arXiv preprint arXiv:2104.07358, 2021 | 15 | 2021 |
Addressing posterior collapse with mutual information for improved variational neural machine translation AD McCarthy, X Li, J Gu, N Dong Proceedings of the 58th annual meeting of the association for computational …, 2020 | 20 | 2020 |
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM S Sukhbaatar, O Golovneva, V Sharma, H Xu, XV Lin, B Rozière, J Kahn, ... arXiv preprint arXiv:2403.07816, 2024 | | 2024 |
Characterize Investor Attention on the Social Web X Li, J Hendler, JL Teall Available at SSRN 2256400, 2013 | | 2013 |
Cross-lingual retrieval for iterative self-supervised training C Tran, Y Tang, X Li, J Gu Advances in Neural Information Processing Systems 33, 2207-2219, 2020 | 68 | 2020 |
Data-gov wiki: Towards linking government data L Ding, D DiFranzo, A Graves, JR Michaelis, X Li, DL McGuinness, ... 2010 AAAI spring symposium series, 2010 | 126 | 2010 |
Deep transformers with latent depth X Li, A Cooper Stickland, Y Tang, X Kong Advances in Neural Information Processing Systems 33, 1736-1746, 2020 | 21 | 2020 |
Design and evaluation of a social media writing support tool for people with dyslexia S Wu, L Reynolds, X Li, F Guzmán Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems …, 2019 | 40 | 2019 |
Distributionally robust multilingual machine translation C Zhou, D Levy, X Li, M Ghazvininejad, G Neubig arXiv preprint arXiv:2109.04020, 2021 | 23 | 2021 |
Do language models have beliefs? methods for detecting, updating, and visualizing model beliefs P Hase, M Diab, A Celikyilmaz, X Li, Z Kozareva, V Stoyanov, M Bansal, ... arXiv preprint arXiv:2111.13654, 2021 | 66 | 2021 |
Dynamics of investor attention on the social web X Li Rensselaer Polytechnic Institute, 2013 | | 2013 |
Efficient language modeling with sparse all-mlp P Yu, M Artetxe, M Ott, S Shleifer, H Gong, V Stoyanov, X Li arXiv preprint arXiv:2203.06850, 2022 | 11 | 2022 |
Efficient large scale language modeling with mixtures of experts M Artetxe, S Bhosale, N Goyal, T Mihaylov, M Ott, S Shleifer, XV Lin, J Du, ... arXiv preprint arXiv:2112.10684, 2021 | 80 | 2021 |
Few-shot learning with multilingual language models XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ... arXiv preprint arXiv:2112.10668, 2021 | 321* | 2021 |
Financial and economic data management using Semantic Web technologies X Li 2012 IEEE Conference on Computational Intelligence for Financial Engineering …, 2012 | 1 | 2012 |
Findings of the first shared task on machine translation robustness X Li, P Michel, A Anastasopoulos, Y Belinkov, N Durrani, O Firat, P Koehn, ... arXiv preprint arXiv:1906.11943, 2019 | 67 | 2019 |
Findings of the fourth workshop on neural generation and translation K Heafield, H Hayashi, Y Oda, A Finch, G Neubig, X Li, A Birch The 4th Workshop on Neural Generation and Translation, 1-9, 2020 | 17 | 2020 |
Findings of the WMT 2020 shared task on machine translation robustness L Specia, Z Li, J Pino, V Chaudhary, F Guzmán, G Neubig, N Durrani, ... Proceedings of the Fifth Conference on Machine Translation, 76-91, 2020 | 29 | 2020 |
Flowseq: Non-autoregressive conditional sequence generation with generative flow X Ma, C Zhou, X Li, G Neubig, E Hovy arXiv preprint arXiv:1909.02480, 2019 | 203 | 2019 |