Semdedup: Data-efficient learning at web-scale through semantic deduplication A Abbas, K Tirumala, D Simig, S Ganguli, AS Morcos ICLR 2023: Multimodal Representation Learning Workshop, 2023 | 97 | 2023 |
Progress and limitations of deep networks to recognize objects in unusual poses A Abbas, S Deny AAAI 2023, 2022 | 18 | 2022 |
Sieve: Multimodal dataset pruning using image captioning models A Mahmoud, M Elhoushi, A Abbas, Y Yang, N Ardalani, H Leather, ... CVPR 2024, 2023 | 8 | 2023 |
Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alexandros G J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, and Vaishaal Shankar, 2024 | 7 | 2024 |
Effective pruning of web-scale datasets based on complexity of concept clusters A Abbas, E Rusak, K Tirumala, W Brendel, K Chaudhuri, AS Morcos ICLR 2024, 2024 | 6 | 2024 |
Semdedup: Data-efficient learning at web-scale through semantic deduplication, 2023 A Abbas, K Tirumala, D Simig, S Ganguli, AS Morcos Zaharia, M., Zhang, M., Zhang, T., Zhang, X., Zhang, Y., Zheng, L., Zhou, K …, 2021 | 5 | 2021 |
DataComp-LM: In search of the next generation of training sets for language models J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... arXiv preprint arXiv:2406.11794, 2024 | 4 | 2024 |
Humans Beat Deep Networks at Recognizing Objects in Unusual Poses, Given Enough Time N Ollikka, A Abbas, A Perin, M Kilpeläinen, S Deny arXiv preprint arXiv:2402.03973, 2024 | | 2024 |