Albert: A lite bert for self-supervised learning of language representations Z Lan arXiv preprint arXiv:1909.11942, 2019 | 8023 | 2019 |
Conceptual captions: A cleaned, hypernymed, image alt-text dataset for automatic image captioning P Sharma, N Ding, S Goodman, R Soricut Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018 | 2560 | 2018 |
Pali: A jointly-scaled multilingual language-image model X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ... arXiv preprint arXiv:2209.06794, 2022 | 619 | 2022 |
Albert: A lite bert for self-supervised learning of language representations. arXiv 2019 Z Lan, M Chen, S Goodman, K Gimpel, P Sharma, R Soricut arXiv preprint arXiv:1909.11942, 1909 | 235 | 1909 |
Scaling up models and data with t5x and seqio A Roberts, HW Chung, G Mishra, A Levskaya, J Bradbury, D Andor, ... Journal of Machine Learning Research 24 (377), 1-8, 2023 | 154 | 2023 |
Pali-x: On scaling up a multilingual vision and language model X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ... arXiv preprint arXiv:2305.18565, 2023 | 148 | 2023 |
Pali-3 vision language models: Smaller, faster, stronger X Chen, X Wang, L Beyer, A Kolesnikov, J Wu, P Voigtlaender, B Mustafa, ... arXiv preprint arXiv:2310.09199, 2023 | 62 | 2023 |
Bridging the gap between practice and pac-bayes theory in few-shot meta-learning N Ding, X Chen, T Levinboim, S Goodman, R Soricut Advances in Neural Information Processing Systems 34, 29506-29516, 2021 | 33 | 2021 |
Prestu: Pre-training for scene-text understanding J Kil, S Changpinyo, X Chen, H Hu, S Goodman, WL Chao, R Soricut Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 22 | 2023 |
Teaforn: Teacher-forcing with n-grams S Goodman, N Ding, R Soricut arXiv preprint arXiv:2010.03494, 2020 | 22 | 2020 |
CausalLM is not optimal for in-context learning N Ding, T Levinboim, J Wu, S Goodman, R Soricut arXiv preprint arXiv:2308.06912, 2023 | 20 | 2023 |
Understanding image and text simultaneously: a dual vision-language machine comprehension task N Ding, S Goodman, F Sha, R Soricut arXiv preprint arXiv:1612.07833, 2016 | 13 | 2016 |
System and method for positioning solar panels with automated drones S Goodman US Patent 10,439,550, 2019 | 11 | 2019 |
Multi-image summarization: Textual summary from a set of cohesive images N Trieu, S Goodman, P Narayana, K Sone, R Soricut arXiv preprint arXiv:2006.08686, 2020 | 6 | 2020 |
On Scaling Up a Multilingual Vision and Language Model X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 4 | 2024 |
Multi-stage pretraining for abstractive summarization S Goodman, Z Lan, R Soricut arXiv preprint arXiv:1909.10599, 2019 | 2 | 2019 |
TeaForN: Teacher-Forcing with N-grams N Ding, R Soricut, SA Goodman | | 2020 |
Xiujun Fan, DVM Ph. D. X Fan, M Petitt, M Gamboa, M Huang, S Dhal, ML Druzin, JC Wu, ... Nitric oxide 22 (10), 3571-3580, 2008 | | 2008 |
PreSTU: Pre-Training for Scene-Text Understanding (Supplementary Material) J Kil, S Changpinyo, X Chen, H Hu, S Goodman, WL Chao, R Soricut | | |