Language Models are Unsupervised Multitask Learners A Radford, J Wu, R Child, D Luan, D Amodei, I Sutskever | 15380* | 2019 |
PaLM: Scaling Language Modeling with Pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022 | 5453 | 2022 |
Generative Pretraining from Pixels M Chen, A Radford, R Child, J Wu, H Jun, P Dhariwal, D Luan, I Sutskever Proceedings of the 37th International Conference on Machine Learning, 2020 | 1851 | 2020 |
Show Your Work: Scratchpads for Intermediate Computation with Language Models M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ... arXiv preprint arXiv:2112.00114, 2021 | 572 | 2021 |
Language models are unsupervised multitask learners (2019) A Radford, J Wu, R Child, D Luan, D Amodei, I Sutskever | 323* | 2019 |
PaLM: Scaling language modeling with pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022 | 119 | 2022 |