Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models J Li, D Li, S Savarese, S Hoi International conference on machine learning, 19730-19742, 2023 | 4206 | 2023 |
Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation J Li, D Li, C Xiong, S Hoi International conference on machine learning, 12888-12900, 2022 | 3801 | 2022 |
Align before fuse: Vision and language representation learning with momentum distillation J Li, R Selvaraju, A Gotmare, S Joty, C Xiong, SCH Hoi Advances in neural information processing systems 34, 9694-9705, 2021 | 1944 | 2021 |
Dividemix: Learning with noisy labels as semi-supervised learning J Li, R Socher, SCH Hoi arXiv preprint arXiv:2002.07394, 2020 | 1186 | 2020 |
Prototypical contrastive learning of unsupervised representations J Li, P Zhou, C Xiong, SCH Hoi arXiv preprint arXiv:2005.04966, 2020 | 1124 | 2020 |
Learning to Learn from Noisy Labeled Data J Li, Y Wong, Q Zhao, MS Kankanhalli The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 | 408 | 2019 |
Codet5+: Open code large language models for code understanding and generation Y Wang, H Le, AD Gotmare, NDQ Bui, J Li, SCH Hoi arXiv preprint arXiv:2305.07922, 2023 | 368 | 2023 |
Comatch: Semi-supervised learning with contrastive graph regularization J Li, C Xiong, SCH Hoi Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 303 | 2021 |
Blip-diffusion: Pre-trained subject representation for controllable text-to-image generation and editing D Li, J Li, S Hoi Advances in Neural Information Processing Systems 36, 2024 | 218 | 2024 |
Align and prompt: Video-and-language pre-training with entity prompts D Li, J Li, H Li, JC Niebles, SCH Hoi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 212 | 2022 |
The devil is in classification: A simple framework for long-tail instance segmentation T Wang, Y Li, B Kang, J Li, J Liew, S Tang, S Hoi, J Feng Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 204 | 2020 |
Learning to detect human-object interactions with knowledge B Xu, Y Wong, J Li, Q Zhao, MS Kankanhalli Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 180 | 2019 |
Unsupervised Learning of View-invariant Action Representations J Li, Y Wong, Q Zhao, M Kankanhalli Conference on Neural Information Processing Systems (NeurIPS), 2018 | 126 | 2018 |
Interact as you intend: Intention-driven human-object interaction detection B Xu, J Li, Y Wong, Q Zhao, MS Kankanhalli IEEE Transactions on Multimedia 22 (6), 1423-1432, 2019 | 123 | 2019 |
From images to textual prompts: Zero-shot visual question answering with frozen large language models J Guo, J Li, D Li, AMH Tiong, B Li, D Tao, S Hoi Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 118 | 2023 |
Lavis: A library for language-vision intelligence D Li, J Li, H Le, G Wang, S Savarese, SCH Hoi arXiv preprint arXiv:2209.09019, 2022 | 118 | 2022 |
Mopro: Webly supervised learning with momentum prototypes J Li, C Xiong, SCH Hoi arXiv preprint arXiv:2009.07995, 2020 | 113 | 2020 |
Learning from noisy data with robust representation learning J Li, C Xiong, SCH Hoi Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 110 | 2021 |
Facial expression recognition using deep neural networks J Li, EY Lam 2015 IEEE International Conference on Imaging Systems and Techniques (IST), 1-6, 2015 | 101 | 2015 |
Plug-and-play vqa: Zero-shot vqa by conjoining large pretrained models with zero training AMH Tiong, J Li, B Li, S Savarese, SCH Hoi arXiv preprint arXiv:2210.08773, 2022 | 94 | 2022 |