Pengchuan Zhang*, Xiujun Li*, Xiaowei Hu, Jianwei Yang, Lei Zhang, Lijuan Wang, Yejin Choi, Jianfeng Gao. VinVL: Making Visual Representations Matter in Vision-Language Models,
CVPR 2021. Impact: OSCAR+ is the current SoTA on seven downstream tasks and is No. 1 on three public leaderboards (VQA, Image Captioning, and NoCaps).
Xiujun Li, Xi Yin, Chunyuan Li, Xiaowei Hu, Pengchuan Zhang, Lei Zhang, Lijuan Wang, Houdong Hu, Li Dong, Furu Wei, Yejin Choi, Jianfeng Gao. Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks,
ECCV 2020. Impact: Oscar is widely used in several Microsoft products.