I am currently a Research Scientist at Apple. I got my PhD from UW CSE with Yejin Choi in 2024. Prior to this, I worked five years at Microsoft Research. My research experience is on Dialog, Deep Reinforcement Learning, NLP, Vision and Language, Multimodal LLMs etc. Recently, I am also interested in Video Generation.
Enrico Fini, Mustafa Shukor, Xiujun Li, Philipp Dufter, Michal Klein, David Haldimann, Sai Aitharaju, Victor Guilherme Turrisi da Costa, Louis Béthune, Zhe Gan, Alexander T Toshev, Marcin Eichner, Moin Nabi, Yinfei Yang, Joshua M. Susskind, Alaaeldin El-Nouby Multimodal Autoregressive Pre-training of Large Vision Encoders,
arXiv 2024