I am a Research Scientist at Google DeepMind, working on Gemini Multimodal.
Email / CV / GitHub / Google Scholar / LinkedIn / Twitter (X) / Blog
Recent talk on criticizing and creating vision-language models. [YouTube: English, Chinese]
Hyunsik Chae, Seungwoo Yoon, Jaden Park, Chloe Yewon Chun, Yongin Cho, Mu Cai, Yong Jae Lee, Ernest K. Ryu
Shubham Bharti, Shiyun Cheng, Jihyun Rho, Jianrui Zhang, Mu Cai, Yong Jae Lee, Martina Rau, Xiaojin Zhu