Hi, I am a fourth-year Ph.D. student in the Computer Sciences Department at the University of Wisconsin-Madison, advised by Prof. Yong Jae Lee.
My research interests lie at the intersection of deep learning and computer vision. I am especially interested in multimodal generative models, visual prompting, and video and 3D understanding.
Email / CV / GitHub / Google Scholar / LinkedIn / Twitter (X) / Blog
Recent talk on compositional vision-language models in the input space. [YouTube link]