headshot

Xingyu Fu (ๅบœๆ˜Ÿๅฆค)

Email: xingyufu@princeton.edu

๐Ÿ‘‹ Hi, I am Xingyu Fu, an incoming Postdoctral Researcher at Princeton Language and Intelligence. I'm currently a fifth-year PhD student in Computer Science at the University of Pennsylvania advised by Prof. Dan Roth. During my PhD, I have interned at Microsoft and AWS AI Labs. I received my B.S. in Computer Science from UIUC in 2020, where I was very fortunate to be advised by Prof. Jiawei Han.

My research primarily focuses on generative multimodal models at the intersection between vision and natural language (e.g., multimodal LLMs, text-to-image/video generation, omni models). I aim to improve the perception and reasoning capabilities of multimodal models by bridging them together. I have built better evaluations for emergent abilities, and used synthetic data to design models that can better perceive and reason about the multimodal world.

I'm always open to collaborations. Send me an email if you're interested!

๐ŸŒŸ Recent highlights

๐Ÿ“‘ Research Projects


๐ŸŽค Invited Talks

๐Ÿ’ผ Work Experience