About Me
Hi! I am Yuanhao Zou, a first-year Computer Science PhD student at the University of Central Florida, advised by Prof. Chen Chen at the Center for Research in Computer Vision (CRCV).
I received my Master’s degree in Electrical and Computer Engineering from the University of Michigan, Ann Arbor, where I was advised by Prof. Zhaozheng Yin on Medical Vision and Language Models. Before that, I obtained my Bachelor’s degree in Computer Science from Central South University, working with Prof. Xiangjian He on Medical Image Segmentation.
Research Interests
My research focuses on building efficient and scalable multimodal models that can understand the visual world over time. Specifically, I am interested in:
- Video Understanding — Temporal grounding, long-form video reasoning, and frame selection.
- Efficient Vision-Language Models — Compact, on-device VLMs that keep strong perception and reasoning under tight compute budgets.
- Video Anomaly Detection — Detecting and localizing anomalous events in real-world surveillance and edge scenarios.
Previously, I also worked extensively on Medical Vision-Language Models (visual question answering, image–text retrieval) and Medical Image Segmentation.
News
- [Jun 2026] CoMET-Bench and CoMET Agent submitted to NeurIPS 2026. 🎉
- [Jun 2026] CLARITY is accepted to ECCV 2026!
- [Feb 2026] How Should Video LLMs Output Time? is accepted to a CVPR 2026 Workshop.
- [Jan 2026] A.I.R. is accepted to ICLR 2026!
- [Aug 2025] Started my PhD journey at UCF CRCV with Prof. Chen Chen. 🐊
Contact
Feel free to reach out via yuanhaoz@ucf.edu if you would like to chat about research or collaboration.
