I am currently a Ph.D. candidate at the HMI Lab, NERCV²T, School of Computer Science, Peking University, supervised by Prof. Shanghang Zhang. I received my Bachelor’s degree in Artificial Intelligence (Turing Honor Degree) from Peking University in 2023, where I also obtained a Bachelor’s degree in Economics.
My research interests lie in multimodal large language models, including visual foundation models, vision-language models, unified multimodal models, complex visual reasoning, visual model efficiency, and visual continual learning. The overall goal of my research is to develop a large-scale, efficient visual perception system with human-like expression, adaptation, and generalization, equipped with powerful abilities including fundamental perception, cognitive reasoning, and autonomous creativity.
📧 Email: theia@pku.edu.cn, theia4869@gmail.com
Feel free to reach out for collaboration!

