Yu Kong

Department of Computer Science and Engineering, Michigan State University

yukong.jpg

Office: EB 3573

Email: yukong@msu.edu

Dr. Yu Kong is an Assistant Professor of the Department of Computer Science and Engineering, Michigan State University. He is now directing the ACTION Lab. He serves Pattern Recognition and IEEE Transactions on Multimedia as an Associate Editor. He also serves CVPR, ICCV, IJCAI, AAAI, Multimedia, FG, etc., as an Area Chair. He is an IEEE Senior Member.

Dr. Kong is interested in video modeling and human modeling for decision-making support, with the goal of enabling intelligent systems to perceive, reason, and act in complex and dynamic environments. Technically, his work centers on vision–language modeling, human action understanding, and world models. He is particularly interested in learning structured representations from video that capture human and agent intent, actions, and environment dynamics, and in grounding these representations in language to support flexible querying, explanation, and planning. More broadly, his research aims to bridge perception, reasoning, and action by developing multimodal models that support embodied and decision-centric intelligence.

Openings: We have openings for Fall 2026 for Ph.D. students, Visiting Scholars, and Research Assistants. You can either 1) fill in the external contact form (preferred) or 2) send your application package to actionlab@msu.edu. I will review applications periodically. Due to large volume of applications, I am not able to track and respond to every applicant. How to apply?

news

Apr 02, 2026 One paper about media forenisics has been accepted by FG 2026. This is the Zhongyi’s first paper at ACTION Lab. Congratulations to Zhongyi and collaborators!
Feb 21, 2026 We have two papers accepted by CVPR 2026. Congratulations to Zhanbo and our collaborators!
Jan 26, 2026 One paper about mistake detection has been accepted by ICLR 2026. This is the Wenliang’s first paper at ACTION Lab. Congratulations to Wenliang and Yujiang!
Nov 10, 2025 One paper about Egocentric Video Generation has been accepted by WACV 2026. Congratulations to Yujiang!
Sep 25, 2025 Our project on Neuro-Symbolic Video Understanding ($50K) is selected for funding by Jenison Fund.
Sep 18, 2025 We have one paper accepted by NeurIPS 2025. Congratulations to Yifan!
Aug 16, 2025 I will be serving CVPR 2026 as an Area Chair.
Jun 25, 2025 We have one paper accepted by ICCV 2025. Congratulations to Yifan!
Mar 31, 2025 We have one paper accepted by CVPR 2025 Workshop on Efficient Large Vision Models. Congratulations to Yifan!
Feb 26, 2025 We have one paper accepted by CVPR 2025. Congratulations to Zhanbo on his first CVPR paper at ACTION Lab!

selected publications

  1. CVPR Spotlight
    Zhanbo2026a.pdf
    Unlocking Motion from Large Vision Models with a Semantic and Kinematic Duality for Gait Recognition
    Zhanbo Huang, Dingqiang Ye, Xiaoming Liu, and Yu Kong
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026
    Spotlight
  2. WACV
    Yujiang2025.pdf
    Show Me: Generating Instructional Videos with Diffusion Models
    Yujiang Pu, Zhanbo Huang, Vishnu Boddeti, and Yu Kong
    In Winter Conference on Applications of Computer Vision (WACV), 2026
  3. NeurIPS
    YifanNeurIPS2025.pdf
    IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios
    Yifan Li, Yuhang Chen, Anh Dao, Lichi Li, Zhongyi Cai, Zhen Tan, Tianlong Chen, and 1 more author
    In NeurIPS DB Track, 2025
  4. CVPR Highlight
    Zhanbo2025.pdf
    Learning Human-centric Motion Representation for Action Analysis
    Zhanbo Huang, Xiaoming Liu, and Yu Kong
    In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025
    Highlight