publications

2025
-
CVPR HighlightLearning Human-centric Motion Representation for Action AnalysisIn IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025
-
arxivMake it NoisEasier: Boosting Text-to-Video Generation with Direct Noise OptimizationIn under review, 2025
-
arxivGaitPro: Nuisance-Invariant Gait Recognition via Condition Annotations and Proxy SamplesIn under review, 2025
-
arxivAre We Merely Justifying Results ex Post Facto? Quantifying Explanatory Inversion in Post-Hoc Model ExplanationsIn under review, 2025
-
arxivIndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial ScenariosIn under review, 2025
-
arxiv
-
-
-
ViT-Split: Unleashing the Power of Vision Foundation Models via Efficient Splitting HeadsIn under review, 2025
-
Continual Visual Question Answering Through Bayesian Mixture of Experts AggregationIn under review, 2025
-
arXivWindow Token Concatenation for Efficient Visual Large Language ModelsIn 2nd CVPR Workshop on Efficient Large Vision Models, 2025
-
Advancing Assessment Fairness and Equity in Medical Education with Artificial IntelligenceIn 2025 American Educational Research Association Annual Meeting, 2025
2024
2023
-
Using Computer Vision to Assess Students’ Safety Behaviors in an Objective Structured Clinical ExaminationIn The annual ChangeMedEd conference, 2023
-
arXiv
2022
2021
-
From Ensemble Clustering to Subspace Clustering: Cluster Structure EncodingIEEE Transactions on Neural Networks and Learning Systems (T-NNLS), 2021
-
Coupling Adversarial Graph Embedding for Transductive Zero-shot Action RecognitionNeurocomputing, 2021
2020
-
T-IPVisual Object Tracking Via Multi-Stream Deep Similarity Learning NetworksIEEE Transactions on Image Processing (T-IP), 2020
-
T-CSVTAligned Dynamic-Preserving Embedding for Zero-Shot Action RecognitionIEEE Transactions on Circuits and Systems for Video Technology, 2020
-
T-PAMIAdversarial Action Prediction NetworksIEEE Transaction on Pattern Analysis and Machine Intelligence (T-PAMI), 2020
2019
2018
-
Hierarchical and Spatio-Temporal Sparse Representation for Human Action RecognitionIEEE Transactions on Image Processing (T-IP), 2018
-
ICDMClustered Lifelong Learning via Representative Task SelectionIn IEEE International Conference on Data Mining (ICDM), 2018
-
CVPRResidual Dense Network for Image Super-ResolutionIn IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018
-
AAAIAction Prediction from Videos via Memorizing Hard-to-Predict SamplesIn AAAI Conference on Artificial Intelligence (AAAI), 2018
2017
2016
2015
2014
2013
-
FGActivity recognition by learning structural and pairwise mid-level features using random forestIn Automatic Face and Gesture Recognition (FG), 2013 10th IEEE International Conference and Workshops on, 2013