I received my Ph.D. degree from the Wangxuan Institute of Computer Technology, Peking University in 2020 (advised by Prof. Yuxin Peng), and my B.E. degree from the College of Computer Science, Nankai University in 2014 (advised by Prof. Jufeng Yang). I have authored more than 30 papers in venues including IJCV, TIP, TMM, TCSVT, CVPR, ICCV, ICDE, ACM MM, ACM SIGIR, IJCAI and AAAI. My research interests include fine-grained visual analysis, video retrieval, and multi-modal content analysis. I was one of the recipients of the 2020 CCF (China Computer Federation) Outstanding Doctoral Dissertation Award and the 2018 Baidu Scholarship, and was selected for the Young Elite Scientists Sponsorship Program by CAST in 2022.
I am actively seeking collaborations; please do not hesitate to send me an email.
Peng Wu, Wanshun Su, Xiangteng He, Peng Wang, Yanning Zhang, "VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval", AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, Pennsylvania, USA, February 25 – March 4, 2025. (Accepted)
Zhaoda Ye, Xiangteng He and Yuxin Peng, "RaT2IGen: Relation-aware Text-to-image Generation via Learnable Prompt", ACM Transactions on Multimedia Computing, Communications and Applications (TOMM), 2025. (Accepted)
Hongbo Sun, Xiangteng He, Jinglin Xu, Yuxin Peng, "SIM-OFE: Structure Information Mining and Object-aware Feature Enhancement for Fine-Grained Visual Categorization", IEEE Transactions on Image Processing (TIP), Vol. 33, pp. 5312-5326, Sep. 2024. [PDF]
Hongbo Sun, Jiahuan Zhou, Xiangteng He, Jinglin Xu and Yuxin Peng, "FineFMPL: Fine-grained Feature Mining Prompt Learning for Few-Shot Class Incremental Learning", International Joint Conference on Artificial Intelligence (IJCAI), pp. 1299-1307, Jeju, South Korea, August 3-9, 2024. [PDF] [Code]
Ruoyan Pi, Peng Wu, Xiangteng He*, Yuxin Peng, "EOGT: Video Anomaly Detection with Enhanced Object Information and Global Temporal Dependency", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol. 20, No. 10, pp. 1-21, Sep. 2024. [PDF]
Hulingxiao He, Xiangteng He*, Yuxin Peng, Zifei Shan, Xin Su, "Firzen: Firing Strict Cold-Start Items with Frozen Heterogeneous and Homogeneous Graphs for Recommendation", IEEE International Conference on Data Engineering (ICDE), pp. 4657-4670, Utrecht, Netherlands, May 13-16, 2024. [PDF] [Code] [Introduction]
Peng Wu, Jing Liu, Xiangteng He, Yuxin Peng, Peng Wang, Yanning Zhang, "Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model", IEEE Transactions on Image Processing (TIP), Vol. 33, pp. 2213-2225, Mar. 2024. [PDF] [Datasets]
Hongbo Sun, Xiangteng He, Yuxin Peng, "HCL: Hierarchical Consistency Learning for Webly Supervised Fine-Grained Recognition", IEEE Transactions on Multimedia (TMM), Vol. 26, pp. 5108-5119, Mar. 2024. [PDF] [Code] [Introduction]
Yanzhe Chen, Huasong Zhong, Xiangteng He, Yuxin Peng, Jiahuan Zhou, Lele Cheng, "FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval", AAAI Conference on Artificial Intelligence (AAAI), Vol. 38, No. 2, pp. 1228-1236, Vancouver, Canada, February 20 - 27, 2024. [PDF]
Hongbo Sun, Xiangteng He, Jiahuan Zhou, Yuxin Peng, "Fine-Grained Visual Prompt Learning of Vision-Language Models for Image Recognition", ACM Multimedia Conference (ACM MM), pp. 5828–5836, Ottawa, Canada, Oct. 29 - Nov. 3, 2023. [PDF] [Introduction]
Yanzhe Chen, Huasong Zhong, Xiangteng He, Yuxin Peng, Lele Cheng, "Real20M: A Large-scale E-commerce Dataset for Cross-domain Retrieval", ACM Multimedia Conference (ACM MM), pp. 4939–4948, Ottawa, Canada, Oct. 29 - Nov. 3, 2023. [PDF] [Introduction]
Zijun Deng, Xiangteng He, Yuxin Peng, Xiongwei Zhu, Lele Cheng, "MV-Diffusion: Motion-aware Video Diffusion Model", ACM Multimedia Conference (ACM MM), pp. 7255-7263, Ottawa, Canada, Oct. 29 - Nov. 3, 2023. [PDF] [Introduction]
Zijun Deng, Xiangteng He, Yuxin Peng, "Efficiency-optimized Video Diffusion Models", ACM Multimedia Conference (ACM MM), pp. 7295-7303, Ottawa, Canada, Oct. 29 - Nov. 3, 2023. [PDF] [Introduction]
Yulin Pan, Xiangteng He*, Biao Gong, Yiliang Lv, Yujun Shen, Yuxin Peng, Deli Zhao*, "Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos", International Conference on Computer Vision (ICCV), pp. 13767-13777, Paris, France, Oct. 2-6, 2023. [PDF] [SUPP] [Code]
Zijun Deng, Xiangteng He, Yuxin Peng, "LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol. 19, No. 6, pp. 1-18, Jul. 2023. [PDF] [Code] [Introduction]
HsiaoYuan Hsu, Xiangteng He, Yuxin Peng, Hao Kong, Qing Zhang, "PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6018-6026, Vancouver, Canada, Jun. 18-22, 2023. [PDF] [Code] [Dataset] [Introduction]
Duoduo Feng, Xiangteng He, Yuxin Peng, "MKVSE: Multimodal Knowledge Enhanced Visual-Semantic Embedding for Image-Text Retrieval", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol. 19, No. 5, pp. 1–21, Mar. 2023. [PDF] [Code] [Introduction]
Hongbo Sun, Xiangteng He, Yuxin Peng, "SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization", ACM Multimedia Conference (ACM MM), pp. 5853–5861, Lisbon, Portugal, Oct. 10-14, 2022. [PDF] [PPT] [Code]
Xiangteng He#, Yulin Pan#, Mingqian Tang, Yiliang Lv, Yuxin Peng, "Learn from Unlabeled Videos for Near-duplicate Video Retrieval", ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR), pp. 1002-1011, Madrid, Spain, Jul. 11-15, 2022. [PDF] [PPT] [Introduction]
Peng Wu, Xiangteng He*, Mingqian Tang, Yiliang Lv, Jing Liu*, "HANet: Hierarchical Alignment Networks for Video-Text Retrieval", ACM Multimedia Conference (ACM MM), pp. 3518–3527, Chengdu, China, Oct. 20-24, 2021. [PDF] [Code]
Zhen Han, Xiangteng He*, Mingqian Tang, Yiliang Lv, "Video Similarity and Alignment Learning on Partial Video Copy Detection", ACM Multimedia Conference (ACM MM), pp. 4165–4173, Chengdu, China, Oct. 20-24, 2021. [PDF] [Project]
Xiangteng He, Yuxin Peng, "Fine-grained Visual-textual Representation Learning", IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Vol. 30, No. 2, pp. 520-531, Feb. 2020. [PDF]
Xiangteng He, Yuxin Peng, Liu Xie, "A New Benchmark and Approach for Fine-grained Cross-media Retrieval", ACM Multimedia Conference (ACM MM), pp. 1740-1748, Nice, France, Oct. 21-25, 2019. [PDF] [Dataset] [Code] [Introduction]
Xiangteng He, Yuxin Peng, Junjie Zhao, "Which and How Many Regions to Gaze: Focus Discriminative Regions for Fine-grained Visual Categorization", International Journal of Computer Vision (IJCV), Vol. 127, No. 9, pp. 1235-1255, Sep. 2019. [PDF]
Xiangteng He, Yuxin Peng, Junjie Zhao, "Fast Fine-grained Image Classification via Weakly Supervised Discriminative Localization", IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Vol. 29, No. 5, pp. 1394-1407, May 2019. [PDF] [Code]
Xiangteng He, Yuxin Peng, "Only Learn One Sample: Fine-Grained Visual Categorization with One Sample Training", ACM Multimedia Conference (ACM MM), pp. 1372-1380, Seoul, Korea, Oct. 22-26, 2018. [PDF]
Xiangteng He, Yuxin Peng, Junjie Zhao, "StackDRL: Stacked Deep Reinforcement Learning for Fine-grained Visual Categorization", International Joint Conference on Artificial Intelligence (IJCAI), pp. 741-747, Stockholm, Sweden, Jul. 13-19, 2018. [PDF]
Yuxin Peng, Xiangteng He, Junjie Zhao, "Object-Part Attention Model for Fine-grained Image Classification", IEEE Transactions on Image Processing (TIP), Vol. 27, No. 3, pp. 1487-1500, Mar. 2018. [PDF] [Code] (This work was selected as an ESI Highly Cited Paper, having received enough citations to place it in the top 1% of its academic field.)
Xiangteng He, Yuxin Peng, Junjie Zhao, "Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN", ACM Multimedia Conference (ACM MM), pp. 627-635, Mountain View, CA, USA, Oct. 23-27, 2017. [PDF] [Code]
Xiangteng He, Yuxin Peng, "Fine-grained Image Classification via Combining Vision and Language", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5994-6002, Honolulu, Hawaii, USA, Jul. 21-26, 2017. [PDF]
Xiangteng He, Yuxin Peng, "Weakly Supervised Learning of Part Selection Model with Spatial Constraints for Fine-grained Image Classification", AAAI Conference on Artificial Intelligence (AAAI), pp. 4075-4081, San Francisco, California, USA, Feb. 4–9, 2017. [PDF]