The University of British Columbia (UBC)
Email: xiangteng.he AT ubc.ca
Google Scholar
I am a Postdoctoral Researcher in the Department of Computer Science at UBC, working with Prof. Leonid Sigal. Previously, I served as an Assistant Research Professor at Peking University. Prior to academia, I spent a wonderful year at Alibaba DAMO Academy as a Senior Algorithm Engineer. I received my Ph.D. degree from Peking University and my B.E. degree from Nankai University. My research interests include fine-grained multi-modal analysis, vision-language models, and their applications (e.g., medical imaging).
I am looking for strong graduate and undergraduate students to collaborate with. Please reach out if you are interested.
Yulin Pan, Xiangteng He*, Chaojie Mao, Zhen Han, Zeyinzi Jiang, Jingfeng Zhang, Yu Liu, "ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing", International Conference on Computer Vision (ICCV), Honolulu, Hawai'i, USA, October 19 – 23, 2025. [Project] [PDF]
Zhaoda Ye, Xiangteng He, Yuxin Peng, "RaT2IGen: Relation-aware Text-to-image Generation via Learnable Prompt", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol. 21, Issue 5, Article No. 151, pp. 1-19, May 2025. [PDF]
Peng Wu, Wanshun Su, Xiangteng He, Peng Wang, Yanning Zhang, "VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval", AAAI Conference on Artificial Intelligence (AAAI), Vol. 39, No. 8, pp. 8423-8431, Philadelphia, Pennsylvania, USA, February 25 – March 4, 2025. [PDF]
Hongbo Sun, Xiangteng He, Jinglin Xu, Yuxin Peng, "SIM-OFE: Structure Information Mining and Object-aware Feature Enhancement for Fine-Grained Visual Categorization", IEEE Transactions on Image Processing (TIP), Vol. 33, pp. 5312-5326, Sep. 2024. [PDF]
Hongbo Sun, Jiahuan Zhou, Xiangteng He, Jinglin Xu, Yuxin Peng, "FineFMPL: Fine-grained Feature Mining Prompt Learning for Few-Shot Class Incremental Learning", International Joint Conference on Artificial Intelligence (IJCAI), pp. 1299-1307, Jeju, South Korea, August 3-9, 2024. [PDF] [Code]
Ruoyan Pi, Peng Wu, Xiangteng He*, Yuxin Peng, "EOGT: Video Anomaly Detection with Enhanced Object Information and Global Temporal Dependency", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol. 20, Issue 10, Article No. 320, pp. 1-21, Sep. 2024. [PDF]
Hulingxiao He, Xiangteng He*, Yuxin Peng, Zifei Shan, Xin Su, "Firzen: Firing Strict Cold-Start Items with Frozen Heterogeneous and Homogeneous Graphs for Recommendation", IEEE International Conference on Data Engineering (ICDE), pp. 4657-4670, Utrecht, Netherlands, May 13-16, 2024. [PDF] [Code] [Introduction]
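Zijun Deng, Xiangteng He, Yuxin Peng, "Text-to-Video Generation: Research Status, Progress, and Challenges" (in Chinese), Acta Electronica Sinica, Vol. 46, pp. 1632-1644, May 2024. [PDF]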
Peng Wu, Jing Liu, Xiangteng He, Yuxin Peng, Peng Wang, Yanning Zhang, "Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model", IEEE Transactions on Image Processing (TIP), Vol. 33, pp. 2213-2225, Mar. 2024. [PDF] [Datasets]
Hongbo Sun, Xiangteng He, Yuxin Peng, "HCL: Hierarchical Consistency Learning for Webly Supervised Fine-Grained Recognition", IEEE Transactions on Multimedia (TMM), Vol.26, pp. 5108-5119, Mar. 2024. [PDF] [Code] [Introduction]
Yanzhe Chen, Huasong Zhong, Xiangteng He, Yuxin Peng, Jiahuan Zhou, Lele Cheng, "FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval", AAAI Conference on Artificial Intelligence (AAAI), Vol. 38, No. 2, pp. 1228-1236, Vancouver, Canada, February 20 - 27, 2024. [PDF] [Code]
Hongbo Sun, Xiangteng He, Jiahuan Zhou, Yuxin Peng, "Fine-Grained Visual Prompt Learning of Vision-Language Models for Image Recognition", ACM Multimedia Conference (ACM MM), pp. 5828–5836, Ottawa, Canada, Oct.29 - Nov.3, 2023. [PDF] [Introduction]
Yanzhe Chen, Huasong Zhong, Xiangteng He, Yuxin Peng, Lele Cheng, "Real20M: A Large-scale E-commerce Dataset for Cross-domain Retrieval", ACM Multimedia Conference (ACM MM), pp. 4939–4948, Ottawa, Canada, Oct.29 - Nov.3, 2023. [PDF] [Code] [Dataset] [Introduction]
Zijun Deng, Xiangteng He, Yuxin Peng, Xiongwei Zhu, Lele Cheng, "MV-Diffusion: Motion-aware Video Diffusion Model", ACM Multimedia Conference (ACM MM), pp. 7255-7263, Ottawa, Canada, Oct.29 - Nov.3, 2023. [PDF] [Introduction]
Zijun Deng, Xiangteng He, Yuxin Peng, "Efficiency-optimized Video Diffusion Models", ACM Multimedia Conference (ACM MM), pp. 7295-7303, Ottawa, Canada, Oct.29 - Nov.3, 2023. [PDF] [Introduction]
Yulin Pan, Xiangteng He*, Biao Gong, Yiliang Lv, Yujun Shen, Yuxin Peng, Deli Zhao*, "Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos", International Conference on Computer Vision (ICCV), pp. 13767-13777, Paris, France, Oct. 2-6, 2023. [PDF] [SUPP] [Code]
HsiaoYuan Hsu, Xiangteng He, Yuxin Peng, "DensityLayout: Density-conditioned Layout GAN for Visual-textual Presentation Designs", International Conference on Image and Graphics (ICIG), pp. 187–199, Nanjing, China, Sep.22 - 24, 2023. [PDF]
Zijun Deng, Xiangteng He, Yuxin Peng, "LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol. 19, Issue 6, Article No. 207, pp. 1-18, Jul. 2023. [PDF] [Code] [Introduction]
HsiaoYuan Hsu, Xiangteng He, Yuxin Peng, Hao Kong, Qing Zhang, "PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6018-6026, Vancouver, Canada, Jun. 18-22, 2023. [PDF] [Code] [Dataset] [Introduction]
Duoduo Feng, Xiangteng He, Yuxin Peng, "MKVSE: Multimodal Knowledge Enhanced Visual-Semantic Embedding for Image-Text Retrieval", ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Vol. 19, Issue 5, Article No. 162, pp. 1–21, Mar. 2023. [PDF] [Code] [Introduction]
Ruoyan Pi, Xiangteng He, Yuxin Peng, "Weakly Supervised Video Anomaly Detection with Temporal and Abnormal Information", Chinese Conference on Pattern Recognition and Computer Vision (PRCV), pp. 594–608, Shenzhen, China, Nov. 4-7, 2022. [PDF]
Hongbo Sun, Xiangteng He, Yuxin Peng, "SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization", ACM Multimedia Conference (ACM MM), pp. 5853–5861, Lisbon, Portugal, Oct. 10-14, 2022. [PDF] [PPT] [Code]
Zhaoda Ye, Xiangteng He, Yuxin Peng, "Unsupervised Cross-media Hashing Learning via Knowledge Graph", Chinese Journal of Electronics (CJE), Vol. 31, No. 6, pp. 1081–1091, Oct. 2022. [PDF]
Xiangteng He#, Yulin Pan#, Mingqian Tang, Yiliang Lv, Yuxin Peng, "Learn from Unlabeled Videos for Near-duplicate Video Retrieval", ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR), pp. 1002-1011, Madrid, Spain, Jul. 11-15, 2022. [PDF] [PPT] [Introduction]
Peng Wu, Xiangteng He*, Mingqian Tang, Yiliang Lv, Jing Liu*, "HANet: Hierarchical Alignment Networks for Video-Text Retrieval", ACM Multimedia Conference (ACM MM), pp. 3518–3527, Chengdu, China, Oct. 20-24, 2021. [PDF] [Code]
Zhen Han, Xiangteng He*, Mingqian Tang, Yiliang Lv, "Video Similarity and Alignment Learning on Partial Video Copy Detection", ACM Multimedia Conference (ACM MM), pp. 4165–4173, Chengdu, China, Oct. 20-24, 2021. [PDF] [Project]
Junjie Zhao, Xiangteng He, Yuxin Peng, "Attribute Hierarchy based Multi-task Learning for Fine-grained Image Classification", Neurocomputing, Vol. 395, pp. 150-159, Jun. 2020. [PDF]
Xiangteng He, Yuxin Peng, "Fine-grained Visual-textual Representation Learning", IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Vol. 30, No. 2, pp. 520-531, Feb. 2020. [PDF]
Xiangteng He, Yuxin Peng, Liu Xie, "A New Benchmark and Approach for Fine-grained Cross-media Retrieval", ACM Multimedia Conference (ACM MM), pp. 1740-1748, Nice, France, Oct. 21-25, 2019. [PDF] [Dataset] [Code] [Introduction]
Xiangteng He, Yuxin Peng, Junjie Zhao, "Which and How Many Regions to Gaze: Focus Discriminative Regions for Fine-grained Visual Categorization", International Journal of Computer Vision (IJCV), Vol. 127, No. 9, pp. 1235-1255, Sep. 2019. [PDF]
Xiangteng He, Yuxin Peng, Junjie Zhao, "Fast Fine-grained Image Classification via Weakly Supervised Discriminative Localization", IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Vol. 29, No. 5, pp. 1394-1407, May 2019. [PDF] [Code]
Xiangteng He, Yuxin Peng, "Multi-attention Guided Activation Propagation in CNNs", Chinese Conference on Pattern Recognition and Computer Vision (PRCV), pp. 16-27, Guangzhou, China, Nov. 23-26, 2018. [PDF]
Xiangteng He, Yuxin Peng, "Only Learn One Sample: Fine-Grained Visual Categorization with One Sample Training", ACM Multimedia Conference (ACM MM), pp. 1372-1380, Seoul, Korea, Oct. 22-26, 2018. [PDF]
Xiangteng He, Yuxin Peng, Junjie Zhao, "StackDRL: Stacked Deep Reinforcement Learning for Fine-grained Visual Categorization", International Joint Conference on Artificial Intelligence (IJCAI), pp. 741-747, Stockholm, Sweden, Jul. 13-19, 2018. [PDF]
Yuxin Peng, Xiangteng He, Junjie Zhao, "Object-Part Attention Model for Fine-grained Image Classification", IEEE Transactions on Image Processing (TIP), Vol. 27, No. 3, pp. 1487-1500, Mar. 2018. [PDF] [Code]
Xiangteng He, Yuxin Peng, Junjie Zhao, "Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN", ACM Multimedia Conference (ACM MM), pp. 627-635, Mountain View, CA, USA, Oct. 23-27, 2017. [PDF] [Code]
Xiangteng He, Yuxin Peng, "Fine-grained Image Classification via Combining Vision and Language", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5994-6002, Honolulu, Hawaii, USA, Jul. 21-26, 2017. [PDF]
Xiangteng He, Yuxin Peng, "Weakly Supervised Learning of Part Selection Model with Spatial Constraints for Fine-grained Image Classification", AAAI Conference on Artificial Intelligence (AAAI), pp. 4075-4081, San Francisco, California, USA, Feb. 4–9, 2017. [PDF]