About Me
Hi, I am Xiaokun Feng (丰效坤)! I'm a Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA), supervised by Prof. Kaiqi Huang (IAPR Fellow). I'm also a member of the Visual Intelligence Interest Group (VIIG).
Currently, my research focuses on object tracking, with a particular emphasis on the visual-language tracking task. If you are intrigued by my work or wish to collaborate, feel free to reach out to me.
News
- 2024.04: We will present our TPAMI 2023 work (Global Instance Tracking) at the VALSE 2024 poster session (May 2024, Chongqing, China). We warmly invite colleagues interested in visual object/language tracking, evaluation methodologies, and human-computer interaction to join the discussion (see our Poster for more information).
- 2024.04: One paper has been accepted by the 3rd CVPR Workshop on Vision Datasets Understanding and DataCV Challenge as an oral presentation (CVPRW, Workshop in CCF-A Conference, Oral)!
- 2023.09: One paper has been accepted by the 37th Conference on Neural Information Processing Systems (NeurIPS, CCF-A Conference, Poster)!
- 2022.04: Named a Beijing Outstanding Graduate (北京市优秀毕业生)!
- 2021.12: Awarded the China National Scholarship (国家奖学金) (the highest honor for undergraduates in China, awarded to the top 1% of students at BIT)!
- 2020.12: Awarded the China National Scholarship (国家奖学金) (the highest honor for undergraduates in China, awarded to the top 1% of students at BIT)!
Research Interests
Visual Language Tracking (VLT)
- Investigating multi-modal tracking to address challenges related to integrating visual and linguistic information, thereby improving tracking accuracy.
- Exploring the extension of VLT research to tasks involving comprehensive video understanding, aiming to interpret and contextualize objects in videos based on linguistic input.
- Utilizing Large Language Models (LLMs) in conjunction with visual language tracking to explore human-computer interaction patterns, contributing to the development of more intuitive and user-friendly interactions.
Visual Object Tracking (VOT)
- Researching visual object tracking algorithms across diverse scenes to enhance single object tracking performance in various scenarios.
- Exploring the robustness and generalization aspects of single object tracking algorithms to ensure consistent and reliable performance across diverse scenarios.
Education
2018.09 - 2022.06, undergraduate study, ranked 5/381 (top 1.3%)
School of Information and Electronics
Beijing Institute of Technology, Beijing
Research Experience
- 2022.09 - Present: Pursuing a Ph.D. degree at the Institute of Automation, Chinese Academy of Sciences (CASIA), conducting research on single-object tracking in the Visual Intelligence Interest Group (VIIG), initiated and organized by Dr. Shiyu Hu.
Publications
Accepted
DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM
Xuchen Li, Xiaokun Feng, Shiyu Hu, Meiqi Wu, Dailing Zhang, Jing Zhang, Kaiqi Huang
CVPRW 2024 Oral (Workshop in CCF-A Conference, Oral): the 3rd CVPR Workshop on Vision Datasets Understanding and DataCV Challenge
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship
Shiyu Hu, Dailing Zhang, Meiqi Wu, Xiaokun Feng, Xuchen Li, Xin Zhao, Kaiqi Huang
NeurIPS 2023 (CCF-A Conference, Poster): the 37th Conference on Neural Information Processing Systems
[Paper] [BibTeX] [Poster] [Slides] [Platform] [Toolkit] [Dataset]
Ongoing Research
Remembering Target More Like Humans: A Robust Visual-Language Tracker with Adaptive Prompts
Xiaokun Feng, Xuchen Li, Shiyu Hu, Dailing Zhang, Meiqi Wu, Xiaotang Chen, Kaiqi Huang
NeurIPS 2024 (CCF-A Conference, In Preparation): the 38th Conference on Neural Information Processing Systems
Honors and Awards
- China National Scholarship (国家奖学金), at BIT, by Ministry of Education of China, 2021
- China National Scholarship (国家奖学金), at BIT, by Ministry of Education of China, 2020
- Beijing Outstanding Graduates (北京市优秀毕业生), at BIT, by Beijing Municipal Education Commission, 2022
- China National Encouragement Scholarship, at BIT, by Ministry of Education of China, 2019
Collaborators
I am honored to collaborate with these outstanding researchers. We engage in close discussions concerning various fields such as computer vision, AI4Science, and human-computer interaction. If you are also interested in these areas, please feel free to contact me.
- Shiyu Hu, Ph.D. at the Institute of Automation, Chinese Academy of Sciences (CASIA) and University of Chinese Academy of Sciences (UCAS), focusing on visual object tracking, visual language tracking, benchmark construction, intelligent evaluation techniques, and AI4Science.
- Meiqi Wu, Ph.D. student at the University of Chinese Academy of Sciences (UCAS), focusing on computer vision, intelligent evaluation techniques, and human-computer interaction.
- Yiping Ma, Ph.D. student at East China Normal University (ECNU), focusing on intelligent education techniques and human-computer interaction.
- Yaxuan Kang, design researcher, research assistant and interaction designer at the Institute of Automation, Chinese Academy of Sciences (CASIA), focusing on human-computer interaction.
- Jing Zhang, research assistant at the Institute of Automation, Chinese Academy of Sciences (CASIA), focusing on computer vision and AI4Science.
- Xuchen Li, incoming Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA), focusing on visual object tracking, visual language tracking, and AI4Science.
- Dailing Zhang, Ph.D. student at the Institute of Automation, Chinese Academy of Sciences (CASIA), focusing on visual object tracking, visual language tracking, and AI4Science.