Siru Zhong
Ph.D. Student, The Hong Kong University of Science and Technology (Guangzhou), Huawei 2012 Laboratories, Ex-Tencent & XPENG

Greetings! I am a first-year Ph.D. student in Data Science and Analytics at HKUST(GZ), supervised by Prof. Yuxuan Liang and co-supervised by Prof. Yue Yang and Prof. Yangqiu Song. Previously, I obtained my M.Phil. degree from the same university and my B.E. degree in Computer Science from HFUT.

Currently, I am collaborating closely with Dr. Hao Xu from Huawei 2012 Labs and Dr. Qingsong Wen from Squirrel AI. Prior to that, I interned at Autonomous Driving Center of XPENG (supervised by Dr. Cheng Lu) and worked as a full-time Software Engineer at CloudDev Center of Tencent from 2022 to 2023.

My research focuses on Spatio-Temporal Modeling, Foundation Models, and Multimodal Time Series Analysis, I aim to build lightweight, adaptive, and practical spatio-temporal systems. I am passionate about translating cutting-edge research into deployable solutions that address real-world challenges.

Feel free to reach out via email if you're interested in discussing research ideas or exploring potential collaborations!

Education
  • Hong Kong University of Science and Technology (GZ)
    Hong Kong University of Science and Technology (GZ)
    Ph.D. in Data Science and Analytics
    Feb. 2025 - Jan. 2028
  • Hong Kong University of Science and Technology (GZ)
    Hong Kong University of Science and Technology (GZ)
    M.Phil. in Data Science and Analytics
    Aug. 2023 - Jan. 2025
  • Hefei University of Technology
    Hefei University of Technology
    B.Eng. in Computer Science and Information Engineering
    Sep. 2018 - Jul. 2022
Experience
  • 2012 Laboratories, Huawei
    2012 Laboratories, Huawei
    Research Intern, Spatio-Temporal Foundation Models
    Feb. 2025 - Now
  • Autonomous Driving Center, XPENG
    Autonomous Driving Center, XPENG
    Research Intern, Multimodal Image-Text Perception
    May. 2024 - Aug. 2024
  • CloudDev Center, Tencent
    CloudDev Center, Tencent
    Software Engineer & Intern, Cloud Intelligent IDE
    Jun. 2021 - May. 2023
News
2025
One paper on Traffic Flow Forecasting was accepted to TITS 2025.
Aug 20
I was invited to serve as a Program Committee member for AAAI 2026 (Special Track on AI for Social Impact).
Aug 08
I was invited to serve as a Program Committee member for AAAI 2026.
Jul 26
I was invited to serve as a reviewer for KDD 2026 Datasets and Benchmarks Track.
Jul 25
I was invited to serve as a reviewer for ACM MM 2025 Dataset Track.
Jul 23
One paper on Multimodal Building Electricity Loads Forecasting was accepted to ACM MM 2025.
Jul 05
One paper on Urban Heat Island Effect Forecasting was accepted to KDD 2025.
May 15
One paper on Multimodal Time Series Forecasting was accepted to ICML 2025.
May 01
I was invited to serve as a reviewer for ACM MM 2025.
Apr 26
One tutorial on Multimodal Learning for Spatio-Temporal Data Mining was accepted to ACM MM 2025 Tutorials.
Feb 28
I started to cooperate with Huawei 2012 Labs, leading the project of Spatio-temporal knowledge fusion.
Feb 06
I started my PhD studies at the Hong Kong University of Science and Technology (Guangzhou).
Feb 05
I received the Runner-Up Prize in the 2024 HKUST(GZ) DSA Excellent Research Award.
Jan 27
2024
Two papers on Urban Indicator Prediction and Air Quality Inference were accepted to AAAI 2025.
Dec 10
I successfully defended my MPhil thesis in Data Science and Analytics at HKUST(GZ).
Nov 25
I was invited to serve as a reviewer for ICLR 2025.
Aug 23
One paper on Urban Multimodal Image-Text Retrieval was accepted to ACM MM 2024.
Jul 18
I joined XPENG as a Research Intern focusing on multimodal research in autonomous driving scenarios.
May 14
Two papers on Spatio-Temporal Prediction and Neural Networks were accepted to IJCAI 2024.
May 14
One paper on LLM-Enhanced Urban Region Profiling was accepted to WWW 2024.
Jan 23
2023
I started my MPhil studies at the Hong Kong University of Science and Technology (Guangzhou).
Aug 24
2022
I officially joined Tencent as a Software Engineer!
Jul 04
I graduated from Hefei University of Technology with Outstanding Thesis Award.
Jun 05
2021
I completed my three-month internship at Tencent and received a SP return offer.
Sep 14
Publications (view all )
Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting
Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting

Siru Zhong, Weilin Ruan, Min Jin, Huan Li, Qingsong Wen, Yuxuan Liang

International Conference on Machine Learning (ICML 2025), Vancouver, Canada

Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting
Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting

Siru Zhong, Weilin Ruan, Min Jin, Huan Li, Qingsong Wen, Yuxuan Liang

International Conference on Machine Learning (ICML 2025), Vancouver, Canada

Multimodal Learning for Spatio-Temporal Data Mining
Multimodal Learning for Spatio-Temporal Data Mining

Siru Zhong, Xixuan Hao, Hao Miao, Yan Zhao, Oingsong Wen, Roger Zimmermann, Yuxuan Liang

ACM International Conference on Multimedia Tutorial (ACM MM 2025), Dublin, Ireland

Multimodal Learning for Spatio-Temporal Data Mining
Multimodal Learning for Spatio-Temporal Data Mining

Siru Zhong, Xixuan Hao, Hao Miao, Yan Zhao, Oingsong Wen, Roger Zimmermann, Yuxuan Liang

ACM International Conference on Multimedia Tutorial (ACM MM 2025), Dublin, Ireland

UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation
UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation

Siru Zhong, Xixuan Hao, Yibo Yan, Ying Zhang, Yangqiu Song, Yuxuan Liang

ACM International Conference on Multimedia (ACM MM 2024), Melbourne, Australia

UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation
UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation

Siru Zhong, Xixuan Hao, Yibo Yan, Ying Zhang, Yangqiu Song, Yuxuan Liang

ACM International Conference on Multimedia (ACM MM 2024), Melbourne, Australia

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting
Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting

Weilin Ruan, Wenzhuo Wang, Siru Zhong, Wei Chen, Li Liu, Yuxuan Liang

IEEE Transactions on Intelligent Transportation Systems (TITS, 2025)

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting
Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting

Weilin Ruan, Wenzhuo Wang, Siru Zhong, Wei Chen, Li Liu, Yuxuan Liang

IEEE Transactions on Intelligent Transportation Systems (TITS, 2025)

Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data
Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data

Yongzheng Liu, Siru Zhong, Gefeng Luo, Weilin Ruan, Yuxuan Liang

ACM International Conference on Multimedia (ACM MM 2025), Dublin, Ireland

Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data
Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data

Yongzheng Liu, Siru Zhong, Gefeng Luo, Weilin Ruan, Yuxuan Liang

ACM International Conference on Multimedia (ACM MM 2025), Dublin, Ireland

AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks
AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks

Qiongyan WANG, Yutong Xia, Siru Zhong, Weichuang Li, Yuankai Wu, Shi Fen Cheng, Junbo Zhang, Yu Zheng, Yuxuan Liang

AAAI Conference on Artificial Intelligence (AAAI 2025), Philadelphia, America

AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks
AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks

Qiongyan WANG, Yutong Xia, Siru Zhong, Weichuang Li, Yuankai Wu, Shi Fen Cheng, Junbo Zhang, Yu Zheng, Yuxuan Liang

AAAI Conference on Artificial Intelligence (AAAI 2025), Philadelphia, America

UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Model for Urban Indicator Prediction
UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Model for Urban Indicator Prediction

Xixuan Hao, Wei Chen, Yibo Yan, Siru Zhong, Kun Wang, Qingsong Wen, Yuxuan Liang

AAAI Conference on Artificial Intelligence (AAAI 2025), Philadelphia, America

UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Model for Urban Indicator Prediction
UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Model for Urban Indicator Prediction

Xixuan Hao, Wei Chen, Yibo Yan, Siru Zhong, Kun Wang, Qingsong Wen, Yuxuan Liang

AAAI Conference on Artificial Intelligence (AAAI 2025), Philadelphia, America

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach
Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Huaiwu Zhang, Yutong Xia, Siru Zhong, Kun Wang, Zekun Tong, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

International Joint Conference on Artificial Intelligence (IJCAI 2024), Jeju Island, South Korea

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach
Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Huaiwu Zhang, Yutong Xia, Siru Zhong, Kun Wang, Zekun Tong, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

International Joint Conference on Artificial Intelligence (IJCAI 2024), Jeju Island, South Korea

Spatio-Temporal Field Neural Networks for Air Quality Inference
Spatio-Temporal Field Neural Networks for Air Quality Inference

Yutong Feng, Qiongyan Wang, Yutong Xia, Junlin Huang, Siru Zhong, Kun Wang, Shifen Cheng, Yuxuan Liang

International Joint Conference on Artificial Intelligence (IJCAI 2024), Jeju Island, South Korea

Spatio-Temporal Field Neural Networks for Air Quality Inference
Spatio-Temporal Field Neural Networks for Air Quality Inference

Yutong Feng, Qiongyan Wang, Yutong Xia, Junlin Huang, Siru Zhong, Kun Wang, Shifen Cheng, Yuxuan Liang

International Joint Conference on Artificial Intelligence (IJCAI 2024), Jeju Island, South Korea

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web
UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web

Yibo Yan, Haomin Wen, Siru Zhong, Wei Chen, Haodong Chen, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

International World Wide Web Conference (WWW 2024), Singapore

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web
UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web

Yibo Yan, Haomin Wen, Siru Zhong, Wei Chen, Haodong Chen, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

International World Wide Web Conference (WWW 2024), Singapore

Awards
  • Runner-Up Prize for DSA Excellent Research Award, HKUST(GZ)
    2024
  • Best Project Award (1st/15) in Data Science Computing, HKUST(GZ)
    2023
  • Outstanding Student (Top 10%) in Red Bird Summer Camp, HKUST(GZ)
    2023
  • iCode Certification of R&D Engineering Competency Evaluation, Tencent
    2022
  • Silver Award (2nd/12) in Code World Program, Tencent
    2022
  • Outstanding Student Award (8th/100) in New Employee Training, Tencent
    2022
  • Outstanding Graduation Thesis Award (Top 2%), HFUT
    2022
  • First Prize (Top 10) in CSDN Technology Blogger Competition
    2020
Activities
  • International Conference on Machine Learning attendee, Vancouver, Canada
    2025
  • Greater Bay Area Science Forum, Guangzhou, China
    2024
  • ACM Multimedia Conference attendee, Melbourne, Australia
    2024
  • China National Computer Conference attendee, Yiwu, China
    2024
  • HKUST-GZ System Hub Welcome Party performer, Guangzhou, China
    2023
  • Tencent New Year Gala performer, Shenzhen, China
    2023
  • HFUT Chorus member, Hefei, China
    2018-2022
  • HFUT External Relations Department member, Hefei, China
    2018-2019
Service
  • Program Committee of AAAI 2026 (Main Track & AISI Track)
  • Reviewer of ACM MM 2025 (Main Track & Dataset Track)
  • Reviewer of KDD 2026 Datasets and Benchmarks Track
  • Reviewer of ICLR 2025
  • Web Master of WebST 2025 & UrbComp 2025
Teaching
  • DSAA2043 Design and Analysis of Algorithms
    Fall 2025
  • PLED5001 Communicating Research in English
    Spring 2025
  • PDEV6800 Introduction to Teaching and Learning in Higher Education
    Fall 2024