Homepage - Data Science & Analytics

Siru Zhong

Ph.D. Student, The Hong Kong University of Science and Technology (Guangzhou), Huawei 2012 Laboratories, Ex-Tencent & XPENG

Greetings! I am a first-year Ph.D. student in Data Science and Analytics at HKUST(GZ), supervised by Prof. Yuxuan Liang and co-supervised by Prof. Yue Yang and Prof. Yangqiu Song. Previously, I obtained my M.Phil. degree from the same university and my B.E. degree in Computer Science from HFUT.

Currently, I am collaborating closely with Dr. Hao Xu from Huawei 2012 Labs and Dr. Qingsong Wen from Squirrel AI. Prior to that, I interned at Autonomous Driving Center of XPENG (supervised by Dr. Cheng Lu) and worked as a full-time Software Engineer at CloudDev Center of Tencent from 2022 to 2023.

My research focuses on Spatio-Temporal Modeling, Foundation Models, and Multimodal Time Series Analysis, I aim to build lightweight, adaptive, and practical spatio-temporal systems. I am passionate about translating cutting-edge research into deployable solutions that address real-world challenges.

Feel free to reach out via email if you're interested in discussing research ideas or exploring potential collaborations!

Education

Hong Kong University of Science and Technology (GZ)

Ph.D. in Data Science and Analytics

Feb. 2025 - Jan. 2028
Hong Kong University of Science and Technology (GZ)

M.Phil. in Data Science and Analytics

Aug. 2023 - Jan. 2025
Hefei University of Technology

B.Eng. in Computer Science and Information Engineering

Sep. 2018 - Jul. 2022

Experience

2012 Laboratories, Huawei

Research Intern, Spatio-Temporal Foundation Models

Feb. 2025 - Now
Autonomous Driving Center, XPENG

Research Intern, Multimodal Image-Text Perception

May. 2024 - Aug. 2024
CloudDev Center, Tencent

Software Engineer & Intern, Cloud Intelligent IDE

Jun. 2021 - May. 2023

News

2025

One paper on Spatio-Temporal Foundation Model was accepted to NeurIPS 2025 (spotlight).

Sep 18

I was invited to serve as a reviewer for ICLR 2026.

Sep 03

One paper on Traffic Flow Forecasting was accepted to TITS 2025.

Aug 20

I was invited to serve as a Program Committee member for AAAI 2026 (Special Track on AI for Social Impact).

Aug 08

I was invited to serve as a Program Committee member for AAAI 2026.

Jul 26

I was invited to serve as a reviewer for KDD 2026 Datasets and Benchmarks Track.

Jul 25

I was invited to serve as a reviewer for MM 2025 Dataset Track.

Jul 23

One paper on Multimodal Building Electricity Loads Forecasting was accepted to MM 2025.

Jul 05

One paper on Urban Heat Island Effect Forecasting was accepted to KDD 2025.

May 15

One paper on Multimodal Time Series Forecasting was accepted to ICML 2025.

May 01

I was invited to serve as a reviewer for MM 2025.

Apr 26

One tutorial on Multimodal Learning for Spatio-Temporal Data Mining was accepted to MM 2025 Tutorials.

Feb 28

I started to cooperate with Huawei 2012 Labs, leading the project of Spatio-temporal knowledge fusion.

Feb 06

I started my PhD studies at the Hong Kong University of Science and Technology (Guangzhou).

Feb 05

I received the Runner-Up Prize in the 2024 HKUST(GZ) DSA Excellent Research Award.

Jan 27

2024

Two papers on Urban Indicator Prediction and Air Quality Inference were accepted to AAAI 2025.

Dec 10

I successfully defended my MPhil thesis in Data Science and Analytics at HKUST(GZ).

Nov 25

I was invited to serve as a reviewer for ICLR 2025.

Aug 23

One paper on Urban Multimodal Image-Text Retrieval was accepted to MM 2024.

Jul 18

I joined XPENG as a Research Intern focusing on multimodal research in autonomous driving scenarios.

May 14

Two papers on Spatio-Temporal Prediction and Neural Networks were accepted to IJCAI 2024.

May 14

One paper on LLM-Enhanced Urban Region Profiling was accepted to WWW 2024.

Jan 23

2023

I started my MPhil studies at the Hong Kong University of Science and Technology (Guangzhou).

Aug 24

2022

I officially joined Tencent as a Software Engineer!

Jul 04

I graduated from Hefei University of Technology with Outstanding Thesis Award.

Jun 05

2021

I completed my three-month internship at Tencent and received a SP return offer.

Sep 14

Publications (view all )

Learning to Factorize Spatio-Temporal Foundation Models

Siru Zhong, Junjie Qiu, Yangyu Wu, Xingchen Zou, Zhongwen Rao, Bin Yang, Chenjuan Guo, Hao Xu, Yuxuan Liang

NeurIPS 2025, Santiago, America Spotlight

Learning to Factorize Spatio-Temporal Foundation Models

Siru Zhong, Junjie Qiu, Yangyu Wu, Xingchen Zou, Zhongwen Rao, Bin Yang, Chenjuan Guo, Hao Xu, Yuxuan Liang

NeurIPS 2025, Santiago, America Spotlight

Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting

Siru Zhong, Weilin Ruan, Min Jin, Huan Li, Qingsong Wen, Yuxuan Liang

ICML 2025, Vancouver, Canada

[Code] [Paper]

Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting

Siru Zhong, Weilin Ruan, Min Jin, Huan Li, Qingsong Wen, Yuxuan Liang

ICML 2025, Vancouver, Canada

[Code] [Paper]

Multimodal Learning for Spatio-Temporal Data Mining

Siru Zhong, Xixuan Hao, Hao Miao, Yan Zhao, Oingsong Wen, Roger Zimmermann, Yuxuan Liang

MM 2025 Tutorial, Dublin, Ireland

[Code] [Paper]

Multimodal Learning for Spatio-Temporal Data Mining

Siru Zhong, Xixuan Hao, Hao Miao, Yan Zhao, Oingsong Wen, Roger Zimmermann, Yuxuan Liang

MM 2025 Tutorial, Dublin, Ireland

[Code] [Paper]

UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation

Siru Zhong, Xixuan Hao, Yibo Yan, Ying Zhang, Yangqiu Song, Yuxuan Liang

MM 2024, Melbourne, Australia

[Code] [Paper]

UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation

Siru Zhong, Xixuan Hao, Yibo Yan, Ying Zhang, Yangqiu Song, Yuxuan Liang

MM 2024, Melbourne, Australia

[Code] [Paper]

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting

Weilin Ruan, Wenzhuo Wang, Siru Zhong, Wei Chen, Li Liu, Yuxuan Liang

TITS, 2025

[Paper] [Code]

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting

Weilin Ruan, Wenzhuo Wang, Siru Zhong, Wei Chen, Li Liu, Yuxuan Liang

TITS, 2025

[Paper] [Code]

Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data

Yongzheng Liu, Siru Zhong, Gefeng Luo, Weilin Ruan, Yuxuan Liang

MM 2025, Dublin, Ireland

[Paper]

Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data

Yongzheng Liu, Siru Zhong, Gefeng Luo, Weilin Ruan, Yuxuan Liang

MM 2025, Dublin, Ireland

[Paper]

AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks

Qiongyan WANG, Yutong Xia, Siru Zhong, Weichuang Li, Yuankai Wu, Shi Fen Cheng, Junbo Zhang, Yu Zheng, Yuxuan Liang

AAAI 2025, Philadelphia, America

[Code] [Paper]

AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks

Qiongyan WANG, Yutong Xia, Siru Zhong, Weichuang Li, Yuankai Wu, Shi Fen Cheng, Junbo Zhang, Yu Zheng, Yuxuan Liang

AAAI 2025, Philadelphia, America

[Code] [Paper]

UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Model for Urban Indicator Prediction

Xixuan Hao, Wei Chen, Yibo Yan, Siru Zhong, Kun Wang, Qingsong Wen, Yuxuan Liang

AAAI 2025, Philadelphia, America

[Code] [Paper]

UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Model for Urban Indicator Prediction

Xixuan Hao, Wei Chen, Yibo Yan, Siru Zhong, Kun Wang, Qingsong Wen, Yuxuan Liang

AAAI 2025, Philadelphia, America

[Code] [Paper]

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Huaiwu Zhang, Yutong Xia, Siru Zhong, Kun Wang, Zekun Tong, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

IJCAI 2024, Jeju Island, South Korea

[Code] [Paper] [Dataset]

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Huaiwu Zhang, Yutong Xia, Siru Zhong, Kun Wang, Zekun Tong, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

IJCAI 2024, Jeju Island, South Korea

[Code] [Paper] [Dataset]

Spatio-Temporal Field Neural Networks for Air Quality Inference

Yutong Feng, Qiongyan Wang, Yutong Xia, Junlin Huang, Siru Zhong, Kun Wang, Shifen Cheng, Yuxuan Liang

IJCAI 2024, Jeju Island, South Korea

[Code] [Paper]

Spatio-Temporal Field Neural Networks for Air Quality Inference

Yutong Feng, Qiongyan Wang, Yutong Xia, Junlin Huang, Siru Zhong, Kun Wang, Shifen Cheng, Yuxuan Liang

IJCAI 2024, Jeju Island, South Korea

[Code] [Paper]

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web

Yibo Yan, Haomin Wen, Siru Zhong, Wei Chen, Haodong Chen, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

WWW 2024, Singapore Oral

[Code] [Paper]

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web

Yibo Yan, Haomin Wen, Siru Zhong, Wei Chen, Haodong Chen, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

WWW 2024, Singapore Oral

[Code] [Paper]

Awards

Runner-Up Prize for DSA Excellent Research Award, HKUST(GZ)

2024
Best Project Award (1st/15) in Data Science Computing, HKUST(GZ)

2023
Outstanding Student (Top 10%) in Red Bird Summer Camp, HKUST(GZ)

2023
iCode Certification of R&D Engineering Competency Evaluation, Tencent

2022
Silver Award (2nd/12) in Code World Program, Tencent

2022
Outstanding Student Award (8th/100) in New Employee Training, Tencent

2022
Outstanding Graduation Thesis Award (Top 2%), HFUT

2022
First Prize (Top 10) in CSDN Technology Blogger Competition

2020

Activities

International Conference on Machine Learning attendee, Vancouver, Canada

2025
Greater Bay Area Science Forum, Guangzhou, China

2024
ACM Multimedia Conference attendee, Melbourne, Australia

2024
China National Computer Conference attendee, Yiwu, China

2024
HKUST-GZ System Hub Welcome Party performer, Guangzhou, China

2023
Tencent New Year Gala performer, Shenzhen, China

2023
HFUT Chorus member, Hefei, China

2018-2022
HFUT External Relations Department member, Hefei, China

2018-2019

Service

Program Committee of AAAI 2026 (Main Track & AISI Track)
Reviewer of MM 2025 (Main Track & Dataset Track)
Reviewer of KDD 2026 Datasets and Benchmarks Track
Reviewer of ICLR 2025
Web Master of WebST 2025 & UrbComp 2025

Teaching

DSAA2043 Design and Analysis of Algorithms

Fall 2025
PLED5001 Communicating Research in English

Spring 2025
PDEV6800 Introduction to Teaching and Learning in Higher Education

Fall 2024

Action required

Education

Experience

News

Publications (view all )

Learning to Factorize Spatio-Temporal Foundation Models

Learning to Factorize Spatio-Temporal Foundation Models

Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting

Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting

Multimodal Learning for Spatio-Temporal Data Mining

Multimodal Learning for Spatio-Temporal Data Mining

UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation

UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting

Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data

Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data

AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks

AirRadar: Inferring Nationwide Air Quality in China with Deep Neural Networks

UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Model for Urban Indicator Prediction

UrbanVLP: A Multi-Granularity Vision-Language Pre-Trained Model for Urban Indicator Prediction

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Spatio-Temporal Field Neural Networks for Air Quality Inference

Spatio-Temporal Field Neural Networks for Air Quality Inference

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web

Awards

Activities

Service

Teaching