I am an M.Phil. student and an incoming Ph.D. student in Data Science and Analytics at the Hong Kong University of Science and Technology (Guangzhou), under the supervision of Prof. Yuxuan Liang and Prof. Yangqiu Song. I earned my B.E. from the School of Computer and Information at Hefei University of Technology.
My research interests include Multimodal Machine Learning and Data Mining, with a focus on Spatio-Temporal and Urban scenarios. I aim to derive insights from large-scale, heterogeneous, cross-domain data.
Previously, I was a Research Intern at XPENG, working on visual multimodal research and contributing to the XNGP System. I also worked as a Software Engineer at Tencent, where I improved QQ’s performance and developed cloud-native tools such as CodeSpaces and the Workflow Engine.
Siru Zhong, Xixuan Hao, Yibo Yan, Ying Zhang, Yangqiu Song, Yuxuan Liang
ACM International Conference on Multimedia (ACM MM) 2024 Poster
First-ever cross-domain framework that integrates the power of LMM and SAM into satellite image-text retrieval.
Yutong Feng, Qiongyan Wang, Yutong Xia, Junlin Huang, Siru Zhong, Kun Wang, Shifen Cheng, Yuxuan Liang
The International Joint Conference on Artificial Intelligence (IJCAI) 2024
A pioneering Spatio-Temporal Field Neural Network model that integrates two distinct perspectives on space and time to perform air quality inference.
Huaiwu Zhang, Yutong Xia, Siru Zhong, Kun Wang, Zekun Tong, Qingsong Wen, Roger Zimmermann, Yuxuan Liang
The International Joint Conference on Artificial Intelligence (IJCAI) 2024
A novel deep-learning model for predicting real-time parking availability in Singapore that analyzes external factors and introduces a new dataset.
Yibo Yan, Haomin Wen, Siru Zhong, Wei Chen, Haodong Chen, Qingsong Wen, Roger Zimmermann, Yuxuan Liang
The International World Wide Web Conference (WWW) 2024 Oral
First-ever cross-domain framework that integrates the power of LMM and SAM into satellite image-text retrieval.
Jianxiang Zhou, Erdong Liu, Wei Chen, Siru Zhong, Yuxuan Liang
Under review. 2024
Introduces the Spatio-Temporal Graph Transformer (STGormer), a model that integrates attribute and structure information in traffic data to learn spatio-temporal correlations and uses a mixture-of-experts module to capture heterogeneity, achieving state-of-the-art performance in traffic forecasting.
Xixuan Hao, Wei Chen, Yibo Yan, Siru Zhong, Kun Wang, Qingsong Wen, Yuxuan Liang
Under review. 2024
First urban region representation learning framework that explores multi-granularity cross-modal alignment.