Biography
I am a second-year PhD student at the University of Arizona, advised by Prof. Lei Cao.
I received my B.Eng. degree from Sun Yat-sen University in 2022.
After that, I worked on the CDB/TDSQL-C team at Tencent as a software engineer for two years.
I was also fortunate to start my first research journey with Prof. Qizhen Zhang.
My research interests include Agentic Data Systems, Cloud Database Systems, and LLMs for Data Analytics.
Publications
ARTIST: Learning Optimizers for Agentic Workflows
Jia Yuan*, Linan Zheng*, Chuan Lei, Tim Kraska, Sam Madden, Lei Cao (* Equal contribution)
SIGMOD 2027, In Submission
CacheServer: Disaggregated Caching for Cloud Databases
Junyong Zhao, Jia Yuan, Yi Lu, Sam Madden, Lei Cao
SIGMOD 2027, In Submission
HotHash: Cache-aware Load Balancing for Cloud Data Systems
Junyong Zhao, Jia Yuan, Zui Chen, Yi Lu, Lei Cao, Sam Madden
SIGMOD 2026
Unstructured Data Analysis Using LLMs: A Comprehensive Benchmark
Qiyan Deng, Jianhui Li, Chengliang Chai, Ye Yuan, Jinqi Liu, Junzhi She, Kaisen Jin, Zhaoze Sun, Yuhao Deng, Jia Yuan, Yuping Wang, Guoren Wang, Lei Cao
VLDB 2026
BRIEF: Bi-level Coreset Selection for Efficient Instruction Tuning in LLMs
Chaoyuan Shen, Chi Zhang, Chengliang Chai, Jiacheng Wang, Jia Yuan, Yuping Wang, Ye Yuan, Lei Cao, Guoren Wang
VLDB 2026
SCompression: Enhancing Database Knob Tuning Efficiency Through Slice-Based Workload Compression
Baoqing Cai, Yu Liu, Ma Lin, Bingcheng Lian, Ke Zhou, Jia Yuan, Jie Yang, Xiaofan Cai
VLDB 2025
Education
University of Arizona [Aug. 2024 ~ Present]
PhD in Computer Science
Advisor: Prof. Lei Cao
Sun Yat-sen University (SYSU), B.Eng. [Sep. 2018 ~ Jun. 2022]
Major: Computer Science and Engineering
Experience
University of Toronto
Remote Intern
Advisor: Prof. Qizhen Zhang
Projects: Large-scale data shuffle optimization (e.g., Spark shuffle)
Tencent
Software Engineer (Full-time)
Projects: CDBTune, CDB Workload Generation/Replay, Anomaly Detection and Root Cause Analysis
Hobbies
Marathon, Music, Cuisine