Xinyue Zeng - Personal Homepage

About Me

I am Ph.D. candicate in Computer Science at Virginia Tech, advised by Professor Dawei Zhou. I received my M.S. in Biostatistics from the University of North Carolina at Chapel Hill in 2024, where I was luckily advised by Professor Yun Li, and my B.S. in Statistics from Zhejiang University in 2022 under the supervision of Professor Peng Zhang.

My research focuses on LLM Reasoning and Inference, with an emphasis on developing eliability, efficiency and principled evaluation frameworks for AI assurance. I am particularly interested in advancing the theoretical and empirical understanding of LLMs to improve their reliability and generalization in real-world applications.

What I'm Focusing

AI Assurance:
Developed advanced methods to enhance large language models with robust adaptability and precision in open-world settings. [LensLLM, HalluGuard, Plan‑and‑Budget]
Ai4Science:
Applied ML and statistical modeling to biological data for structure prediction, biomarker discovery, and disease analysis. [DISPROTBENCH, Epic-Unmix]
Exploring other areas and welcome collaborations!

Latest News

May 2026

Start as a researcher intern at Microsoft Copilot Frontier AI team. Honored to be invited for this opportunity to extend my research on test-time inference and hallucination control.

Apr 2026

Honored to receive Graduate Student Travel Fellowship by College of Engineering.

Mar 2026

Honored to receive Tier One Financial Travel Award by ICLR'26.

Feb 2026

Honored to be selected as Student Spotlight by Sanghani Center for AI & Data Analytics.

Feb 2026

Honored to be selected as one of twenty recipients of Travel Scholarship by ACM CAPWIC.

Jan 2026

My research about foundational LLM reasoning was accepted by ACM CAPWIC'26, see you in Alexandria on Mar 27-28!

Jan 2026

Two first-author papers (one leading-author: HalluGuard, one co-first: Plan-and-Budget) were accepted by ICLR'26! See you in Brazil this April!

Nov 2025

One paper Epic-Unmix was accepted by Genome Biology!

Aug 2025

My research about foundational LLM fine-tuning was accepted by ICDM'25 PhD forum, see you in D.C. on Nov 12-15!

Jul 2025

Honored to receive GPSS Travel Grant from Graduate School!

Jun 2025

Honored to receive Student Travel Award from CS Department!

May 2025

My first leading paper LensLLM has been accepted as poster at ICML'25 main conference!

Resume

Education

Virginia Tech
2023 — 2028
College of Engineering
PhD candidate in Computer Science
University of North Carolina at Chapel Hill
2022 — 2024
Gillings School of Public Health
Master of Science in Biostatistics
Zhejiang University
2018 — 2022
School of Mathematical Sciences
Bachelor of Science in Statistics

Experience

Research Intern
May 2026 — Aug 2026
Microsoft, Redmond, USA
- • Lead research on RL-based post-training and test-time search to improve the reliability and efficiency of LLMs in agentic Copilot systems, while designing and developing tool-augmented LLM agent pipelines in live environments to enable multi-LLM collaboration and long-horizon tool-use workflows.
Graduate Research Assistant
Aug 2024 — May 2028
Computer Science, Virginia Tech, Blacksburg, USA
- • Developed scalable methods to improve the reliability and efficiency of frontier AI systems, with a focus on hallucination mitigation, uncertainty quantification, and adaptive test-time efficiency—advancing the theoretical and empirical foundations of assured, efficient, and trustworthy large language models in real-world deployments.
Visiting Scholar
July 2023 — July 2024
Computer Science, Virginia Tech, Blacksburg, USA
- • Conducted research at the intersection of statistical learning theory and foundation models, developing principled probabilistic frameworks to characterize LLM generalization and reliability—laying the theoretical groundwork for subsequent work on model selection and uncertainty quantification.
Graduate Research Assistant
Aug 2022 — May 2024
Biostatistics, UNC-Chapel Hill, Chapel Hill, USA
- • Developed scalable Bayesian probabilistic frameworks and uncertainty-aware ML models for large-scale data inference, focusing on principled statistical learning methods with applications to heterogeneous, high-dimensional datasets.
Student Researcher
Oct 2019 — May 2024
Statistics, Zhejiang University, Hangzhou, China
- • Applied statistical optimization and deep learning to build state-of-the-art predictive systems in quantitative finance, developing end-to-end ML pipelines with measurable improvements in forecasting accuracy and computational efficiency.
Software Design Engineer Intern
July 2021 — Sep 2021
Alibaba, Hangzhou, China
- • Redesigned a legacy static-path system with a Java-based Dynamic Source Routing (DSR) algorithm incorporating a node-energy score, boosting end-to-end packet delivery from 88% to 97% and reducing average latency by 30% in 500-node stress tests under highly dynamic traffic.

Publications

HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs
2026
Xinyue Zeng, Junhong Lin, Yujun Yan, Feng Gao, Liang Shi, Jun Wu, Dawei Zhou
ICLR'26
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning
2026
Junhong Lin*, Xinyue Zeng*, Jie Zhu, Song Wang, Julian Shun, Jun Wu, Dawei Zhou (*Equal Contribution)
ICLR'26
LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection
2025
Xinyue Zeng, Haohui Wang, Junhong Lin, Jun Wu, Tyler Cody, Dawei Zhou
International Conference on Machine Learning (ICML) 2025
DISPROTBENCH: A Disorder-Aware, Task-Rich Benchmark for Evaluating Protein Structure Prediction in Realistic Biological Contexts
2025
Xinyue Zeng, Tuo Wang, Adithya Kulkarni, Alexander Lu, Alexandra Ni, Phoebe Xing, Junhan Zhao, Siwei Chen, Dawei Zhou
Preprint'25
DiagnoLLM: A Hybrid Bayesian Neural Language Framework for Interpretable Disease Diagnosis
2025
Bowen Xu*, Xinyue Zeng*, Jiazhen Hu, Tuo Wang, Adithya Kulkarni (*Equal Contribution)
Preprint'25
Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions
2025
Adithya Kulkarni, Fatimah Alotaibi, Xinyue Zeng, Longfeng Wu, Tong Zeng, Barry Menglong Yao, Minqian Liu, Shuaicheng Zhang, Lifu Huang, Dawei Zhou
Cell-type specific inference from bulk RNA-sequencing data by integrating single cell reference profiles via EPIC-unmix
2024
Chenwei Tang, Quan Sun, Xinyue Zeng, Xiaoyu Yang, Fei Liu, Jinying Zhao, Yin Shen, Bixiang Liu, Jia Wen, Yun Li
Genome Biology
Development of stock investment system based on big data analysis and statistical optimization
2022
Xinyue Zeng, Yuting Fu, Haiyun Zou, Peng Zhang
National Innovation and Entrepreneurship Program at Zhejiang University

Portfolio

Awards & Honors

Tier One Financial Travel Award
2025 — 2026
ICLR'26
Graduate Student Travel Fellowship
2025 — 2026
College of Engineering
Student Spotlight
2025 — 2026
Sanghani Center for AI & Data Analytics
Sanghani Center Travel Grant
2025 — 2026
Sanghani Center for AI & Data Analytics
Travel Scholarship
2025 — 2026
ACM CAPWIC
GPSS Travel Grant
2024 — 2025
Virginia Tech Graduate School
CS Travel Award
2024 — 2025
Virginia Tech CS Department
Biostatistics Travel Award
2023 — 2024
UNC-Chapel Hill
Undergraduate Innovation and Entrepreneurship Award
2021 — 2022
Zhejiang University

Presentations

HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs
2026
Fourteenth International Conference on Learning Representations(ICLR 2026)
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning
2026
Fourteenth International Conference on Learning Representations(ICLR 2026)
Foundational LLM Reasoning and Inference
2026
ACM CAPWIC 2026
Foundational LLM Fine-tuning
2025
International Conference on Data Mining (ICDM) PhD Forum 2025
LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection
2025
Forty-Second International Conference on Machine Learning (ICML 2025)
Gaussian-Process-Based Cell Type Specific Unmixing of Bulk Expression Profiles
2024
ENAR 2024 Spring Meeting
Wildlife recognition based on deep learning
2022
Dissertation Defense for Undergraduate Thesis at Zhejiang University

Invited Talks

Invited Talk for HalluGuard
Apr 2026
NICE AI
Invited Talk for LensLLM
Oct 2025
Cerebras AI
Invited Talk for LensLLM
Oct 2025
PLOUTOS AI
Guest Lecture: CS 4824
March 2025
Virginia Tech

Services

Reviewer

ICLR 2026, ICLR 2026 workshop, ICML 2026(Golden Reviewer), KDD 2026, NeurIPS 2026

Copyrights

Statistical Analysis Platform
2023
Software Copyright in China

What I'm Focusing

AI Assurance:

Ai4Science:

Exploring other areas and welcome collaborations!

Latest News

Education

Virginia Tech

University of North Carolina at Chapel Hill

Zhejiang University

Experience

Research Intern

Graduate Research Assistant

Visiting Scholar

Graduate Research Assistant

Student Researcher

Software Design Engineer Intern

Publications

Awards & Honors

Tier One Financial Travel Award

Graduate Student Travel Fellowship

Sanghani Center Travel Grant

Travel Scholarship

GPSS Travel Grant

CS Travel Award

Biostatistics Travel Award

Undergraduate Innovation and Entrepreneurship Award

Presentations

HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs

Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning

Foundational LLM Reasoning and Inference

Foundational LLM Fine-tuning

LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection

Gaussian-Process-Based Cell Type Specific Unmixing of Bulk Expression Profiles

Wildlife recognition based on deep learning

Invited Talks

Invited Talk for HalluGuard

Invited Talk for LensLLM

Invited Talk for LensLLM

Guest Lecture: CS 4824

Services

Reviewer

Copyrights

Statistical Analysis Platform

Contact Form