About Me

I am a Ph.D. student at Nanyang Technological University (NTU), advised by Prof. Chau Yuen. My research focuses on the intersection of Large Language Models (LLMs) and engineering automation, organized into three pillars:

  • LLM Evaluation - Designing trustworthy, scalable benchmarks to measure reasoning and generalization. Representative works: DafnyComp (formal verification & compositional generalization), WirelessMathBench (math-centric modeling), and WritingPreferenceBench (subjective evaluation across cultures).
  • Domain LLMs - Adapting foundation models for vertical engineering domains. Representative works: WirelessMathLM (verification-based RL for wireless reasoning) and COIG-Writer (high-quality data synthesis).
  • Multi-Agent Systems - Investigating agent communication protocols and memory mechanisms to enable efficient collaboration. Representative works: LACP (standardizing LLM agent communication) and ongoing research into shared memory structures for autonomous agents.

Previously at MSRA, MEGVII, and Gausium Robotics, I led perception projects from prototype to deployment across SLAM, VIO, and radio/vision stacks.

I received my M.E. from Peking University and my B.E. from Northeastern University, China.

Open to Collaboration: If you are seeking academic collaboration 🤝 on LLMs, agents, or engineering applications, please email me at xin019@e.ntu.edu.sg.

Recent News

Selected Publications

(* indicates equal contribution. For a full list, please see my Google Scholar ⤻.)

Local Success Does Not Compose: Benchmarking Large Language Models for Compositional Formal Verification

X. Xu*, Xin Li*, X. Qu, J. Fu, B. Yuan

ICLR, 2026

WirelessMathLM: Teaching Mathematical Reasoning for LLMs in Wireless Communications with Reinforcement Learning

Xin Li, M. Liu, Y. Zhu, W. Zhang, L. Wei, J. An, C. Yuen

arXiv, 2025

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

S. Ying, Y. Li, X. Qu, Xin Li, et al., G. Zhang, W. Huang, W. Che, C. Lin

arXiv, 2025

COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

Y. Li, S. Ying, X. Qu, Xin Li, et al., E. Zhang

arXiv, 2025

LACP: LLM Agent Communication Protocol Requires Urgent Standardization

Xin Li, M. Liu, C. Yuen

AI4NextG @ NeurIPS, 2025

Re:Form-Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs

C. Yan*, F. Che*, X. Huang*, X. Xu*, Xin Li*, Y. Li*, X. Qu*, et al., J. Fu

Technical Report, 2025

WirelessMathBench: A Mathematical Modeling Benchmark for LLMs in Wireless Communications

Xin Li, M. Liu, L. Wei, J. An, M. Debbah, C. Yuen

Findings of ACL, 2025

Onboard Terrain Classification via Stacked Intelligent Metasurface-Diffractive Deep Neural Networks from SAR Level-0 Raw Data

Mengbing Liu, Xin Li, Jiancheng An, Chau Yuen

ML4RS @ ICLR, 2025

TransPathNet: A Two-Stage Framework for Indoor Radio Map Prediction

Xin Li, Ran Liu, Saihua Xu, Sirajudeen Gulam Razul, Chau Yuen

ICASSP, 2025

Co-Planar Parametrization for Stereo-SLAM and Visual-Inertial Odometry

Xin Li*, Yanyan Li*, Evin Pinar Örnek, Jinlong Lin, Federico Tombari

IEEE Robotics and Automation Letters (RA-L), 2020

Leveraging Planar Regularities for Point-Line Visual-Inertial Odometry

Xin Li*, Yijia He*, Jinlong Lin, Xiao Liu

IROS, 2020

Experience

  • Research Assistant @ NTU Singapore (Apr 2024 - Jan 2025) - Supervised by Prof. Chau Yuen, IEEE Fellow
  • SLAM Algorithm Engineer @ Gausium Robotics, Singapore (Mar 2022 - Feb 2024)
  • Research Intern @ Microsoft Research Asia, Beijing (Sep 2020 - Mar 2021) - Supervised by Dr. Yang Liu and Dr. Yizhong Zhang
  • Research Intern @ MEGVII, Beijing (Feb 2019 - Mar 2020) - Supervised by Dr. Yijia He

Education

Invited Talks

Grants & Awards

  • Research Grants
    • Google Gemini Academic Program Award ($10,000 USD) - 2026
    • Modal Academics Compute Grant ($2,000) - 2025
    • Cohere Labs Catalyst Grant ($1,500) - 2025
    • OpenAI Researcher Access Program ($1,000) - 2025
  • Academic Honors
    • Rohde & Schwarz Award (IEEE 6G Summit Singapore) - 2025
    • PREMIA Best Student Paper Award Finalist - 2025
    • NTU Research Scholarship (Full Ph.D. Funding) - 2025

Professional Activities

  • Conference Reviewer - NeurIPS, ICLR, ICML, AAAI, CVPR, ECCV, AISTATS, SIGGRAPH, IROS, ICRA.
  • Journal Reviewer - IEEE RA-L, ACM TOG, IEEE TNNLS.
  • Workshop Organizer - AIR4D@IROS 2025.