About Me

I am a Ph.D. student at Nanyang Technological University (NTU), advised by Prof. Chau Yuen. My research develops benchmarks, training methods, and agent systems for verifiable and domain-specific reasoning in Large Language Models (LLMs).

  • LLM Evaluation - Trustworthy benchmarks for reasoning and generalization, including DafnyComp, WirelessMathBench, and WritingPreferenceBench.
  • Domain LLMs - Adapting foundation models to engineering domains such as wireless communications and formal verification, including WirelessMathLM and Re:Form.
  • Multi-Agent Systems - Agent communication protocols and memory mechanisms, including LACP and ongoing work on shared-memory structures for autonomous agents.

Previously at MSRA, MEGVII, and Gausium Robotics, I led perception projects from prototype to deployment across SLAM, VIO, and radio/vision stacks.

I received my M.E. from Peking University and my B.E. from Northeastern University, China.

Open to Collaboration: If you are interested in academic collaboration on LLMs, agents, or engineering applications, please email me at xin019@e.ntu.edu.sg.

Recent News

Selected Publications

(* indicates equal contribution. For a full list, please see my Google Scholar ⤻.)

Local Success Does Not Compose: Benchmarking Large Language Models for Compositional Formal Verification

X. Xu*, Xin Li*, X. Qu, J. Fu, B. Yuan

ICLR, 2026

Re:Form: Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny

C. Yan*, F. Che*, X. Huang*, X. Xu*, Xin Li*, Y. Li*, X. Qu*, et al., J. Fu

TMLR, 2026

WirelessMathLM: Teaching Mathematical Reasoning for LLMs in Wireless Communications with Reinforcement Learning

Xin Li, M. Liu, Y. Zhu, W. Zhang, L. Wei, J. An, C. Yuen

arXiv, 2025

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

S. Ying, Y. Li, X. Qu, Xin Li, et al., G. Zhang, W. Huang, W. Che, C. Lin

arXiv, 2025

LACP: LLM Agent Communication Protocol Requires Urgent Standardization

Xin Li, M. Liu, C. Yuen

AI4NextG @ NeurIPS, 2025

WirelessMathBench: A Mathematical Modeling Benchmark for LLMs in Wireless Communications

Xin Li, M. Liu, L. Wei, J. An, M. Debbah, C. Yuen

Findings of ACL, 2025

TransPathNet: A Two-Stage Framework for Indoor Radio Map Prediction

Xin Li, Ran Liu, Saihua Xu, Sirajudeen Gulam Razul, Chau Yuen

ICASSP, 2025

Co-Planar Parametrization for Stereo-SLAM and Visual-Inertial Odometry

Xin Li*, Yanyan Li*, Evin Pinar Örnek, Jinlong Lin, Federico Tombari

IEEE Robotics and Automation Letters (RA-L), 2020

Leveraging Planar Regularities for Point-Line Visual-Inertial Odometry

Xin Li*, Yijia He*, Jinlong Lin, Xiao Liu

IROS, 2020

Experience

  • Research Assistant @ NTU Singapore (Apr 2024 - Jan 2025) - Supervised by Prof. Chau Yuen, IEEE Fellow
  • SLAM Algorithm Engineer @ Gausium Robotics, Singapore (Mar 2022 - Feb 2024)
  • Research Intern @ Microsoft Research Asia, Beijing (Sep 2020 - Mar 2021) - Supervised by Dr. Yang Liu and Dr. Yizhong Zhang
  • Research Intern @ MEGVII, Beijing (Feb 2019 - Mar 2020) - Supervised by Dr. Yijia He

Education

Grants & Awards

  • Research Grants
    • Google Gemini Academic Program Award ($10,000 USD) - 2026
    • Modal Academics Compute Grant ($2,000) - 2025
    • Cohere Labs Catalyst Grant ($1,500) - 2025
    • OpenAI Researcher Access Program ($1,000) - 2025
  • Academic Honors
    • Rohde & Schwarz Award (IEEE 6G Summit Singapore) - 2025
    • PREMIA Best Student Paper Award Finalist - 2025
    • NTU Research Scholarship (Full Ph.D. Funding) - 2025

Invited Talks

Professional Activities

  • Conference Reviewer - NeurIPS, ICLR, ICML, AAAI, CVPR, ECCV, AISTATS, SIGGRAPH, IROS, ICRA.
  • Journal Reviewer - IEEE RA-L, ACM TOG, IEEE TNNLS.
  • Workshop Organizer - AIR4D@IROS 2025.