Cover

About Me

Hi! I'm Wei Xiao (肖巍), currently working with Yao (Mark) Mu in ScaleLab@SJTU. Previously, I was a Research Assistant at MiLab, Westlake University advised by Donglin Wang. I got my B.Eng in Automation from Xiamen University advised by Qifeng Zhou.
I am looking for research collaborations and a PhD position starting from 2026 Fall. Please drop me an email if you are interested in my research or just want to chat! Email: xiaowei2002103@foxmail.com
I like Robotic Learning (Embodied AI?), Reinforcement Learning, and World Model. I am focusing on building end-to-end robots with Universality, Generalizability, and Robustness, utilizing learning‑based methods that scale with data and computation. I am currently passionate about Manipulation and Locomotion tasks.

Experience

- 2021.7-2024.7: Bachelor of Engineering - Major in Automation, Xiamen University
- 2020.9-2021.7: Major in Biology, Xiamen University
Xiamen University
- 2024.7-2025.6: Research Assistant in Machine Intelligence Lab (MiLAB), Westlake University Westlake University
- 2025.6-Now: Research Intern in Spatial Cognition and Robotic Automative Learning Laboratory (ScaleLAB), Shanghai Jiao Tong University Shanghai Jiao Tong University

News

  • 2025.7:  ⭐ A summary of VLA+RL - Awesome-VLA-RL - is available in Github.
  • 2025.5:  🎉 Our works PORL and LIT are available in arxiv.
  • 2024.11:  🎉 Homepage has been set up.
  • 2024.09:  🎉 Our paper PT4Rec is accepted by ACML2024 and Machine Learning Journal.

Publications

Embodied AI & RL

sym

Efficient Online RL Fine-Tuning with Offline Pre-trained Policy Only

Wei Xiao*, Jiacheng Liu, Zifeng Zhuang, Runze Suo, Shangke Lyu†, Donglin Wang

sym

Learning Robotic Policy with Imagined Transition: Mitigating the Trade-off between Robustness and Optimality

Wei Xiao*, Shangke Lyu†, Zhefei Gong, Renjie Wang, Donglin Wang

sym

TrajBooster: Boosting Humanoid Whole-Body Manipulation via Trajectory-Centric Learning

Jiacheng Liu, Pengxiang Ding, Qihang Zhou, Yuxuan Wu, Da Huang, Zimian Peng, Wei Xiao, Weinan Zhang, Lixin Yang, Cewu Lu†, Donglin Wang†

sym

Integrating Trajectory Optimization and Reinforcement Learning for Quadrupedal Jumping with Terrain-Adaptive Landing

Renjie Wang*, Shangke Lyu†, Xin Lang, Wei Xiao, Donglin Wang

  • The 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2025 Oral).
  • Arxiv

Recommender System

sym

Continuous-Time Sequential Recommendation with State Space Models

Wei Xiao, Huiying Wang, Qifeng Zhou†, Qing Wang

sym

PT4Rec: A Universal Prompt-Tuning Framework for Graph Contrastive Learning-Based Recommendations

Wei Xiao*, Qifeng Zhou

  • The 16th Asian Conference on Machine Learning, Machine Learning Journal, (ACML2024).
  • Paper | Code

Honors and Awards

sym

Tencent Kaiwu Reinforcement Learning Competition

Team: 南强至善- Wei Xiao*, Yifang Lin, Jinyang Lai, Huaming Xu, Zejie Jiang, Yunlong Liu

  • Fourth Place (with Bonus ¥20,000) - 2023.12
  • Leaderboard
sym

The 17th National Smart Car Competition for University Students

Team: 南强至善- Wei Xiao*, Tianhao Hu, Yuhang Liu, Jincai Luo†

sym
  • The 13th Mathorcup Mathematical Modelling Competition, Third prize.
  • Huawei Software Elite Challenge, Third Prize.
  • National Mathematical Modelling Competition for College Students, Second Prize in Fujian Province.
  • National Algorithm Competition for College Students, Excellence Award.
  • and so on.

Visitors