Cover

About Me

Hi! I'm Wei Xiao (肖巍), currently working with Yao (Mark) Mu in ScaleLab@SJTU. Previously, I was a Research Assistant at MiLab, Westlake University advised by Donglin Wang. I got my B.Eng in Automation from Xiamen University advised by Qifeng Zhou.
I am looking for research collaborations and a PhD position starting from 2026 Fall. Please drop me an email if you are interested in my research or just want to chat! Email: xiaowei2002103@foxmail.com
I like Robotic Learning (Embodied AI?), Reinforcement Learning, and World Model. I am focusing on building end-to-end robots with Universality, Generalizability, and Robustness, utilizing learning‑based methods that scale with data and computation. I am currently passionate about Manipulation and Locomotion tasks.

Experience

- 2021.7-2024.7: Bachelor of Engineering - Major in Automation, Xiamen University
- 2020.9-2021.7: Major in Biology, Xiamen University
Xiamen University
- 2024.7-2025.6: Research Assistant in Machine Intelligence Lab (MiLAB), Westlake University Westlake University
- 2025.6-Now: Research Intern in Spatial Cognition and Robotic Automative Learning Laboratory (ScaleLAB), Shanghai Jiao Tong University Shanghai Jiao Tong University

News

  • 2025.7:  ⭐ A summary of VLA+RL - Awesome-VLA-RL - is available in Github.
  • 2025.5:  🎉 Our works PORL and LIT are available in arxiv.
  • 2024.11:  🎉 Homepage has been set up.
  • 2024.09:  🎉 Our paper PT4Rec is accepted by ACML2024 and Machine Learning Journal.

Publications

Embodied AI & RL

sym

Efficient Online RL Fine-Tuning with Offline Pre-trained Policy Only

Wei Xiao*, Jiacheng Liu, Zifeng Zhuang, Runze Suo, Shangke Lyu†, Donglin Wang

sym

Learning Robotic Policy with Imagined Transition: Mitigating the Trade-off between Robustness and Optimality

Wei Xiao*, Shangke Lyu†, Zhefei Gong, Renjie Wang, Donglin Wang

sym

Integrating Trajectory Optimization and Reinforcement Learning for Quadrupedal Jumping with Terrain-Adaptive Landing

Renjie Wang*, Shangke Lyu†, Xin Lang, Wei Xiao, Donglin Wang

  • The 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2025).
sym

Robust Online Residual Refinement via Koopman-Guided Dynamics Modeling

Zhefei Gong, Shangke Lyu, Pengxiang Ding, Wei Xiao, Donglin Wang

sym

Uncertainty-Aware Planning: Mitigating Exploration Loss in Model-Based Reinforcement Learning

Runze Suo*, Zifeng Zhuang, Shangke Lyu, Xiao He, Wei Xiao, Ting Wang, Donglin Wang

Recommender System

sym

Continuous-Time Sequential Recommendation with State Space Models

Wei Xiao, Huiying Wang, Qifeng Zhou†, Qing Wang

sym

PT4Rec: A Universal Prompt-Tuning Framework for Graph Contrastive Learning-Based Recommendations

Wei Xiao*, Qifeng Zhou

  • The 16th Asian Conference on Machine Learning, Machine Learning Journal, (ACML2024).
  • Paper | Code

Honors and Awards

sym

Tencent Kaiwu Reinforcement Learning Competition

Team: 南强至善- Wei Xiao*, Yifang Lin, Jinyang Lai, Huaming Xu, Zejie Jiang, Yunlong Liu

  • Fourth Place (with Bonus ¥20,000) - 2023.12
  • Leaderboard
sym

The 17th National Smart Car Competition for University Students

Team: 南强至善- Wei Xiao*, Tianhao Hu, Yuhang Liu, Jincai Luo†

sym
  • The 13th Mathorcup Mathematical Modelling Competition, Third prize.
  • Huawei Software Elite Challenge, Third Prize.
  • National Mathematical Modelling Competition for College Students, Second Prize in Fujian Province.
  • National Algorithm Competition for College Students, Excellence Award.
  • and so on.

Visitors