


Hi!, I’m Wei Xiao (肖巍), currently a Research Assistant at MiLab, Westlake University advised by Donglin Wang. Prior to that, I got my bachelor’s degree in Automation from Xiamen University advised by Qifeng Zhou.
I am looking for research collaborations and a PhD position starting from 2026 Fall. Please drop me an email if you are interested in my research or just want to chat!
Email: xiaowei2002103@foxmail.com !
I like Robotic Learning (Embodied AI?) and Reinforcement Learning. I am focused on building end-to-end robots with Universality, Generalizability, and Robustness, utilizing learning‑based methods that scale with data and computation. I am currently passionate about Manipulation and Locomotion tasks.
📍 Experience
- 2020.9-2021.7: Major in Biology, Xiamen University


🔥 News
- 2025.5: 🎉🎉 Our works PORL and LIT is available in arxiv.
- 2024.11: 🎉🎉 Homepage has been set up.
- 2024.09: 🎉🎉 Our paper PT4Rec is accepted by ACML2024 and Machine Learning Journal.
📝 Publications
Embodied AI & RL

Efficient Online RL Fine-Tuning with Offline Pre-trained Policy Only
Wei Xiao*, Jiacheng Liu, Zifeng Zhuang, Runze Suo, Shangke Lyu†, Donglin Wang†
- Online RL Fine-Tuning.
- Paper

Learning Robotic Policy with Imagined Transition: Mitigating the Trade-off between Robustness and Optimality
Wei Xiao*, Shangke Lyu†, Zhefei Gong, Renjie Wang, Donglin Wang†
- Robotic Locomotion, RL.
- Paper

Robust Online Residual Refinement via Koopman-Guided Dynamics Modeling
Zhefei Gong, Shangke Lyu, Pengxiang Ding, Wei Xiao, Donglin Wang†
- Robotic Manipulation, Model-Based RL.

Integrating Trajectory Optimization and Reinforcement Learning for Quadrupedal Jumping with Terrain-Adaptive Landing
Renjie Wang*, Shangke Lyu†, Xin Lang, Wei Xiao, Donglin Wang†
- Robotic Locomotion, Trajectory Optimization, RL.

Uncertainty-Aware Planning: Mitigating Exploration Loss in Model-Based Reinforcement Learning
Runze Suo*, Zifeng Zhuang, Shangke Lyu, Xiao He, Wei Xiao, Ting Wang, Donglin Wang†
- Model-Based RL, Planning.
Recommender System


🎖 Honors and Awards

Tencent Kaiwu Reinforcement Learning Competition - 2023.12
Team: 南强至善- Wei Xiao*, Yifang Lin, Jinyang Lai, Huaming Xu, Zejie Jiang, Yunlong Liu†
- Fourth Place (with Bonus ¥20,000)
- Muitl-Agent, Reinforcement Learning. Technical Report

The 17th National Smart Car Competition for University Students - 2022.07
Team: 南强至善- Wei Xiao*, Tianhao Hu, Yuhang Liu, Jincai Luo†
- The Second Prize in South Region
- Computer Vision, PID Control. Technical Blog Video Video

- The 13th Mathorcup Mathematical Modelling Competition, Third prize.
- Huawei Software Elite Challenge, Third Prize.
- National Mathematical Modelling Competition for College Students, Second Prize in Fujian Province.
- National Algorithm Competition for College Students, Excellence Award.
- and so on.