



I like Robotic Learning (Embodied AI?) and Reinforcement Learning. I am focused on building end-to-end robots with Universality, Generalizability, and Robustness, utilizing learning‑based methods that scale with data and computation. I am currently passionate about Manipulation and Locomotion tasks.
📍 Experience
- 2020.9-2021.7: Major in Biology, Xiamen University


🔥 News
- 2025.7: ⭐⭐ A summary of VLA+RL - Awesome-VLA-RL - is available in Github.
- 2025.5: 🎉🎉 Our works PORL and LIT is available in arxiv.
- 2024.11: 🎉🎉 Homepage has been set up.
- 2024.09: 🎉🎉 Our paper PT4Rec is accepted by ACML2024 and Machine Learning Journal.
📝 Publications
Embodied AI & RL

Efficient Online RL Fine-Tuning with Offline Pre-trained Policy Only
Wei Xiao*, Jiacheng Liu, Zifeng Zhuang, Runze Suo, Shangke Lyu†, Donglin Wang†
- Online RL Fine-Tuning.
- Paper

Learning Robotic Policy with Imagined Transition: Mitigating the Trade-off between Robustness and Optimality
Wei Xiao*, Shangke Lyu†, Zhefei Gong, Renjie Wang, Donglin Wang†
- Robotic Locomotion, RL.
- Paper

Robust Online Residual Refinement via Koopman-Guided Dynamics Modeling
Zhefei Gong, Shangke Lyu, Pengxiang Ding, Wei Xiao, Donglin Wang†
- Robotic Manipulation, Model-Based RL.

Integrating Trajectory Optimization and Reinforcement Learning for Quadrupedal Jumping with Terrain-Adaptive Landing
Renjie Wang*, Shangke Lyu†, Xin Lang, Wei Xiao, Donglin Wang†
- Robotic Locomotion, Trajectory Optimization, RL.
- The 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2025), accepted.

Uncertainty-Aware Planning: Mitigating Exploration Loss in Model-Based Reinforcement Learning
Runze Suo*, Zifeng Zhuang, Shangke Lyu, Xiao He, Wei Xiao, Ting Wang, Donglin Wang†
- Model-Based RL, Planning.
Recommender System


🎖 Honors and Awards

Tencent Kaiwu Reinforcement Learning Competition - 2023.12
Team: 南强至善- Wei Xiao*, Yifang Lin, Jinyang Lai, Huaming Xu, Zejie Jiang, Yunlong Liu†
- Fourth Place (with Bonus ¥20,000)
- Muitl-Agent, Reinforcement Learning. Technical Report

The 17th National Smart Car Competition for University Students - 2022.07
Team: 南强至善- Wei Xiao*, Tianhao Hu, Yuhang Liu, Jincai Luo†
- The Second Prize in South Region
- Computer Vision, PID Control. Technical Blog Video1 Video2

- The 13th Mathorcup Mathematical Modelling Competition, Third prize.
- Huawei Software Elite Challenge, Third Prize.
- National Mathematical Modelling Competition for College Students, Second Prize in Fujian Province.
- National Algorithm Competition for College Students, Excellence Award.
- and so on.