I’m Wei Xiao (肖巍), now a Research Assistant at MiLAB, Westlake University, advised by Donglin Wang. Prior to that, I received my bachelor’s degree in Automation from Xiamen University, where I was advised by Qifeng Zhou.
I am looking for research collaborations and a Ph.D./MPhil position starting in 2025/2026. Please drop me an email if you are interested in my research or just want to chat!
Email: xiaowei2002103@foxmail.com / xiaowei@westlake.edu.cn
My research interests include Embodied AI and Reinforcement Learning.
I am passionate about building generalizable robots and powerful decision models.
📍 Experience
- 2024.7-now: Research Assistant in Machine Intelligence Lab (MiLAB), Westlake University
- 2021.7-2024.7: Bachelor's degree, major in Automation, Xiamen University
- 2020.9-2021.7: Major in Biology, Xiamen University
🔥 News
- 2024.11: 🎉🎉 Homepage has been set up.
- 2024.09: 🎉🎉 Our paper “PT4Rec: A Universal Prompt-Tuning Framework for Graph Contrastive Learning-Based Recommendations” was accepted by ACML 2024 and the Machine Learning journal.
📝 Publications
Embodied AI & RL

Wei Xiao*, Shangke Lyu†, Zhefei Gong, Renjie Wang, Donglin Wang†
- Robotic Locomotion.

Renjie Wang*, Shangke Lyu†, Xin Lang, Wei Xiao, Donglin Wang†
- Robotic Locomotion.

Efficient Online RL Fine-Tuning with Offline Pre-trained Policy Only
Wei Xiao*, Jiacheng Liu, Zifeng Zhuang, Runze Suo, Shangke Lyu†, Donglin Wang†
- Online RL Fine-Tuning.

Uncertainty-Aware Planning: Mitigating Exploration Loss in Model-Based Reinforcement Learning
Runze Suo*, Zifeng Zhuang, Shangke Lyu, Xiao He, Wei Xiao, Ting Wang, Donglin Wang†
- Model-Based Reinforcement Learning.
Recommender System

Continuous-Time Sequential Recommendation with State Space Models
Wei Xiao, Huiying Wang, Qifeng Zhou†, Qing Wang

PT4Rec: A Universal Prompt-Tuning Framework for Graph Contrastive Learning-Based Recommendations
Wei Xiao*, Qifeng Zhou†
🎖 Honors and Awards

Tencent Kaiwu Reinforcement Learning Competition - 2023.12
Team: 南强至善 - Wei Xiao*, Yifang Lin, Jinyang Lai, Huaming Xu, Zejie Jiang, Yunlong Liu†
- Fourth Place (with a ¥20,000 bonus)
- Multi-Agent, Reinforcement Learning. Technical Report

The 17th National Smart Car Competition for University Students - 2022.07
Team: 南强至善 - Wei Xiao*, Tianhao Hu, Yuhang Liu, Jincai Luo†
- Second Prize in the South Region
- Computer Vision, PID Control. Technical Blog Video Video

- The 13th MathorCup Mathematical Modelling Competition, Third Prize.
- Huawei Software Elite Challenge, Third Prize.
- National Mathematical Modelling Competition for College Students, Second Prize in Fujian Province.
- National Algorithm Competition for College Students, Excellence Award.
- and so on.