I’m Wei Xiao (肖巍), currently a Research Assistant at MiLAB, Westlake University, advised by Donglin Wang. Before that, I received my bachelor’s degree in Automation from Xiamen University, advised by Qifeng Zhou.

I am looking for research collaborations and a Ph.D./MPhil position starting in 2025/2026. Please drop me an email if you are interested in my research or just want to chat!

Email: xiaowei2002103@foxmail.com / xiaowei@westlake.edu.cn

My research interests include Embodied AI and Reinforcement Learning.

I am passionate about building generalizable robots and powerful decision models.

📍 Experience

  • 2024.7-now: Research Assistant at the Machine Intelligence Lab (MiLAB), Westlake University

  • 2021.7-2024.7: Bachelor’s degree, major in Automation, Xiamen University

  • 2020.9-2021.7: Major in Biology, Xiamen University

🔥 News

  • 2024.11: 🎉🎉 Homepage has been set up.
  • 2024.09: 🎉🎉 Our paper “PT4Rec: A Universal Prompt-Tuning Framework for Graph Contrastive Learning-Based Recommendations” was accepted by ACML 2024 and the Machine Learning Journal.

📝 Publications

Embodied AI & RL

(under review)

Learning Robotic Policy with Imagined Transition: Mitigating the Trade-off between Robustness and Optimality

Wei Xiao*, Shangke Lyu†, Zhefei Gong, Renjie Wang, Donglin Wang†

  • Robotic Locomotion.
(under review)

Integrating Trajectory Optimization and Reinforcement Learning for Quadrupedal Jumping with Terrain-Adaptive Landing

Renjie Wang*, Shangke Lyu†, Xin Lang, Wei Xiao, Donglin Wang†

  • Robotic Locomotion.
(under review)

Efficient Online RL Fine-Tuning with Offline Pre-trained Policy Only

Wei Xiao*, Jiacheng Liu, Zifeng Zhuang, Runze Suo, Shangke Lyu†, Donglin Wang†

  • Online RL Fine-Tuning.
(under review)

Uncertainty-Aware Planning: Mitigating Exploration Loss in Model-Based Reinforcement Learning

Runze Suo*, Zifeng Zhuang, Shangke Lyu, Xiao He, Wei Xiao, Ting Wang, Donglin Wang†

  • Model-Based Reinforcement Learning.

Recommender System

(under review)

Continuous-Time Sequential Recommendation with State Space Models

Wei Xiao, Huiying Wang, Qifeng Zhou†, Qing Wang

ACML 2024

PT4Rec: A Universal Prompt-Tuning Framework for Graph Contrastive Learning-Based Recommendations

Wei Xiao*, Qifeng Zhou†

  • GNN, Contrastive Learning, Recommender system.
  • Accepted by the 16th Asian Conference on Machine Learning (ACML 2024) and the Machine Learning Journal (CCF-B). Paper | Code

🎖 Honors and Awards

Tencent

Tencent Kaiwu Reinforcement Learning Competition - 2023.12

Team: 南强至善 - Wei Xiao*, Yifang Lin, Jinyang Lai, Huaming Xu, Zejie Jiang, Yunlong Liu†

  • Fourth Place (with Bonus ¥20,000)
  • Multi-Agent, Reinforcement Learning. Technical Report
CAA

The 17th National Smart Car Competition for University Students - 2022.07

Team: 南强至善 - Wei Xiao*, Tianhao Hu, Yuhang Liu, Jincai Luo†

  • The 13th MathorCup Mathematical Modelling Competition, Third Prize.
  • Huawei Software Elite Challenge, Third Prize.
  • National Mathematical Modelling Competition for College Students, Second Prize in Fujian Province.
  • National Algorithm Competition for College Students, Excellence Award.
  • and so on.
