πββοΈ I build agentic AI at Alibaba. I was the chief scientist at a startup (raised more than 50M$), previously worked at JD Explore Academy and Tencent AI Lab, and held an adjunct researcher position at ZJU.
π Working on the whole pipeline of LLM R&D and their human-centric applications, including efficient and sufficient training, alignment, evaluations, compression, multilinguality, multimodality, agentic application, and much more.
πͺ I'm keen on bodybuilding (5 years+), marathon (completed first half marathon (126min) in Beijing-2016 and most recent half marathon (86min) in Sydney-2019π . will resume training in 2024πͺπ»).
π₯ I (onceπ ) enjoy cooking.
π I like to spend Sundays with my cats (two from 2020-2023, one from 2023).
π₯ Recent open-source projects on agentic AI, together covering data generation, reuse, evaluation, and context efficiency:
- π AgentHER Hindsight relabeling of failed trajectories for training.
- 𧬠AgentSynth Synthetic agent data from scratch with execution validation.
- π AdaRubric Dynamic rubric evaluation for trajectory quality.
- ποΈ trajectory_tokenization ReAct with compressed history for long-horizon context.



