AndreasXie

Follow

🦢

Focusing

YutaoXie AndreasXie

🦢

Focusing

Follow

5 followers · 7 following

https://andreasxie.github.io/YutaoXie.github.io/

Achievements

Achievements

Highlights

Pro

Pinned Loading

llm-reasoners llm-reasoners Public

Forked from maitrix-org/llm-reasoners

A library for advanced large language model reasoning

Python
tdmpc2 tdmpc2 Public

Forked from nicklashansen/tdmpc2

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Python
tdmpc2_discrete tdmpc2_discrete Public

Python
LLM360/Reasoning360 LLM360/Reasoning360 Public

A repo for open research on building large reasoning models

Python 148 18
ucsd-wang-lab-lm/tips ucsd-wang-lab-lm/tips Public

TIPS: Turn-level Information-Potential Reward Shaping for Search-Augmented LLMs

Python 1
compute-optimal-rl-llm-scaling/compute-optimal-rl-llm-scaling.github.io compute-optimal-rl-llm-scaling/compute-optimal-rl-llm-scaling.github.io Public

Website

HTML