Pinned
-
llm-reasoners (Public, forked from maitrix-org/llm-reasoners)
A library for advanced large language model reasoning
Python
-
tdmpc2 (Public, forked from nicklashansen/tdmpc2)
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
Python
-
LLM360/Reasoning360 (Public)
A repo for open research on building large reasoning models
-
ucsd-wang-lab-lm/tips (Public)
TIPS: Turn-level Information-Potential Reward Shaping for Search-Augmented LLMs
Python 1
-
compute-optimal-rl-llm-scaling/compute-optimal-rl-llm-scaling.github.io (Public)
Website
HTML

