
Pull requests: ml-explore/mlx-lm

Pull requests list

Expose tunable Metal memory limits in the trainer
#1312 opened May 26, 2026 by kru2710shna
Adding --kv_bits as parameter.
#1309 opened May 24, 2026 by Wolfbane1
feat(models): warn when MTP weights are discarded at load
#1306 opened May 24, 2026 by kru2710shna
4 tasks done
Fix _rstrip_until ValueError when until list is empty
#1305 opened May 23, 2026 by hadoobi
Log detected tool parser on server model load
#1295 opened May 21, 2026 by robertlangdonn
Add Cohere2 MoE (Command A+) model support
#1294 opened May 21, 2026 by eauchs
2 of 5 tasks
Fix KeyError: 'name' in qwen3_coder tool parser
#1289 opened May 19, 2026 by DShickle
Fix tokenizer test failure
#1287 opened May 19, 2026 by zcbenz Collaborator
[mlx_lm] Expose 'strict' parameter in load() function
#1284 opened May 18, 2026 by zyguy
Add per-request prompt cache files to server
#1283 opened May 18, 2026 by Quiet-Node-io
Add timings to server responses
#1279 opened May 16, 2026 by spicyneuron Contributor
Restrict think-state scan to assistant prefill tail
#1277 opened May 15, 2026 by eilidhmae
Add Gemma 4 assistant (MTP drafter) model class
#1276 opened May 14, 2026 by broomva