Pull requests: vllm-project/tpu-inference

Pull requests list

[Kernel][FusedMoe] Fix sync-barrier caused crash
#1342 opened Dec 19, 2025 by bythew3i
Precompile functions with large vocab_size tensors before allocating KV cache to avoid OOM (label: ready)
#1341 opened Dec 19, 2025 by wenxindongwork
Add appache license. (label: ready)
#1339 opened Dec 19, 2025 by QiliangCui
Use Topology Order to map KV cache P/D mapping (label: ready)
#1338 opened Dec 19, 2025 by mrjunwan-lang
[CI] Add a vllm upstream integration pipeline
#1337 opened Dec 19, 2025 by weiyu0824
Add GPQA Eval to Benchmarking
#1336 opened Dec 19, 2025 by AahilA
[Refactoring] use jax.shard_map instead of experimental one (label: ready)
#1334 opened Dec 18, 2025 by lk-chen
2 of 3 tasks
[Kernel] Simplify the MLA bkv loading logic (label: ready)
#1331 opened Dec 18, 2025 by yaochengji
WIP: Pd matching
#1329 opened Dec 17, 2025 by richardsliu Draft
Apply temperature scaling before top-p
#1327 opened Dec 17, 2025 by oliverdutton
Attention DP for Torchax backend (label: ready)
#1322 opened Dec 16, 2025 by wenxindongwork
[CI] This PR enhances testing of the CI procedures on both v6e and v7x. (label: ready)
#1311 opened Dec 15, 2025 by dennisYehCienet
Refactor tuning for RPA HD64 kernel tuning to improve RPA kernel throughput (label: ready)
#1308 opened Dec 14, 2025 by helloworld1
Allow pytest to correctly discover all tests (label: ready)
#1303 opened Dec 12, 2025 by wdhongtw
[do not merge ]Get all change files instead of last commit when bootstrap. (label: ready)
#1299 opened Dec 12, 2025 by QiliangCui
[test do not review] (label: ready)
#1298 opened Dec 12, 2025 by QiliangCui
[DRAFT] [DP][Bugfix] Fix bad sharding in non_dp case.
#1288 opened Dec 12, 2025 by py4
[multihost] Integrate expert parallelism to RayExecutor (label: ready)
#1282 opened Dec 10, 2025 by Lumosis
[do not review][do not submit] (label: ready)
#1277 opened Dec 10, 2025 by QiliangCui