-
Notifications
You must be signed in to change notification settings - Fork 62
Pull requests: vllm-project/tpu-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Precompile functions with large vocab_size tensors before allocating KV cache to avoid OOM
ready
ONLY add when PR is ready to merge/full CI is needed
#1341
opened Dec 19, 2025 by
wenxindongwork
Loading…
[DP] Reduce DP scheduling overhead via multiprocessing
#1340
opened Dec 19, 2025 by
wenxindongwork
•
Draft
Add appache license.
ready
ONLY add when PR is ready to merge/full CI is needed
#1339
opened Dec 19, 2025 by
QiliangCui
Loading…
Use Topology Order to map KV cache P/D mapping
ready
ONLY add when PR is ready to merge/full CI is needed
#1338
opened Dec 19, 2025 by
mrjunwan-lang
Loading…
[Misc] Fix tpu platform init failure when vllm_config is not fully initialized
#1335
opened Dec 19, 2025 by
sixiang-google
Loading…
[Refactoring] use ONLY add when PR is ready to merge/full CI is needed
jax.shard_map instead of experimental one
ready
#1334
opened Dec 18, 2025 by
lk-chen
Loading…
2 of 3 tasks
[DeepSeek] Support TPU-Friendly Checkpoints + Add DeepSeek Testing
#1332
opened Dec 18, 2025 by
jrplatin
Loading…
[Kernel] Simplify the MLA bkv loading logic
ready
ONLY add when PR is ready to merge/full CI is needed
#1331
opened Dec 18, 2025 by
yaochengji
Loading…
Attention DP for Torchax backend
ready
ONLY add when PR is ready to merge/full CI is needed
#1322
opened Dec 16, 2025 by
wenxindongwork
Loading…
add testcases to validate both prompt size less and greater than CPU RAM
#1317
opened Dec 15, 2025 by
Sneha-at
Loading…
[CI] This PR enhances testing of the CI procedures on both v6e and v7x.
ready
ONLY add when PR is ready to merge/full CI is needed
#1311
opened Dec 15, 2025 by
dennisYehCienet
Loading…
Refactor tuning for RPA HD64 kernel tuning to improve RPA kernel throughput
ready
ONLY add when PR is ready to merge/full CI is needed
#1308
opened Dec 14, 2025 by
helloworld1
Loading…
Allow pytest to correctly discover all tests
ready
ONLY add when PR is ready to merge/full CI is needed
#1303
opened Dec 12, 2025 by
wdhongtw
Loading…
[do not merge ]Get all change files instead of last commit when bootstrap.
ready
ONLY add when PR is ready to merge/full CI is needed
#1299
opened Dec 12, 2025 by
QiliangCui
Loading…
[test do not review]
ready
ONLY add when PR is ready to merge/full CI is needed
#1298
opened Dec 12, 2025 by
QiliangCui
Loading…
[JAX][MoE] Integrate multiple MoE kernels in MoE modules
#1287
opened Dec 11, 2025 by
bzgoogle
Loading…
[multihost] Integrate expert parallelism to RayExecutor
ready
ONLY add when PR is ready to merge/full CI is needed
#1282
opened Dec 10, 2025 by
Lumosis
Loading…
[multihost] Use make_array_from_process_local_data to create global array instead of device_put
#1281
opened Dec 10, 2025 by
Lumosis
Loading…
[do not review][do not submit]
ready
ONLY add when PR is ready to merge/full CI is needed
#1277
opened Dec 10, 2025 by
QiliangCui
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.