
Pull requests: vllm-project/tpu-inference


Pull requests list

Gmm fp4 qwen coder
#1320 opened Dec 16, 2025 by coolkp (Draft)
Fix lora layer unit tests for v7x2. (label: ready)
#1319 opened Dec 16, 2025 by vanbasten23
[CI] This PR enhances testing of the CI procedures on both v6e and v7x. (label: ready)
#1311 opened Dec 15, 2025 by dennisYehCienet
Refactor tuning for RPA HD64 kernel tuning to improve RPA kernel throughput (label: ready)
#1308 opened Dec 14, 2025 by helloworld1
[Torchax] fp8 quantization skeleton
#1307 opened Dec 14, 2025 by xingliu14
Allow pytest to correctly discover all tests
#1303 opened Dec 12, 2025 by wdhongtw
[do not merge ]Get all change files instead of last commit when bootstrap. (label: ready)
#1299 opened Dec 12, 2025 by QiliangCui
[test do not review] (label: ready)
#1298 opened Dec 12, 2025 by QiliangCui
[DRAFT] [DP][Bugfix] Fix bad sharding in non_dp case.
#1288 opened Dec 12, 2025 by py4
[multihost] Integrate expert parallelism to RayExecutor (label: ready)
#1282 opened Dec 10, 2025 by Lumosis
[do not review][do not submit] (label: ready)
#1277 opened Dec 10, 2025 by QiliangCui
Move the If nightly==1 check out of command.
#1276 opened Dec 10, 2025 by QiliangCui
add new kernel and quantization support matrices
#1275 opened Dec 10, 2025 by boe20211
docs: update support matrices and improve visuals
#1250 opened Dec 5, 2025 by RobMulla
Add workflow to build vLLM-TPU wheel using PyPI tpu-inference (label: ready)
#1241 opened Dec 4, 2025 by ylangtsou (Draft)
[CI] Fix awq dtype (label: ready)
#1220 opened Dec 2, 2025 by kyuyeunk
[Oncall] update the SchedulerConfig interface
#1219 opened Dec 2, 2025 by bzgoogle
Add a SP e2e test.
#1209 opened Dec 2, 2025 by vanbasten23
Save size in scalar scratch for bo and bq (label: ready)
#1201 opened Dec 1, 2025 by rupengliu-meta