vllm/tests at 8e7a891602eb49d5e520e082148b1f021f9f801e - vllm

basic_correctness

[Core] Deprecate xformers (#29262 )

2025-11-24 04:18:55 +00:00

benchmarks

Feature/video support in random mm dataset (#25963 )

2025-10-29 18:24:52 +08:00

compile

[Frontend][torch.compile] CompilationConfig Overhaul (#20283 ): Set up -O infrastructure (#26847 )

2025-11-27 01:55:58 -08:00

config

[torch.compile] caching of config fields should be opt-out by default (#26468 )

2025-11-19 06:13:54 -08:00

cuda

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

detokenizer

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

distributed

[EPLB] Optimize EPLB for Async Rearrange Experts (#22179 )

2025-11-24 09:08:29 -05:00

engine

[Frontend][torch.compile] CompilationConfig Overhaul (#20283 ): Set up -O infrastructure (#26847 )

2025-11-27 01:55:58 -08:00

entrypoints

[CI] Add batched audios Whisper test (#29308 )

2025-11-27 19:31:52 +00:00

evals

Add output token counting to gsm8k eval (#28594 )

2025-11-14 09:32:03 +00:00

kernels

[Misc] Remove redundant attention var constants (#29650 )

2025-11-28 04:35:19 -08:00

kv_transfer

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

lora

[LoRA] Continue optimizing MoE LoRA weight loading (#29322 )

2025-11-27 05:56:28 -08:00

model_executor

[Frontend][torch.compile] CompilationConfig Overhaul (#20283 ): Set up -O infrastructure (#26847 )

2025-11-27 01:55:58 -08:00

models

[Misc] Remove redundant attention var constants (#29650 )

2025-11-28 04:35:19 -08:00

multimodal

[Bugfix] Handle broken frames in video loading (#29001 )

2025-11-20 04:38:12 +00:00

plugins

[V0 deprecation] Deprecate use_v1 parameter (#28112 )

2025-11-12 14:03:52 +00:00

plugins_tests

[Frontend][4/N] Improve all pooling task | Add plugin pooling task (#26973 )

2025-10-23 14:46:18 +00:00

prompts

…

quantization

[Bugfix] Make compressed-tensors MoEs respect ignored layers (#28878 )

2025-11-26 21:35:13 -05:00

reasoning

reasoning_content -> reasoning (#27752 )

2025-11-08 12:15:08 +00:00

rocm/aiter

[Bugfix] [ROCm] [AITER]: Fix aiter block quant not compatible with torch compile dynamo (#28716 )

2025-11-14 10:30:50 -08:00

samplers

[Core] Switch Flat logprob control from environment variable to SamplingParams (#28914 )

2025-11-19 02:10:02 +00:00

standalone_tests

[CI/Build] Move pre-commit only scripts to tools/pre_commit (#27657 )

2025-10-29 08:04:33 +00:00

system_messages

…

tokenization

[BUGFIX] MistralTokenizer._call__ adds an invalid EOS token (#29607 )

2025-11-28 16:44:47 +08:00

tool_use

[Frontend] Respect Chat Completion parallel_tool_calls param (#26233 )

2025-11-25 09:56:15 +00:00

tools

[CI/Build] Move pre-commit only scripts to tools/pre_commit (#27657 )

2025-10-29 08:04:33 +00:00

tpu

[Frontend][torch.compile] CompilationConfig Overhaul (#20283 ): name change compilation level to compilation mode, deprecation compilation level (#26355 )

2025-10-15 02:51:16 +00:00

transformers_utils

[Feature]: Improve GGUF loading from HuggingFace user experience like repo_id:quant_type (#29137 )

2025-11-25 14:28:53 +00:00

utils_

[Frontend][torch.compile] CompilationConfig Overhaul (#20283 ): Set up -O infrastructure (#26847 )

2025-11-27 01:55:58 -08:00

v1

[BugFix] Fix spec decoding max_tokens scheduling perf issue (#29542 )

2025-11-28 20:52:23 +08:00

vllm_test_utils

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

weight_loading

[ROCm][CI] Fix Weight Loading With Multiple GPU Tests on ROCm (#28984 )

2025-11-19 21:31:33 +00:00

__init__.py

…

ci_envs.py

[Model][0/N] Improve all pooling task | clean up (#25817 )

2025-10-13 16:44:50 +08:00

conftest.py

[BugFix] Fix chunked prompt logprobs + preemption (#29071 )

2025-11-22 16:07:18 -05:00

test_config.py

Improve enable chunked_prefill & prefix_caching logic. (#26623 )

2025-11-27 22:05:48 -08:00

test_embedded_commit.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_envs.py

[Misc] fix comment in test_envs (#28529 )

2025-11-14 09:32:46 +00:00

test_inputs.py

[Bugfix][Perf] Revert applying HF processor on text-only inputs for multimodal models (#28858 )

2025-11-17 14:49:25 +00:00

test_logger.py

[Misc] Colorize logs (#29017 )

2025-11-19 19:26:04 -05:00

test_logprobs.py

[Core] Switch Flat logprob control from environment variable to SamplingParams (#28914 )

2025-11-19 02:10:02 +00:00

test_outputs.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_pooling_params.py

[Frontend][Doc][5/N] Improve all pooling task | Polish encode (pooling) api & Document. (#25524 )

2025-10-30 12:13:05 +00:00

test_regression.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_routing_simulator.py

[MoE][Refactor] Make select_experts a non-static method (#29067 )

2025-11-24 13:38:04 -05:00

test_scalartype.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_seed_behavior.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_sequence.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_triton_utils.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_version.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_vllm_port.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

utils.py

[chore] Move the rest of wikimedia url to S3 (#28921 )

2025-11-18 09:44:18 -08:00