vllm/attention at 413ef7a3b4d8722b8677f15e2320604a5a3d3b69 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-20 18:47:07 +08:00

History

Lucas Kabela 94666612a9

[Misc][qwen2_5_vl][torch.compile] Enable supports_torch_compile on generic nn.Module and demonstrate speedup on Qwen Vision model (#23207 )

Signed-off-by: Lucas Kabela <lucaskabela@meta.com>
Signed-off-by: Lucas Kabela <lucasakabela@gmail.com>

2025-10-28 22:36:43 +00:00

..

[Chore] Separate out vllm.utils.importlib (#27022 )

2025-10-17 00:48:59 +00:00

[Chore]:Extract math and argparse utilities to separate modules (#27188 )

2025-10-26 04:03:32 -07:00

[Misc][qwen2_5_vl][torch.compile] Enable supports_torch_compile on generic nn.Module and demonstrate speedup on Qwen Vision model (#23207 )

2025-10-28 22:36:43 +00:00

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

__init__.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

layer.py

[BUGFIX][ROCM] ViT FlashAttention on ROCm (no GFX9) and contiguous on qwen3vl ROCm TORCH_SDPA (#27190 )

2025-10-26 15:08:52 +08:00

selector.py

[Chore] Separate out vllm.utils.importlib (#27022 )

2025-10-17 00:48:59 +00:00