xinyun / vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2025-12-24 17:16:20 +08:00)
vllm / vllm / attention
Latest commit 7da296be04 by Chengji Yao: [TPU] kv cache update kernel supports dynamic grid (#20235)
Signed-off-by: Chengji Yao <chengjiyao@google.com>
2025-07-02 06:33:37 +00:00
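The commit above makes the TPU KV-cache update kernel size its grid from the number of slices in the current call rather than a fixed worst-case grid. The sketch below is a minimal, hypothetical illustration of that pattern in JAX Pallas; it is not vLLM's actual kernel, and `_copy_slice_kernel`, `update_kv_slices`, and the 2-D slice layout are invented for the example.

```python
# Minimal sketch of a "dynamic grid": the Pallas grid is derived from the
# number of KV slices actually being written, instead of a padded maximum.
# Illustrative only; the real kernel in #20235 is more involved.
import jax
from jax.experimental import pallas as pl


def _copy_slice_kernel(src_ref, dst_ref):
    # Each grid step copies one (1, slice_len) block of new KV data.
    dst_ref[...] = src_ref[...]


def update_kv_slices(new_kv: jax.Array) -> jax.Array:
    num_slices, slice_len = new_kv.shape  # grid size depends on this input
    return pl.pallas_call(
        _copy_slice_kernel,
        grid=(num_slices,),  # one grid step per slice, sized per call
        in_specs=[pl.BlockSpec((1, slice_len), lambda i: (i, 0))],
        out_specs=pl.BlockSpec((1, slice_len), lambda i: (i, 0)),
        out_shape=jax.ShapeDtypeStruct(new_kv.shape, new_kv.dtype),
    )(new_kv)
```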
backends     | [Kernel] mark TorchSDPABackend swap_blocks NotImplementedError (#19749) (sketch below)                                    | 2025-06-20 18:18:11 +00:00
ops          | [TPU] kv cache update kernel supports dynamic grid (#20235)                                                               | 2025-07-02 06:33:37 +00:00
utils        | Quick Fix by adding conditional import for flash_attn_varlen_func in flash_attn (#20143) (sketch below)                   | 2025-06-27 05:48:13 +00:00
__init__.py  | [Misc] Add SPDX-FileCopyrightText (#19100)                                                                                | 2025-06-03 11:20:17 -07:00
layer.py     | [Bugfix][V1][ROCm] Fix AITER Flash Attention Backend (Fix API Break and Local Attention Logic: affecting Llama4) (#19904) | 2025-06-26 12:42:31 +00:00
selector.py  | [Misc] Add SPDX-FileCopyrightText (#19100)                                                                                | 2025-06-03 11:20:17 -07:00
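Two of the commits above name small, self-contained patterns. For #19749, "marking" swap_blocks NotImplementedError means the Torch SDPA backend fails loudly when asked to swap KV-cache blocks it cannot move. A hedged sketch follows; the class name matches the commit message, but the method signature and body are simplified assumptions, not vLLM's exact API.

```python
# Sketch of the pattern behind #19749: a backend that cannot swap KV-cache
# blocks raises NotImplementedError instead of silently doing nothing.
# The signature here is a simplified assumption.
import torch


class TorchSDPABackend:

    @staticmethod
    def swap_blocks(
        src_kv_cache: torch.Tensor,
        dst_kv_cache: torch.Tensor,
        src_to_dst: torch.Tensor,
    ) -> None:
        # Swapping blocks between caches is unsupported for this backend,
        # so fail explicitly rather than corrupting or ignoring the request.
        raise NotImplementedError(
            "swap_blocks is not implemented for TorchSDPABackend.")
```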
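For #20143, the "conditional import" is the standard optional-dependency guard: try to import the compiled flash-attention kernel and fall back to None so the module still imports on platforms where the extension isn't built. The import path below is an assumption about vLLM's layout, not a confirmed detail of the patch.

```python
# Sketch of the conditional-import guard behind #20143. The import path is
# an assumption; the point is that the module stays importable when the
# flash-attn extension is not available on the current platform.
try:
    from vllm.vllm_flash_attn import flash_attn_varlen_func
except ImportError:
    flash_attn_varlen_func = None  # extension unavailable on this platform


def has_varlen_flash_attn() -> bool:
    """Return True if the variable-length flash-attention kernel is usable."""
    return flash_attn_varlen_func is not None
```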