xinyun/vllm (mirror of https://git.datalinker.icu/vllm-project/vllm.git)
vllm/vllm/attention
Latest commit: 6832707e90 by Michael Goin (2025-03-06 14:18:29 -08:00)
[V1][Bugfix] Standardize quantized kv cache rejection for attention backends (#14221)
Signed-off-by: mgoin <mgoin64@gmail.com>
Name          Last commit                                                                               Date
backends      [V1][Bugfix] Standardize quantized kv cache rejection for attention backends (#14221)    2025-03-06
ops           Add authors to license header. (#14371)                                                   2025-03-06
__init__.py   [Attention] MLA with chunked prefill (#12639)                                              2025-02-21
layer.py      [Bug] Fix Attention when ignored in by quant_method (#14313)                               2025-03-06
selector.py   [Misc] Add SPDX-License-Identifier headers to python source files (#12628)                 2025-02-02