xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git, last synced 2026-04-17 20:37:10 +08:00
vllm/attention - History
Latest commit: 2410132bb1 by TJian, 2025-12-16 15:32:43 -08:00
[ROCm] [Bugfix] Fix torch sdpa hallucination (#30789)
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
backends       [Misc][PCP&DCP] relocate PCP feature check (#30050)  2025-12-11 03:36:18 -08:00
layers         [Attention] Cache attention metadata builds across hybrid KV-cache groups (#29627)  2025-12-16 17:10:16 -05:00
ops            [ROCm] [Bugfix] Fix torch sdpa hallucination (#30789)  2025-12-16 15:32:43 -08:00
utils          [Attention][UX][1/N] Add AttentionConfig and change attention env vars to CLI arguments (#26315)  2025-12-05 09:48:43 -08:00
__init__.py    [Attention] Remove imports from vllm/attention/__init__.py (#29342)  2025-11-26 10:53:15 -07:00
layer.py       [Bugfix] Fix ViT with FlashAttention on ROCm (#30703)  2025-12-15 19:45:21 +00:00
selector.py    [Platform] Refactor Platform attention backend selection to avoid breakpoint for OOT platform (#30212)  2025-12-15 17:36:07 +00:00
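
For readers working from a local clone rather than the web UI, the "last commit per path" column above can be reproduced with plain git. Below is a minimal Python sketch; the hard-coded path list simply mirrors the entries in this listing, and the output formatting is an illustrative assumption, not anything the mirror site itself runs.

import subprocess

# Entries from the vllm/attention listing above (illustrative, not exhaustive).
PATHS = [
    "vllm/attention/backends",
    "vllm/attention/layers",
    "vllm/attention/ops",
    "vllm/attention/utils",
    "vllm/attention/__init__.py",
    "vllm/attention/layer.py",
    "vllm/attention/selector.py",
]

def last_commit(path: str) -> str:
    """Return 'short-hash  subject  date' for the newest commit touching path."""
    out = subprocess.check_output(
        ["git", "log", "-1", "--format=%h  %s  %cd", "--date=iso", "--", path],
        text=True,
    )
    return out.strip()

if __name__ == "__main__":
    for path in PATHS:
        print(f"{path:32s} {last_commit(path)}")

Run from the repository root of a clone, this prints one line per entry in the same name / last commit / date layout shown above.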