This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-05-05 15:39:09 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
attention
/
backends
History
Nicolò Lucchesi
066209a045
[Attention] Refactor FA
block_size
limitations to hybrid models only (
#29084
)
...
Signed-off-by: NickLucche <nlucches@redhat.com>
2025-11-22 06:38:44 -08:00
..
__init__.py
…
abstract.py
[Attention] Refactor FA
block_size
limitations to hybrid models only (
#29084
)
2025-11-22 06:38:44 -08:00
registry.py
[Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry (
#29103
)
2025-11-20 20:24:43 -08:00
utils.py
Update
Optional[x]
->
x | None
and
Union[x, y]
to
x | y
(
#26633
)
2025-10-12 09:51:31 -07:00