xinyun/vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-06-07 01:15:42 +08:00
vllm/vllm/model_executor/layers/mamba
Asaf Joseph Gardin  46a13949d5  [v1] - Mamba1 Attention Metadata (#21249)  2025-08-06 17:03:42 -07:00
Signed-off-by: asafg <asafg@ai21.com> Co-authored-by: asafg <asafg@ai21.com>
ops                 [Model] Mamba2 preallocate SSM output tensor to avoid d2d copy overhead (#21075)  2025-08-02 01:59:34 -07:00
__init__.py         [Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (#7651)  2024-08-28 15:06:52 -07:00
abstract.py         [v1][mamba] Added mamba_type into MambaSpec (#21715)  2025-07-28 08:15:55 +00:00
mamba2_metadata.py  [Kernel] Triton implementation of causal-conv1d for Mamba-based models (#18218)  2025-07-09 12:53:55 -07:00
mamba_mixer2.py     [v1] - Mamba1 Attention Metadata (#21249)  2025-08-06 17:03:42 -07:00
mamba_mixer.py      [v1] - Mamba1 Attention Metadata (#21249)  2025-08-06 17:03:42 -07:00
mamba_utils.py      [v1] - Mamba1 Attention Metadata (#21249)  2025-08-06 17:03:42 -07:00