xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-01-18 08:34:28 +08:00)
vllm/vllm/model_executor/layers/fused_moe
Latest commit: a66cf40b20 by Divakar Verma, 2024-06-02 14:13:26 -07:00
[Kernel][ROCm][AMD] enable fused topk_softmax kernel for moe layer (#4927)
This PR enables the fused topk_softmax kernel used in moe layer for HIP
..
configs  [Model] Enable FP8 QKV in MoE and refine kernel tuning script (#5039)  2024-05-31 14:29:19 -07:00
__init__.py  [Model] Snowflake arctic model implementation (#4652)  2024-05-09 22:37:14 +00:00
fused_moe.py  [Kernel][ROCm][AMD] enable fused topk_softmax kernel for moe layer (#4927)  2024-06-02 14:13:26 -07:00