This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-05-03 19:11:19 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
model_executor
/
layers
/
fused_moe
History
Thomas Parnell
9a7e2d0534
[Bugfix] Allow vllm to still work if triton is not installed. (
#6786
)
...
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
2024-07-29 14:51:27 -07:00
..
configs
Unmark fused_moe config json file as executable (
#5960
)
2024-06-28 06:36:12 -07:00
__init__.py
[Bugfix] Allow vllm to still work if triton is not installed. (
#6786
)
2024-07-29 14:51:27 -07:00
fused_moe.py
[ Misc ] Apply MoE Refactor to Deepseekv2 To Support Fp8 (
#6417
)
2024-07-13 20:03:58 -07:00
layer.py
[ Misc ]
fbgemm
checkpoints (
#6559
)
2024-07-20 09:36:57 -07:00
moe_pallas.py
[Hardware][TPU] Support MoE with Pallas GMM kernel (
#6457
)
2024-07-16 09:56:28 -07:00