xinyun/vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2025-12-23 02:15:01 +08:00
vllm/vllm/model_executor/layers/fused_moe
Latest commit: Cyrus Leung, fa6ecb9aa7, [Model] Clean up MiniCPMV (#10751), 2024-11-29 04:47:06 +00:00
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
configs              [Kernel] adding fused moe kernel config for L40S TP4 (#9245)           2024-10-11 08:54:22 -07:00
__init__.py          [torch.compile] support moe models (#9632)                             2024-10-27 21:58:04 -07:00
fused_marlin_moe.py  [torch.compile] directly register custom op (#9896)                    2024-10-31 21:56:09 -07:00
fused_moe.py         [BugFix] [Kernel] Fix GPU SEGV occuring in fused_moe kernel (#10385)   2024-11-16 09:55:05 +00:00
layer.py             [Model] Clean up MiniCPMV (#10751)                                     2024-11-29 04:47:06 +00:00
moe_pallas.py        [Hardware][TPU] Support MoE with Pallas GMM kernel (#6457)             2024-07-16 09:56:28 -07:00