vllm/quantization at 2b7949c1c2e34de41d9cfc84dd0e377cc6bd58c2 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-08-01 01:14:31 +08:00

History

Co-authored-by: mgoin <michael@neuralmagic.com>

2024-04-23 13:59:33 -04:00

AQLM CUDA support (#3287 )

2024-04-23 13:59:33 -04:00

2024-02-12 11:02:17 -08:00

2024-04-03 14:15:55 -07:00

2024-02-01 09:35:09 -08:00

2024-04-11 16:35:51 -04:00

2024-03-01 12:47:51 -08:00

2024-01-03 09:52:29 -08:00