vllm/model_executor at e0dd4d358969144dae3592fd265dea002579a600 - vllm - 丝路新云 Code Repository

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-24 15:07:19 +08:00

youkaichao c391e4b68e

[Core] improve robustness of pynccl (#3860)

2024-04-04 16:52:12 -07:00

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290)

2024-04-03 14:15:55 -07:00

[Model] Cohere CommandR+ (#3829)

2024-04-04 13:31:49 -07:00

[Core] improve robustness of pynccl (#3860)

2024-04-04 16:52:12 -07:00

__init__.py

[Core] Refactor Attention Take 2 (#3462)

2024-03-25 04:39:33 +00:00

guided_decoding.py

[CI] Try introducing isort (#3495)

2024-03-25 07:59:47 -07:00

guided_logits_processors.py

[CI] Try introducing isort (#3495)

2024-03-25 07:59:47 -07:00

model_loader.py

Usage Stats Collection (#2852)

2024-03-28 22:16:12 -07:00

neuron_model_loader.py

[CI] Try introducing isort (#3495)

2024-03-25 07:59:47 -07:00

sampling_metadata.py

[CI] Try introducing isort (#3495)

2024-03-25 07:59:47 -07:00

utils.py

[Hardware][Neuron] Refactor neuron support (#3471)

2024-03-22 01:22:17 +00:00

weight_utils.py

[Core] Enable hf_transfer by default if available (#3817)

2024-04-04 04:02:43 +00:00