vllm / docs / source (mirror of https://git.datalinker.icu/vllm-project/vllm.git)
Latest commit 705578ae14 by Simon Mo: [Docs] document that Meta Llama 3 is supported (#4175), 2024-04-18 10:55:48 -07:00
Name             Last commit                                                           Date
assets           fix document error for value and v_vec illustration (#3421)          2024-03-15 16:06:09 -07:00
dev              [Doc] Add docs about OpenAI compatible server (#3288)                 2024-03-18 22:05:34 -07:00
getting_started  [Doc][Installation] delete python setup.py develop (#3989)            2024-04-11 03:33:02 +00:00
models           [Docs] document that Meta Llama 3 is supported (#4175)                2024-04-18 10:55:48 -07:00
quantization     Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290)         2024-04-03 14:15:55 -07:00
serving          [Doc] Fix getting stared to use publicly available model (#3963)      2024-04-10 18:05:52 +00:00
conf.py          [Frontend] [Core] feat: Add model loading using tensorizer (#3476)    2024-04-13 17:13:01 -07:00
index.rst        Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290)         2024-04-03 14:15:55 -07:00
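The conf.py and index.rst entries indicate this is a Sphinx documentation tree. A minimal sketch of building it locally through Sphinx's Python API, assuming Sphinx is installed and that this listing corresponds to docs/source in a checkout of the repository (the docs/build output paths are assumptions, not taken from this page):

```python
# Minimal sketch: drive Sphinx's Python API directly to render the HTML docs.
# Assumes `pip install sphinx` and a checkout where this tree is docs/source.
from sphinx.application import Sphinx

app = Sphinx(
    srcdir="docs/source",             # contains index.rst and the subdirectories above
    confdir="docs/source",            # contains conf.py
    outdir="docs/build/html",         # assumed output location
    doctreedir="docs/build/doctrees", # assumed doctree cache location
    buildername="html",
)
app.build()  # writes the rendered HTML to outdir
```

Running `sphinx-build -b html docs/source docs/build/html` from the repository root would be the command-line equivalent.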
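The latest commit under models documents Meta Llama 3 support. A hedged usage sketch against vLLM's offline inference API as it existed around this commit; the Hugging Face model id meta-llama/Meta-Llama-3-8B-Instruct is an assumption (the gated official repo), not something stated on this page:

```python
# Minimal sketch using vLLM's offline inference API (vllm.LLM).
# Assumption: access to the gated meta-llama/Meta-Llama-3-8B-Instruct
# weights on the Hugging Face Hub.
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct")
sampling = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# generate() takes a list of prompts and returns one RequestOutput per prompt.
outputs = llm.generate(["The key idea behind paged attention is"], sampling)
for output in outputs:
    print(output.outputs[0].text)
```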