vllm/source at 0580aab02ffe60fee50bddc80b787828eb233c44 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-01 04:03:31 +08:00

History

Hongxia Yang 0580aab02f

[ROCm] support Radeon™ 7900 series (gfx1100) without using flash-attention (#2768 )

2024-02-10 23:14:37 -08:00

..

Update README.md (#1292 )

2023-10-08 23:15:50 -07:00

[DOC] Add additional comments for LLMEngine and AsyncLLMEngine (#1011 )

2024-01-11 19:26:49 -08:00

getting_started

[ROCm] support Radeon™ 7900 series (gfx1100) without using flash-attention (#2768 )

2024-02-10 23:14:37 -08:00

Add Internlm2 (#2666 )

2024-02-01 09:27:40 -08:00

Support FP8-E5M2 KV Cache (#2279 )

2024-01-28 16:43:54 -08:00

docs: fix langchain (#2736 )

2024-02-03 18:17:55 -08:00

conf.py

[DOC] Add additional comments for LLMEngine and AsyncLLMEngine (#1011 )

2024-01-11 19:26:49 -08:00

index.rst

Bump up version to v0.3.0 (#2656 )

2024-01-31 00:07:07 -08:00