vllm/tests at 9d9072a069202e7892a40ef94e9085019e73f370 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-25 02:27:15 +08:00

History

Zhuohan Li 9d9072a069

Implement prompt logprobs & Batched topk for computing logprobs (#1328 )

Co-authored-by: Yunmo Chen <16273544+wanmok@users.noreply.github.com>

2023-10-16 10:56:50 -07:00

..

Implement prompt logprobs & Batched topk for computing logprobs (#1328 )

2023-10-16 10:56:50 -07:00

TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181 )

2023-10-02 15:36:09 -07:00

TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181 )

2023-10-02 15:36:09 -07:00

Implement PagedAttention V2 (#1348 )

2023-10-16 00:59:57 -07:00

Add tests for models (#922 )

2023-09-01 11:19:43 +09:00

Implement prompt logprobs & Batched topk for computing logprobs (#1328 )

2023-10-16 10:56:50 -07:00

conftest.py

Implement prompt logprobs & Batched topk for computing logprobs (#1328 )

2023-10-16 10:56:50 -07:00