This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-10 08:04:58 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
History
Zhuohan Li
9d9072a069
Implement prompt logprobs & Batched topk for computing logprobs (
#1328
)
...
Co-authored-by: Yunmo Chen <16273544+wanmok@users.noreply.github.com>
2023-10-16 10:56:50 -07:00
..
async_engine
Implement prompt logprobs & Batched topk for computing logprobs (
#1328
)
2023-10-16 10:56:50 -07:00
distributed
TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (
#1181
)
2023-10-02 15:36:09 -07:00
engine
TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (
#1181
)
2023-10-02 15:36:09 -07:00
kernels
Implement PagedAttention V2 (
#1348
)
2023-10-16 00:59:57 -07:00
models
Add tests for models (
#922
)
2023-09-01 11:19:43 +09:00
samplers
Implement prompt logprobs & Batched topk for computing logprobs (
#1328
)
2023-10-16 10:56:50 -07:00
conftest.py
Implement prompt logprobs & Batched topk for computing logprobs (
#1328
)
2023-10-16 10:56:50 -07:00