This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-19 01:37:11 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
spec_decode
/
e2e
History
Abhinav Goyal
2416b26e11
[Speculative Decoding] Medusa Implementation with Top-1 proposer (
#4978
)
2024-07-09 18:34:02 -07:00
..
__init__.py
…
conftest.py
[CORE] Adding support for insertion of soft-tuned prompts (
#4645
)
2024-07-09 13:26:36 -07:00
test_compatibility.py
…
test_integration_dist_tp2.py
[Speculative Decoding] MLPSpeculator Tensor Parallel support (1/2) (
#6050
)
2024-07-02 07:20:29 -07:00
test_integration_dist_tp4.py
[Speculative Decoding] Support draft model on different tensor-parallel size than target model (
#5414
)
2024-06-25 09:56:06 +00:00
test_integration.py
…
test_logprobs.py
…
test_medusa_correctness.py
[Speculative Decoding] Medusa Implementation with Top-1 proposer (
#4978
)
2024-07-09 18:34:02 -07:00
test_mlp_correctness.py
[CORE] Quantized lm-head Framework (
#4442
)
2024-07-02 22:25:17 +00:00
test_multistep_correctness.py
[Speculative Decoding 2/2 ] Integrate typical acceptance sampler into Spec Decode Worker (
#5348
)
2024-07-01 00:33:05 -07:00
test_ngram_correctness.py
…