This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-15 03:54:29 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
core
History
Nicolò Lucchesi
9d43afcc53
[Feature] [Spec decode]: Combine chunked prefill with speculative decoding (
#9291
)
...
Signed-off-by: NickLucche <nlucches@redhat.com>
2024-11-07 08:15:14 -08:00
..
block
[Hardware][Intel-Gaudi] Add Intel Gaudi (HPU) inference backend (
#6143
)
2024-11-06 01:09:10 -08:00
__init__.py
Change the name to vLLM (
#150
)
2023-06-17 03:07:40 -07:00
block_manager.py
[Core] Deprecating block manager v1 and make block manager v2 default (
#8704
)
2024-10-17 11:38:15 -05:00
evictor.py
[CI/Build] drop support for Python 3.8 EOL (
#8464
)
2024-11-06 07:11:55 +00:00
interfaces.py
[Core] Deprecating block manager v1 and make block manager v2 default (
#8704
)
2024-10-17 11:38:15 -05:00
placeholder_block_space_manager.py
[Model] Support Mamba (
#6484
)
2024-10-11 15:40:06 +00:00
scheduler.py
[Feature] [Spec decode]: Combine chunked prefill with speculative decoding (
#9291
)
2024-11-07 08:15:14 -08:00