Logo
Explore Help
Sign In
xinyun/vllm
1
0
Fork 0
You've already forked vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-21 19:15:02 +08:00
Code Issues Packages Projects Releases Wiki Activity
vllm/vllm/model_executor
History
Cyrus Leung df76e5af26
[VLM] Simplify post-processing of replacement info (#12269)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2025-01-21 16:48:13 -08:00
..
guided_decoding
[bugfix] catch xgrammar unsupported array constraints (#12210)
2025-01-20 16:42:02 -08:00
layers
[BugFix] Fix GGUF tp>1 when vocab_size is not divisible by 64 (#12230)
2025-01-21 12:23:14 +08:00
model_loader
[Core] Interface for accessing model from VllmRunner (#10353)
2025-01-20 15:00:59 +08:00
models
[VLM] Simplify post-processing of replacement info (#12269)
2025-01-21 16:48:13 -08:00
__init__.py
[Performance] Optimize e2e overheads: Reduce python allocations (#7162)
2024-08-08 21:34:28 -07:00
custom_op.py
[platform] support pytorch custom op pluggable (#11328)
2025-01-10 10:02:38 +00:00
parameter.py
[Misc][Quark] Upstream Quark format to VLLM (#10765)
2025-01-15 11:05:15 -05:00
pooling_metadata.py
[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (#3734)
2024-05-11 11:30:37 -07:00
sampling_metadata.py
[Misc] typo find in sampling_metadata.py (#10740)
2024-11-29 05:17:57 +00:00
utils.py
[platforms] enable platform plugins (#11602)
2024-12-30 20:24:45 +08:00
Powered by Gitea Version: 1.23.1 Page: 135ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API