xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-01-12 02:24:29 +08:00)
vllm/examples/offline_inference
Cyrus Leung
8896eb72eb
[Deprecation] Remove prompt_token_ids arg fallback in LLM.generate and LLM.embed (#18800)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2025-08-22 10:56:57 +08:00
basic
[Kernel/Quant] Remove AQLM (#22943)
2025-08-16 19:38:21 +00:00
disaggregated-prefill-v1
…
openai_batch
…
profiling_tpu
…
qwen2_5_omni
…
async_llm_streaming.py
…
audio_language.py
…
automatic_prefix_caching.py
…
batch_llm_inference.py
…
chat_with_tools.py
…
context_extension.py
…
convert_model_to_seq_cls.py
…
data_parallel.py
[Kernels] Clean up FusedMoeMethodBase and modular kernel setup. Remove extra arguments from modular kernel methods. (#22035)
2025-08-15 14:46:00 -04:00
disaggregated_prefill.py
…
embed_jina_embeddings_v3.py
…
embed_matryoshka_fy.py
…
encoder_decoder_multimodal.py
…
encoder_decoder.py
[New Model]mBART model (#22883)
2025-08-16 12:16:58 +00:00
llm_engine_example.py
…
load_sharded_state.py
…
logits_processor.py
[V1] Logits processors extensibility (#19912)
2025-08-16 12:59:17 -07:00
lora_with_quantization_inference.py
…
metrics.py
…
mistral-small.py
…
mlpspeculator.py
…
multilora_inference.py
…
neuron_eagle.py
…
neuron_int8_quantization.py
…
neuron_multimodal.py
…
neuron_speculation.py
…
neuron.py
…
prefix_caching.py
…
prithvi_geospatial_mae.py
…
profiling.py
…
prompt_embed_inference.py
…
qwen3_reranker.py
…
qwen_1m.py
…
reproducibility.py
…
rlhf_colocate.py
…
rlhf_utils.py
…
rlhf.py
…
save_sharded_state.py
…
simple_profiling.py
…
skip_loading_weights_in_engine_init.py
…
spec_decode.py
[Deprecation] Remove prompt_token_ids arg fallback in LLM.generate and LLM.embed (#18800)
2025-08-22 10:56:57 +08:00
structured_outputs.py
[Deprecation] Remove prompt_token_ids arg fallback in LLM.generate and LLM.embed (#18800)
2025-08-22 10:56:57 +08:00
torchrun_example.py
…
tpu.py
…
vision_language_multi_image.py
[Model][VLM] Support R-4B Model (#23246)
2025-08-21 04:08:52 +00:00
vision_language_pooling.py
…
vision_language.py
[Bugfix] Fix extra whitespace in strings caused by newline (#23272)
2025-08-20 22:03:00 -07:00