This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-27 13:28:42 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
History
Cyrus Leung
8c054b7a62
[Frontend] Clean up type annotations for mistral tokenizer (
#8314
)
2024-09-10 16:49:11 +00:00
..
async_engine
[Frontend] Clean up type annotations for mistral tokenizer (
#8314
)
2024-09-10 16:49:11 +00:00
basic_correctness
…
compile
…
core
…
data
…
distributed
…
engine
…
entrypoints
…
fp8_kv
…
kernels
[Misc] Fused MoE Marlin support for GPTQ (
#8217
)
2024-09-09 23:02:52 -04:00
lora
…
metrics
…
model_executor
…
models
…
multi_step
…
multimodal
…
plugins
/vllm_add_dummy_model
…
prefix_caching
…
prompt_adapter
…
prompts
…
quantization
…
samplers
…
spec_decode
…
tensorizer_loader
…
tokenization
…
tool_use
[Bugfix] Streamed tool calls now more strictly follow OpenAI's format; ensures Vercel AI SDK compatibility (
#8272
)
2024-09-09 10:45:11 -04:00
tpu
…
tracing
…
weight_loading
[Misc] Fused MoE Marlin support for GPTQ (
#8217
)
2024-09-09 23:02:52 -04:00
worker
…
__init__.py
…
conftest.py
…
test_cache_block_hashing.py
…
test_config.py
…
test_embedded_commit.py
…
test_inputs.py
…
test_logger.py
…
test_logits_processor.py
…
test_regression.py
…
test_sampling_params.py
…
test_scalartype.py
…
test_sequence.py
…
test_sharded_state_loader.py
…
test_utils.py
…
utils.py
…