vllm/vllm at 0bb1e885a08df59beec7149b1d0d646e24ab1a42 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-24 00:07:41 +08:00

History

Antoni Baum 0bb1e885a0

Make max_model_len configurable (#972 )

2023-09-12 16:29:19 -07:00

..

Make AsyncLLMEngine more robust & fix batched abort (#969 )

2023-09-07 13:43:45 -07:00

Make max_model_len configurable (#972 )

2023-09-12 16:29:19 -07:00

Start background task in AsyncLLMEngine.generate (#988 )

2023-09-08 00:03:39 -07:00

Use FP32 in RoPE initialization (#1004 )

2023-09-11 00:26:35 -07:00

transformers_utils

Only emit warning about internal tokenizer if it isn't being used (#939 )

2023-09-05 00:50:55 +09:00

Align vLLM's beam search implementation with HF generate (#857 )

2023-09-04 17:29:42 -07:00

__init__.py

Bump up the version to v0.1.7 (#1013 )

2023-09-11 00:54:30 -07:00

block.py

[Quality] Add code formatter and linter (#326 )

2023-07-03 11:31:55 -07:00

config.py

Make max_model_len configurable (#972 )

2023-09-12 16:29:19 -07:00

logger.py

[Quality] Add code formatter and linter (#326 )

2023-07-03 11:31:55 -07:00

outputs.py

Align vLLM's beam search implementation with HF generate (#857 )

2023-09-04 17:29:42 -07:00

sampling_params.py

Align vLLM's beam search implementation with HF generate (#857 )

2023-09-04 17:29:42 -07:00

sequence.py

Align vLLM's beam search implementation with HF generate (#857 )

2023-09-04 17:29:42 -07:00

utils.py

[Quality] Add code formatter and linter (#326 )

2023-07-03 11:31:55 -07:00