vllm/models at 0deacbce6e96a1af5885babc4e470ce2a0cecf95 - vllm - 丝路新云 Code Repository

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-01 10:43:31 +08:00

Woosuk Kwon 0deacbce6e

Implement single_query_cached_kv_attention kernel (#3)

2023-03-01 15:02:19 -08:00

__init__.py        Add input metadata                                       2023-02-22 19:01:20 +00:00
attention.py       Implement single_query_cached_kv_attention kernel (#3)   2023-03-01 15:02:19 -08:00
input_metadata.py  Fix attention                                            2023-02-23 23:02:25 +00:00
model_utils.py     Fix a bug in tying OPT embeddings (#1)                   2023-02-24 16:29:36 -08:00
opt.py             Fix a bug in tying OPT embeddings (#1)                   2023-02-24 16:29:36 -08:00
sample.py          Fix sampler                                              2023-02-23 20:30:12 +00:00