4 Commits

Author SHA1 Message Date
Woosuk Kwon
e9d3f2ff77
Add memory analyzer & utomatically configure KV cache size (#6) 2023-03-11 23:23:14 -08:00
Woosuk Kwon
3e9f991d6a
Use FlashAttention for multi_query_kv_attention (#4) 2023-03-01 21:13:08 -08:00
Woosuk Kwon
c84c708a1d Add README 2023-02-24 12:04:49 +00:00
Woosuk Kwon
e7d9d9c08c Initial commit 2023-02-09 11:24:15 +00:00