Fix scheduler

This commit is contained in:
Woosuk Kwon 2024-04-25 05:06:22 +00:00
parent 98eda57899
commit b62170e4e3

View File

@ -666,6 +666,10 @@ class Scheduler:
budget.add_num_batched_tokens(seq_group.request_id, num_new_tokens)
budget.add_num_seqs(seq_group.request_id, num_new_seqs)
# FIXME(woosuk): For TPUs, we want to schedule only one prompt
# per scheduling step.
break
# Queue requests that couldn't be scheduled.
waiting_queue.extendleft(leftover_waiting_sequences)
if len(seq_groups) > 0: