2 Commits

Author SHA1 Message Date
Richard Zou
902d7df99a Fix eagle dp tests on A100
`TP_SIZE=1 DP_SIZE=2 pytest -v -s tests/v1/distributed/test_eagle_dp.py` fails
on A100 for me before this PR.

Here's what I think is happening:
- the test checks that the tokens produced by a model with eagle are
  identical to those produced by a model without eagle
- the model with eagle uses a draft model to produce draft tokens
- the target model takes all of the draft tokens and then does a forward
  pass to see how many of the tokens to accept/reject. The target model
  is using a batch_size > 1.
- the model without eagle just generates the tokens one-by-one, that is,
  it has batch_size = 1.
- For these two models to be *consistent*, we need batch invariance. So
  I turned on batch invariance (which also required selecting an
  attention backend)
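
A minimal sketch of why the two runs can disagree without batch
invariance. This is plain Python floats, not vLLM code, and the function
names are made up for illustration. The assumption is that a kernel may
reduce the same numbers in a different order depending on batch size,
and floating-point addition is not associative:

```python
# Illustration only: plain Python floats standing in for a kernel reduction.
# `forward_sum` and `reverse_sum` are hypothetical names, not vLLM APIs.

def forward_sum(values):
    """Accumulate left to right: ((v0 + v1) + v2) + ..."""
    total = 0.0
    for v in values:
        total += v
    return total


def reverse_sum(values):
    """Accumulate right to left: ((vN + vN-1) + ...) + v0."""
    total = 0.0
    for v in reversed(values):
        total += v
    return total


values = [0.1, 0.2, 0.3]
a = forward_sum(values)   # (0.1 + 0.2) + 0.3
b = reverse_sum(values)   # (0.3 + 0.2) + 0.1

# Same inputs, different reduction order, different bits.
print(a == b)       # False
print(abs(a - b))   # tiny, but nonzero
```

If two candidate tokens have nearly equal logits, a difference this
small is enough to change which token greedy sampling picks, and the
sequences diverge from that point on.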

Signed-off-by: Richard Zou <zou3519@gmail.com>
2025-12-24 12:07:47 -08:00
Rémi Delacourt
12c007e288
EAGLE Support DP>1 (#26086)
Signed-off-by: Rémi Delacourt <remi@mistral.ai>
Signed-off-by: Rémi Delacourt <54138269+Flechman@users.noreply.github.com>
Signed-off-by: remi <remi@mistral.ai>
2025-11-25 07:32:21 +00:00