[Hardware][Intel-Gaudi] [CI/Build] Add tensor parallel size = 2 test to HPU CI (#18709)

Signed-off-by: Lukasz Durejko <ldurejko@habana.ai>
This commit is contained in:
Łukasz Durejko 2025-05-26 14:26:07 +02:00 committed by GitHub
parent 0877750029
commit e76be06550
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -21,4 +21,6 @@ remove_docker_container
# Run the image and launch offline inference
docker run --runtime=habana --name=hpu-test --network=host -e HABANA_VISIBLE_DEVICES=all -e VLLM_SKIP_WARMUP=true --entrypoint="" hpu-test-env python3 examples/offline_inference/basic/generate.py --model facebook/opt-125m
docker run --runtime=habana --name=hpu-test --network=host -e HABANA_VISIBLE_DEVICES=all -e VLLM_SKIP_WARMUP=true --entrypoint="" hpu-test-env python3 examples/offline_inference/basic/generate.py --model facebook/opt-125m --tensor-parallel-size 2
EXITCODE=$?