E2E (NVIDIA L40S x4) (python 3.11) #314

Manually triggered May 31, 2025 18:47
Status Success
Total duration 3h 57m 55s
Artifacts 2

e2e-nvidia-l40s-x4.yml

on: workflow_dispatch
start-large-ec2-runner 2m 59s
stop-large-ec2-runner 3s

Annotations

6 errors
start-large-ec2-runner: AWS EC2 instance starting error
start-large-ec2-runner: InsufficientInstanceCapacity: We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2a). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2b, us-east-2c.
start-large-ec2-runner: We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2a). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2b, us-east-2c.
start-large-ec2-runner: AWS EC2 instance starting error
start-large-ec2-runner: InsufficientInstanceCapacity: We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2b). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2a, us-east-2c.
start-large-ec2-runner: We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2b). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2a, us-east-2c.
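
The annotations above show capacity errors in us-east-2a and then us-east-2b, and each message suggests trying another Availability Zone. A minimal sketch of that kind of zone fallback follows. It is not the workflow's actual implementation: launch_with_zone_fallback, InsufficientCapacity, and fake_launch are hypothetical names, and the launcher is a stand-in that assumes, for illustration only, that us-east-2c has capacity.

```python
from typing import Callable, Iterable, List, Tuple


class InsufficientCapacity(Exception):
    """Hypothetical stand-in for an InsufficientInstanceCapacity error."""


def launch_with_zone_fallback(
    launch: Callable[[str], str],
    zones: Iterable[str],
) -> Tuple[str, List[str]]:
    """Try each zone in order.

    Returns (instance_id, zones_that_failed). Raises InsufficientCapacity
    if every zone in the list reports no capacity.
    """
    failed: List[str] = []
    for zone in zones:
        try:
            instance_id = launch(zone)
        except InsufficientCapacity:
            # Record the failed zone and move on to the next candidate.
            failed.append(zone)
            continue
        return instance_id, failed
    raise InsufficientCapacity("no capacity in any of: " + ", ".join(failed))


def fake_launch(zone: str) -> str:
    # Assumption for illustration: only us-east-2c has capacity.
    if zone != "us-east-2c":
        raise InsufficientCapacity(zone)
    return "i-fake-" + zone


if __name__ == "__main__":
    instance_id, failed = launch_with_zone_fallback(
        fake_launch, ["us-east-2a", "us-east-2b", "us-east-2c"]
    )
    print(instance_id, failed)
```

Passing the launcher as a parameter keeps the retry logic independent of any particular AWS client, so a real EC2 call could be substituted without changing the loop.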

Artifacts

Produced during runtime
Name                                  Size     Digest
phase-1-training-log.jsonl (expired)  3.83 KB  sha256:2579c652cd546ea7d60e2e1e1d66fdefd7c086ce906d0b113bd89b18424f6df4
phase-2-training-log.jsonl (expired)  6.95 KB  sha256:be414e3ef1a0427d93cea7fba458b7eff26d7a33417423520de4f4c1b7564fbc