E2E (NVIDIA L40S x4) (python 3.11) #314
e2e-nvidia-l40s-x4.yml
on: workflow_dispatch
Annotations
6 errors
start-large-ec2-runner
AWS EC2 instance starting error

start-large-ec2-runner
InsufficientInstanceCapacity: We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2a). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2b, us-east-2c.

start-large-ec2-runner
We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2a). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2b, us-east-2c.

start-large-ec2-runner
AWS EC2 instance starting error

start-large-ec2-runner
InsufficientInstanceCapacity: We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2b). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2a, us-east-2c.

start-large-ec2-runner
We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2b). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2a, us-east-2c.

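The annotations above show the runner start failing in us-east-2a and then in us-east-2b, with each error suggesting the remaining zones. A minimal sketch of that zone-fallback pattern follows. It assumes a caller-supplied `launch` function. `CapacityError`, `launch_with_zone_fallback`, and `simulated_launch` are hypothetical names used for illustration, not part of the workflow or of any AWS SDK. The log does not show whether us-east-2c had capacity, so the simulated success there is only an assumption.

```python
from typing import Callable, Iterable, Optional


class CapacityError(Exception):
    """Hypothetical stand-in for an InsufficientInstanceCapacity failure."""

    def __init__(self, zone: str) -> None:
        super().__init__(f"InsufficientInstanceCapacity in {zone}")
        self.zone = zone


def launch_with_zone_fallback(
    launch: Callable[[str], str],
    zones: Iterable[str],
) -> Optional[str]:
    """Try each zone in order and return the first successful result.

    Returns None when every zone reports a capacity error. Any other
    exception is deliberately left to propagate to the caller.
    """
    for zone in zones:
        try:
            return launch(zone)
        except CapacityError:
            # No capacity in this zone: move on to the next candidate.
            continue
    return None


def simulated_launch(zone: str) -> str:
    """Simulated launcher: fails in the two zones seen in the log."""
    if zone in {"us-east-2a", "us-east-2b"}:
        raise CapacityError(zone)
    # Assumption for illustration only: the remaining zone succeeds.
    return f"launched-in-{zone}"
```

In a real workflow the `launch` callable would wrap the actual instance request and translate the provider's capacity error into `CapacityError`. The fallback loop itself stays unchanged.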
Artifacts
Produced during runtime
| Name | Size | Digest |
|---|---|---|
| phase-1-training-log.jsonl (Expired) | 3.83 KB | sha256:2579c652cd546ea7d60e2e1e1d66fdefd7c086ce906d0b113bd89b18424f6df4 |
| phase-2-training-log.jsonl (Expired) | 6.95 KB | sha256:be414e3ef1a0427d93cea7fba458b7eff26d7a33417423520de4f4c1b7564fbc |
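The Digest column can be used to check a downloaded artifact against the listed value. The sketch below computes a file's SHA-256 in the same `sha256:<hex>` form. `sha256_digest` and `matches_digest` are hypothetical helper names. It is an assumption that the listed digest covers the file as downloaded: it may instead refer to the uploaded archive rather than the raw `.jsonl` file. Both artifacts in this run are marked Expired, so the check only applies to a run whose artifacts are still available.

```python
import hashlib


def sha256_digest(path: str, chunk_size: int = 65536) -> str:
    """Return the SHA-256 of the file at `path` as 'sha256:<hex>'."""
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large artifacts do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return "sha256:" + hasher.hexdigest()


def matches_digest(path: str, expected: str) -> bool:
    """Compare a file's digest with an expected 'sha256:<hex>' string."""
    return sha256_digest(path) == expected.strip().lower()
```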