Add scheduled workflow to rerun GPU health check failures #1683

Merged: v0i0 merged 1 commit into main from rerun-gpu-failures on Mar 23, 2026
Conversation

v0i0 (Contributor) commented Mar 14, 2026

No description provided.

meta-cla Bot added the CLA Signed label (managed by the Meta Open Source bot) Mar 14, 2026
v0i0 force-pushed the rerun-gpu-failures branch from 43b609d to a645bca March 20, 2026 23:23
v0i0 marked this pull request as ready for review March 20, 2026 23:23
v0i0 merged commit 2e33fca into main Mar 23, 2026
20 of 21 checks passed
huydhn (Contributor) commented Mar 25, 2026

We can revert this change now that #1808 is working. I have a test PR that fails the CUDA check step: https://github.com/pytorch/helion/actions/runs/23565563780/attempts/2. The second attempt comes from the bot, right after the workflow fails. The third comes from #1683, which is no longer needed.

huydhn added a commit that referenced this pull request Mar 26, 2026
)"

This reverts commit 2e33fca.

Signed-off-by: Huy Do <huydhn@gmail.com>
huydhn added a commit that referenced this pull request Mar 27, 2026