
ubuntu-24.04-arm - consistent failures of 1-3 jobs at steps with actions in matrix jobs #37

Open
userdocs opened this issue Jan 23, 2025 · 7 comments

Comments

@userdocs

I am testing a matrix job that starts 20 jobs and I am getting 1-3 jobs failing per run, though it's not the same matrix job failing each time. It will be 1-4 of the 20 at random. The failures seem to happen at specific points but may not produce consistent errors.

Here is an example that I ran as I typed this (I won't rerun the failed jobs on it):

https://github.com/userdocs/qbt-musl-cross-make-test/actions/runs/12939478388/job/36091830161


Over the course of today, working on this repo, I have started 30 or so matrix runs and I don't think a single one has finished without a random error and failure of a matrix job. I don't have this problem with the same jobs, which have been running on ubuntu-latest for ages.
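
As a rough sanity check on those numbers, here is a minimal Python sketch of how likely a fully clean run is. It assumes every job fails independently with the same probability, which is only an assumption for illustration and may not match how the runners actually behave. The function name `clean_run_probability` is made up for this example.

```python
def clean_run_probability(per_job_failure_rate: float, jobs: int = 20) -> float:
    """Chance that every job in one matrix run passes.

    Assumes each job fails independently with the same probability,
    which is an illustrative simplification, not a measured property.
    """
    if not 0.0 <= per_job_failure_rate <= 1.0:
        raise ValueError("per_job_failure_rate must be between 0 and 1")
    return (1.0 - per_job_failure_rate) ** jobs


if __name__ == "__main__":
    # 1, 2 or 3 failing jobs out of 20 per run, taken as an average rate.
    for failed in (1, 2, 3):
        rate = failed / 20
        chance = clean_run_probability(rate)
        print(f"{failed}/20 failing on average: clean run chance {chance:.1%}")
```

Under that assumption, an average of 1 to 3 failures per 20 jobs leaves somewhere between roughly 4% and 36% of runs fully clean, so even a small per-job failure rate is enough to make most 20-job runs need a rerun.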

@jakob-keller

Same here: aio-libs/aiobotocore#1266

The issue also occurs with ubuntu-22.04-arm, leading me to believe it might be the runner and not the images that are to blame.

@userdocs
Author

I have not used that image, but it seems to fail at steps with actions. Here the setup qemu one is the main offender, but it has also failed at updating the host, checkout, and setting up the image, most consistently in the first stages of the job starting.

@userdocs
Author

userdocs commented Jan 24, 2025

Just another one for no obvious reason.


@userdocs
Author

userdocs commented Jan 27, 2025

Another one: https://github.com/userdocs/qbt-workflow-files/actions/runs/12990519066/job/36225908769


Can someone at least acknowledge the issues here so we know whether the stability issues are being worked on?

As it stands ubuntu-24.04-arm clearly has some consistency issues and is consistently failing at predictable points.

Having to rerun a job 5 times or more, when it won't fail on ubuntu-24.04, is not something I'd assume you yolo into production and then ignore?

@Rhys-T

Rhys-T commented Jan 27, 2025

Same here - not Docker, but a similar timeout while trying to talk to systemd from the Nix installer. And when it's not that, it's the checkout step silently failing.
