-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
systemd unit ldconfig.service
fails to start
#1493
Comments
I had this issue once in a VM and I suspected a memory issue. |
I could reliably reproduce the issue by bisecting the RAM size of a qemu-arm64 vm on arm64. 256MB of RAM, the system OOMs and does not boot. 512 MB of RAM, the system runs fine. But on 384 MB of RAM, I got this in the journalctl logs:
|
See: flatcar/Flatcar#1493 Signed-off-by: Adrian Vladu <[email protected]>
See: flatcar/Flatcar#1493 Signed-off-by: Adrian Vladu <[email protected]>
ldconfig runs after switch-root:
The segfault due to low memory may be different than other failure modes. I debugged the coredump for an occurance of this in tests once, and saw that it had to do with parsing |
|
See: flatcar/Flatcar#1493 Signed-off-by: Adrian Vladu <[email protected]>
I doubt this is memory-related. I just saw this on a local 2GB QEMU instance. It seems random, it worked many other times. I think the |
This is in hope to either fix or pinpoint the cause of flatcar/Flatcar#1493.
This is in hope to either fix or pinpoint the cause of flatcar/Flatcar#1493.
This is in hope to either fix or pinpoint the cause of flatcar/Flatcar#1493.
Description
systemd unit
ldconfig.service
fails to start sometimes on the first boot or the subsequent reboots, but it cannot be reproduced reliably.ldconfig.service
runs before the switchroot during the initrd stage and is more cumbersome to properly reproduce it. [UPDATE]ldconfig.service
runs after switchroot.This is an issue that I have seen in the wild for a while, usually after rebooting a Flatcar instance on ARM64. From what I know, this issue does not affect the functionality of the Flatcar instance.
This issue has made a recurrence recently in the Github Actions. From the GitHub Actions:
Debugging this issue, I could get a warning message by running manually
ldconfig -X
, which had exit code 0 and error message:Message: /lib/ld.so.conf is not an ELF file - it has the wrong magic bytes ...
.This issue might be due to the wrong path for the /lib/ld.so.conf and it might be related to this commit: flatcar/scripts@ba45a2b.
I do not know yet if the warning messsage and the systemd unit failure are related, but maybe the warning message sometimes is seen as error output and thus fails the unit.
The definition of the systemd unit:
Impact
The test framework needs to re-run the test, sometimes 3 or 4 times. In real world scenarios, because of the systemd unit failure, automation might break or sanitiy check tools might flag this issue.
Environment and steps to reproduce
Example test run that had to retry some of the Mantle tests:
https://github.com/flatcar/scripts/actions/runs/9777950641?pr=2089
Expected behavior
systemd unit ldconfig.service should not fail.
The text was updated successfully, but these errors were encountered: