internal/exec/stages/disks: prevent races with udev #1319
Conversation
FYI, Flatcar has already been using this for a while now.
jlebon
left a comment
Thanks for upstreaming this and writing a detailed commit message! Always fun to debug udev races. :)
Had a minor comment, but this looks sane to me overall.
One concern is that we're calling this on every separate filesystem/disk/LUKS/RAID device. Would it be more efficient to instead do it only once per section for the last device in each associated list?
848f20a to 716cec5
That would rely on inotify for the other devices, and I don't think there is any guarantee that the inotify event goes into the queue and gets fully processed if we only trigger a tagged event for the last device and wait for that event's completion. Maybe, after checking all the implementation details, we could convince ourselves that it works for the current udev implementation, but I would rather stay on the safe side, since the absence of the race condition is not verifiable through testing…
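To make the per-device approach concrete, here is a minimal sketch. It is not the actual code from this PR; the helper name and the exact udevadm invocation are illustrative, assuming we shell out to `udevadm trigger --settle` for each device we modified:

```go
package disks

import (
	"context"
	"fmt"
	"os/exec"
)

// triggerAndSettle issues a synthetic "change" uevent for the given device
// and blocks until udev has finished processing it, so that the recreated
// entries under /dev are guaranteed to exist before we continue.
// Illustrative only: the real implementation may tag the event and wait for
// the tag rather than relying on `udevadm trigger --settle`.
func triggerAndSettle(ctx context.Context, dev string) error {
	out, err := exec.CommandContext(ctx, "udevadm", "trigger", "--settle", dev).CombinedOutput()
	if err != nil {
		return fmt.Errorf("waiting for udev to process %q failed: %v: %s", dev, err, out)
	}
	return nil
}
```

Calling something like this once per created filesystem/partition/LUKS device avoids depending on inotify event ordering for the devices that were not explicitly triggered.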
d14a76a to fba4ff4
I won't be able to look at this for some weeks, unfortunately, but it's still on my list and I'll circle back as soon as I can.
Your reasoning makes sense to me, but I haven't thought it through carefully. I'll defer to re-review from @jlebon, which actually I should have done months ago. 😳 Thanks for your patience. PR needs a rebase to pick up CI changes.
d12147c to 3c6297c
I added a release note entry, as required.
@jlebon Can you do the approval (actually, you already did, so I'm not sure you need to submit again) and hit the merge button?
The "udevadm settle" command used to wait for udev to process the disk changes and recreate the entries under /dev was still prone to races where udev didn't get notified yet of the final event to wait for. This caused the boot with a btrfs root filesystem created by Ignition to fail almost every time on certain hardware. Issue tagged events and wait for them to be processed by udev. This is actually meanigful in all stages not only for the other parts of the initramfs which may be surprised by sudden device nodes disappearing shortly like the case was with systemd's fsck service but also for the inter-stage dependencies which currently are using the waiter for systemd device units but that doesn't really prevent from races with udev device node recreation. Thus, these changes are complementary to the existing waiter which mainly has the purpose to wait for unmodified devices. For newly created RAIDs we can wait for the new node to be available as udev will not recreate it.
3c6297c to 3f78465
jlebon
left a comment
Thanks for this. It (still) makes sense to me, and in fact we do something similar in other parts of the stack for similar reasons. I'm pretty sure we should be able to drop some of those with this.
I just tweaked the error messages and the release note item. Let me know if you disagree with some of them.
The "udevadm settle" command used to wait for udev to process the disk
changes and recreate the entries under /dev was still prone to races
where udev didn't get notified yet of the final event to wait for.
This caused the boot with a btrfs root filesystem created by Ignition
to fail almost every time on certain hardware.
Issue tagged events and wait for them to be processed by udev. This is
actually meanigful in all stages not only for the other parts of the
initramfs which may be surprised by sudden device nodes disappearing
shortly like the case was with systemd's fsck service but also for the
inter-stage dependencies which currently are using the waiter for
systemd device units but that doesn't really prevent from races with
udev device node recreation. Thus, these changes are complementary to
the existing waiter which mainly has the purpose to wait for unmodified
devices. For newly created RAIDs we can wait for the new node to be
available as udev will not recreate it.
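For the RAID case mentioned in the last sentence, a rough sketch (a hypothetical helper, not the PR's actual code) would simply poll for the freshly created node, since udev will not recreate it:

```go
package disks

import (
	"context"
	"fmt"
	"os"
	"time"
)

// waitForNode polls until the newly created device node (e.g. a new RAID
// node under /dev/md/) appears. Since udev does not recreate the node for a
// fresh RAID, its existence is enough; no trigger/settle round-trip needed.
func waitForNode(ctx context.Context, node string) error {
	for {
		if _, err := os.Stat(node); err == nil {
			return nil
		} else if !os.IsNotExist(err) {
			return err
		}
		select {
		case <-ctx.Done():
			return fmt.Errorf("giving up waiting for %q: %w", node, ctx.Err())
		case <-time.After(100 * time.Millisecond):
		}
	}
}
```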