Skip to content

Bug 1888565: daemon: Explicitly start rpm-ostreed#2291

Merged
openshift-merge-robot merged 1 commit intoopenshift:masterfrom
cgwalters:explicit-start-rpmostree
Dec 12, 2020
Merged

Bug 1888565: daemon: Explicitly start rpm-ostreed#2291
openshift-merge-robot merged 1 commit intoopenshift:masterfrom
cgwalters:explicit-start-rpmostree

Conversation

@cgwalters
Copy link
Copy Markdown
Member

See https://bugzilla.redhat.com/show_bug.cgi?id=1888565
for one instance of this. Since the DBus activation timeout is 25s
but the systemd timeout is 90s, let's defer to the latter by explicitly
starting the service. (This is slightly racy because rpm-ostreed
auto-exits on idle, but when we go to send the DBus message we'll
auto-activate again if really necessary).

Also tell systemd we're going to need the service via the
firstboot unit file.

See https://bugzilla.redhat.com/show_bug.cgi?id=1888565
for one instance of this.  Since the DBus activation timeout is 25s
but the systemd timeout is 90s, let's defer to the latter by explicitly
starting the service.  (This is slightly racy because rpm-ostreed
auto-exits on idle, but when we go to send the DBus message we'll
auto-activate again if really necessary).

Also tell systemd we're going to need the service via the
firstboot unit file.
@openshift-ci-robot openshift-ci-robot added bugzilla/severity-high Referenced Bugzilla bug's severity is high for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. labels Dec 9, 2020
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@cgwalters: This pull request references Bugzilla bug 1888565, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.7.0) matches configured target release for branch (4.7.0)
  • bug is in the state ASSIGNED, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)
Details

In response to this:

Bug 1888565: daemon: Explicitly start rpm-ostreed

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Dec 9, 2020
@ashcrow
Copy link
Copy Markdown
Member

ashcrow commented Dec 10, 2020

/retest

@sinnykumari
Copy link
Copy Markdown
Contributor

Few observation:

  1. We are explicitly starting rpm-ostreed daemon only before fetching OSImageURL. Won't we have to ensure same at other places where we perform rpm-ostree operations?
  2. This seems like a workaround to me. For long term solution, are we planning to make changes in rpm-ostree? Perhaps when user calls rpm-ostree functionality, rpm-ostree first checks and starts rpm-ostreed if not already running.

@ashcrow ashcrow requested review from darkmuggle and sinnykumari and removed request for ashcrow December 10, 2020 14:12
@cgwalters
Copy link
Copy Markdown
Member Author

We are explicitly starting rpm-ostreed daemon only before fetching OSImageURL. Won't we have to ensure same at other places where we perform rpm-ostree operations?

I think the main problem is during OS boot, there's a lot going on, a lot of services starting at once competing for I/O on the disk.

This seems like a workaround to me. For long term solution, are we planning to make changes in rpm-ostree? Perhaps when user calls rpm-ostree functionality, rpm-ostree first checks and starts rpm-ostreed if not already running.

That already happens today - that's how it's all working without this! The problem as noted in the commit message is the disparity between the DBus activation timeout and the systemd one.

But yes, we will likely work around this in rpm-ostree upstream too. It'll just take much longer to land. (And in any case I think the last bit of Wants= is better regardless because it tells systemd explicitly what's happening)

@sinnykumari
Copy link
Copy Markdown
Contributor

Thanks for the explanation, makes sense then.

Comment thread pkg/daemon/rpm-ostree.go

// Start ensures the daemon is running; the DBus activation timeout
// is shorter than the systemd timeout. xref
// https://bugzilla.redhat.com/show_bug.cgi?id=1888565#c3
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just wanted to confirm, this solves referenced bug but it doesn't solve bug https://bugzilla.redhat.com/show_bug.cgi?id=1888565 where we are seeing rpm-ostree cleanup -r failing because of slow disk, right?

@sinnykumari
Copy link
Copy Markdown
Contributor

/lgtm

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cgwalters, sinnykumari

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:
  • OWNERS [cgwalters,sinnykumari]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Dec 10, 2020
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

10 similar comments
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

12 similar comments
@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Copy Markdown
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-merge-robot openshift-merge-robot merged commit d6b5d19 into openshift:master Dec 12, 2020
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@cgwalters: All pull requests linked via external trackers have merged:

Bugzilla bug 1888565 has been moved to the MODIFIED state.

Details

In response to this:

Bug 1888565: daemon: Explicitly start rpm-ostreed

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. bugzilla/severity-high Referenced Bugzilla bug's severity is high for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. lgtm Indicates that a PR is ready to be merged.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants