Appending /etc/hosts is triggering master reboot cycles #63

wking · 2018-12-08T09:03:50Z

Since #56, this operator has been adding content to /etc/hosts. But that file is managed by the machine-config daemon, and when the MCD detects the altered content, it reboots the node. The node comes back up, and tries to restore the expected pods, and presumably the whole cycle repeats again. This makes it harder for external clients to connect to the Kubernetes and OpenShift API servers (openshift/origin#21612), and that shows up in failed aws-e2e CI runs as random, unrelated flakes. I'm not sure what the /etc/hosts additions are for, but can you find another approach that accomplishes the same goal? Or find some way to ask the MCD to append your content, instead of reaching around the MCD and touching the file directly?

CC @abhinavdahiya, @ironcladlou, @pravisankar

The text was updated successfully, but these errors were encountered:

wking · 2018-12-08T09:08:35Z

Possible resolution is for the MCD to stop caring about /etc/hosts: openshift/machine-config-operator#225.

Miciah · 2018-12-10T06:12:45Z

I'm not sure what the /etc/hosts additions are for, but can you find another approach that accomplishes the same goal? Or find some way to ask the MCD to append your content, instead of reaching around the MCD and touching the file directly?

We update /etc/hosts to add the registry's name and address so that the container runtime can resolve it. We considered using MCD, but we did not know a way to use it to update the file without triggering a reboot. We did not realize that MCD would trigger a reboot anyway if we modified /etc/hosts ourselves.

wking · 2018-12-10T07:01:57Z

We update /etc/hosts to add the registry's name and address so that the container runtime can resolve it.

Well, with openshift/machine-config-operator#225 landed, the MCD is out of the business of touching /etc/hosts (it had been injecting this). Can you take over exclusive /etc/hosts management in this operator? It seems like a reasonable fit for the DNS operator, but I'm not familiar enough to know.

ironcladlou · 2018-12-10T15:13:32Z

@knobunc @smarterclayton any concerns with cluster-dns-operator assuming management responsibilities for /etc/hosts for now? I don't know of any other components who need an interface.

ironcladlou · 2018-12-10T15:15:28Z

@wking want to close this one out as resolved by
openshift/machine-config-operator#225?

wking · 2018-12-10T15:40:36Z

Works for me, just so y'all know you're officially in charge of the file now ;).

knobunc · 2018-12-10T19:28:03Z

@wking cool. Thanks... we just have a different hack to manage the same thing... but a little more dynamically :-)

This was referenced Dec 8, 2018

Rebooting after: "Updating machineconfig from {hash} to {same-hash}" openshift/machine-config-operator#224

Closed

data/aws/variables-aws: Bump master volume to 500 GiB for I/O openshift/installer#844

Closed

This was referenced Dec 9, 2018

templates: drop files related to registry openshift/machine-config-operator#225

Merged

pkg: Pin to RHCOS 47.198 and quay.io/openshift-release-dev/ocp-release:4.0.0-4 openshift/installer#848

Closed

wking closed this as completed Dec 10, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Appending /etc/hosts is triggering master reboot cycles #63

Appending /etc/hosts is triggering master reboot cycles #63

wking commented Dec 8, 2018 •

edited

Loading

wking commented Dec 8, 2018

Miciah commented Dec 10, 2018

wking commented Dec 10, 2018 •

edited

Loading

ironcladlou commented Dec 10, 2018

ironcladlou commented Dec 10, 2018

wking commented Dec 10, 2018

knobunc commented Dec 10, 2018

Appending /etc/hosts is triggering master reboot cycles #63

Appending /etc/hosts is triggering master reboot cycles #63

Comments

wking commented Dec 8, 2018 • edited Loading

wking commented Dec 8, 2018

Miciah commented Dec 10, 2018

wking commented Dec 10, 2018 • edited Loading

ironcladlou commented Dec 10, 2018

ironcladlou commented Dec 10, 2018

wking commented Dec 10, 2018

knobunc commented Dec 10, 2018

wking commented Dec 8, 2018 •

edited

Loading

wking commented Dec 10, 2018 •

edited

Loading