typhoon/google-cloud/fedora-coreos/kubernetes
Dalton Hubble 70bdc9ec94 Allow bootstrap re-apply for Fedora CoreOS GCP
* Problem: Fedora CoreOS images are manually uploaded to GCP. When a
cluster is created with a stale image, Zincati immediately checks
for the latest stable image, fetches, and reboots. In practice,
this can unfortunately occur exactly during the initial cluster
bootstrap phase.

* Recommended: Upload the latest Fedora CoreOS image regularly
* Mitigation: Allow a failed bootstrap.service run (which won't touch
the done ConditionalPathExists) to be re-run by running `terraforma apply`
again. Add a known issue to CHANGES
* Update docs to show the current Fedora CoreOS stable version to
reduce likelihood users see this issue

 Longer term ideas:

* Ideal: Fedora CoreOS publishes a stable channel. Instances will always
boot with the latest image in a channel. The problem disappears since
it works the same way AWS does
* Timer: Consider some timer-based approach to have zincati delay any
system reboots for the first ~30 min of a machine's life. Possibly just
configured on the controller node https://github.com/coreos/zincati/pull/251
* External coordination: For Container Linux, locksmith filled a similar
role and was disabled to allow CLUO to coordinate reboots. By running
atop Kubernetes, it was not possible for the reboot to occur before
cluster bootstrap
* Rely on https://github.com/coreos/zincati/issues/115 to delay the
reboot since bootstrap involves an SSH session
* Use path-based activation of zincati on controllers and set that
path at the end of the bootstrap process

Rel: https://github.com/coreos/fedora-coreos-tracker/issues/239
2020-03-28 18:12:31 -07:00
..
fcc Allow bootstrap re-apply for Fedora CoreOS GCP 2020-03-28 18:12:31 -07:00
workers Fix delete-node.service kubectl service exec's 2020-03-28 12:35:23 -07:00
LICENSE Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
README.md Update docs from Kubernetes v1.17.4 to v1.18.0 2020-03-25 20:28:30 -07:00
apiserver.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
bootstrap.tf Update Kubernetes from v1.17.4 to v1.18.0 2020-03-25 17:51:50 -07:00
controllers.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
ingress.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
network.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
outputs.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
ssh.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
variables.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
versions.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00
workers.tf Add module for Fedora CoreOS on Google Cloud 2020-02-01 15:21:40 -08:00

README.md

Typhoon

Typhoon is a minimal and free Kubernetes distribution.

  • Minimal, stable base Kubernetes distribution
  • Declarative infrastructure and configuration
  • Free (freedom and cost) and privacy-respecting
  • Practical for labs, datacenters, and clouds

Typhoon distributes upstream Kubernetes, architectural conventions, and cluster addons, much like a GNU/Linux distribution provides the Linux kernel and userspace components.

Features

Docs

Please see the official docs and the Google Cloud tutorial.