typhoon

mirror of https://github.com/puppetmaster/typhoon.git synced 2025-08-01 09:21:34 +02:00

Author	SHA1	Message	Date
Dalton Hubble	983489bb52	Re-run terraform fmt for formatting	2018-05-14 23:38:16 -07:00
Dalton Hubble	c2b719dc75	Configure Prometheus to scrape Kubelets directly * Use Kubelet bearer token authn/authz to scrape metrics * Drop RBAC permission from nodes/proxy to nodes/metrics * Stop proxying kubelet scrapes through the apiserver, since this required higher privilege (nodes/proxy) and can add load to the apiserver on large clusters	2018-05-14 23:06:50 -07:00
Dalton Hubble	37981f9fb1	Allow bearer token authn/authz to the Kubelet * Require Webhook authorization to the Kubelet * Switch apiserver X509 client cert org to systems:masters to grant the apiserver admin and satisfy the authorization requirement. kubectl commands like logs or exec that have the apiserver make requests of a kubelet continue to work as before * https://kubernetes.io/docs/admin/kubelet-authentication-authorization/ * https://github.com/poseidon/typhoon/issues/215	2018-05-13 23:20:42 -07:00
Dalton Hubble	5eb11f5104	Allow Flatcar Linux os_image on AWS, rename os_channel * Replace os_channel variable with os_image to align naming across clouds. Users who set this option to stable, beta, or alpha should now set os_image to coreos-stable, coreos-beta, or coreos-alpha. * Default os_image to coreos-stable. This continues to use the most recent image from the stable channel as always. * Allow Container Linux derivative Flatcar Linux by setting os_image to `flatcar-stable`, `flatcar-beta`, `flatcar-alpha`	2018-05-12 11:41:58 -07:00
Dalton Hubble	f2ee75ac98	Require Terraform v0.11.x, drop v0.10.x support * Raise minimum Terraform version to v0.11.0 * Terraform v0.11.x has been supported since Typhoon v1.9.2 and Terraform v0.10.x was last released in Nov 2017. I'd like to stop worrying about v0.10.x and remove migration docs as a later followup * Migration docs docs/topics/maintenance.md#terraform-v011x	2018-05-10 02:20:46 -07:00
Dalton Hubble	8b8e364915	Update etcd from v3.3.4 to v3.3.5 * https://github.com/coreos/etcd/releases/tag/v3.3.5	2018-05-10 02:12:53 -07:00
Dalton Hubble	fb88113523	Disable default Google Analytics in Grafana addon * Its come to my attention Grafana reports analytics data by default. Typhoon's philosophy requires user permission for data collection so the addon should have this disabled * http://docs.grafana.org/installation/configuration/#analytics	2018-05-10 01:18:47 -07:00
Dalton Hubble	1854f5c104	Update Grafana from v5.1.1 to v5.1.2 * https://github.com/grafana/grafana/releases/tag/v5.1.2	2018-05-10 01:09:08 -07:00
Dalton Hubble	726b58b697	Update Grafana from v5.0.4 to v5.1.1 * https://github.com/grafana/grafana/releases/tag/v5.1.1 * https://github.com/grafana/grafana/releases/tag/v5.1.0	2018-05-07 22:05:19 -07:00
Michael Holt	a5916da0e2	Update min AWS provider from v1.11 to v1.13	2018-05-02 15:16:03 -07:00
Dalton Hubble	a54e3c0da1	Fix Prometheus data dir to /var/lib/prometheus * A data volume (emptyDir) is mounted to /var/lib/prometheus * Users could swap emptyDir for any desired volume if data persistence is desired. Prometheus previously defaulted to keeping its data in ./data relative to /prometheus. Override this behavior to store data in /var/lib/prometheus	2018-05-01 22:05:27 -07:00
Dalton Hubble	9d4cbb38f6	Rerun terraform fmt	2018-05-01 21:41:22 -07:00
Dalton Hubble	cc29530ba0	Allow preemptible workers on AWS via spot instances * Add `worker_price` to allow worker spot instances. Defaults to empty string for the worker autoscaling group to use regular on-demand instances. * Add `spot_price` to internal `workers` module for spot worker pools * Note: Unlike GCP `preemptible` workers, spot instances require you to pick a bid price.	2018-04-29 13:31:17 -07:00
Dalton Hubble	385584b712	Add changelog notes for release v1.10.2	2018-04-29 12:04:44 -07:00
Dalton Hubble	731a6ec23a	Update nginx-ingress from 0.13.0 to 0.14.0 * https://github.com/kubernetes/ingress-nginx/releases/tag/nginx-0.14.0	2018-04-28 13:10:03 -07:00
Dalton Hubble	e889430926	Update kube-dns from v1.14.9 to v1.14.10 * https://github.com/kubernetes/kubernetes/pull/62676	2018-04-28 00:43:09 -07:00
Dalton Hubble	d81a091756	Switch Atomic docs to reference v1.10.2 tag	2018-04-28 00:27:23 -07:00
Dalton Hubble	32ddfa94e1	Update Kubernetes from v1.10.1 to v1.10.2 * https://github.com/kubernetes/kubernetes/releases/tag/v1.10.2	2018-04-28 00:27:00 -07:00
Dalton Hubble	681450aa0d	Update etcd from v3.3.3 to v3.3.4 * https://github.com/coreos/etcd/releases/tag/v3.3.4	2018-04-27 23:57:26 -07:00
Dalton Hubble	fafa028052	Add Typhoon for Fedora Atomic to changelog	2018-04-27 23:55:59 -07:00
Dalton Hubble	86e5adf348	Set commit hash so tutorials work right now * These modules are alpha, anyone wanting to try then is probably fine using the raw sha	2018-04-26 09:08:06 -07:00
Dalton Hubble	a89f25e31a	Fix typo in announcement	2018-04-26 08:36:50 -07:00
Dalton Hubble	2e4bf4d7ae	Add Fedora Atomic announcement and improve docs	2018-04-26 08:18:39 -07:00
Dalton Hubble	b6a51d0b68	Add architecture docs on operating systems	2018-04-25 22:59:48 -07:00
Dalton Hubble	567e18f015	Fix conflict between Calico and NetworkManager * Observed frequent kube-scheduler and controller-manager restarts with Calico as the CNI provider. Root cause was unclear since control plane was functional and tests of pod to pod network connectivity passed * Root cause: Calico sets up cali* and tunl* network interfaces for containers on hosts. NetworkManager tries to manage these interfaces. It periodically disconnected veth pairs. Logs did not surface this issue since its not an error per-se, just Calico and NetworkManager dueling for control. Kubernetes correctly restarted pods failing health checks and ensured 2 replicas were running so the control plane functioned mostly normally. Pod to pod connecitivity was only affected occassionally. Pain to debug. * Solution: Configure NetworkManager to ignore the Calico ifaces per Calico's recommendation. Cloud-init writes files after NetworkManager starts, so a restart is required on first boot. On subsequent boots, the file is present so no restart is needed	2018-04-25 21:45:58 -07:00
Dalton Hubble	0a7fab56e2	Load ip_vs kernel module on boot as workaround * (containerized) kube-proxy warns that it is unable to load the ip_vs kernel module despite having the correct mounts. Atomic uses an xz compressed module and modprobe in the container was not compiled with compression support * Workaround issue for now by always loading ip_vs on-host * https://github.com/kubernetes/kubernetes/issues/60	2018-04-25 21:45:58 -07:00
Dalton Hubble	d784b0fca6	Switch to quay.io/poseidon tagged system containers	2018-04-25 18:15:18 -07:00
Dalton Hubble	cd913986df	Write documentation for Fedora Atomic	2018-04-24 01:10:27 -07:00
Dalton Hubble	af54efec28	Organize docs by operating system	2018-04-23 19:55:28 -07:00
Dalton Hubble	7198b9016c	Update Calico from v3.0.4 to v3.1.1 for Atomic	2018-04-21 18:46:56 -07:00
Dalton Hubble	f36c890234	Fix ostree repo to be called fedora-atomic on bare-metal * atomic host updates were fetching updates from the repo cache fedora-atomic-27, instead of from upstream	2018-04-21 18:46:56 -07:00
Dalton Hubble	233ec6dcb0	Update Fedora Atomic AMI to version 27.122 * http://www.projectatomic.io/blog/2018/04/fedora-atomic-20-apr-18/ * Atomic publishes nightly AMIs which sometimes don't boot or have issues. Until there is a source of reliable AMIs, pin the best known working AMI * Rel 66a66f0d18544591ffdbf8fae9df790113c93d72	2018-04-21 18:46:56 -07:00
Dalton Hubble	3f2978821b	Add atomic_assets_endpoint var for fedora-atomic bare-metal	2018-04-21 18:46:56 -07:00
Dalton Hubble	9b88d4bbfd	Use bootkube system container on fedora-atomic * Use the upstream bootkube image packaged with the required metadata to be usable as a system container under systemd * Run bootkube with runc so no host level components use Docker any more. Docker is still the runtime * Remove bootkube script and old systemd unit	2018-04-21 18:46:56 -07:00
Dalton Hubble	3dde4ba8ba	Mount host's /etc/os-release in kubelet system containers * Fix `kubectl describe node` to reflect the host's operating system	2018-04-21 18:46:56 -07:00
Dalton Hubble	e148552220	Enable kubelet allocatable enforcement and QoS cgroup hierarchy * Change kubelet system image to use --cgroups-per-qos=true (default) instead of false * Change kubelet system image to use --enforce-node-allocatable=pods instead of an empty string	2018-04-21 18:46:56 -07:00
Dalton Hubble	d8d1468f03	Update kubelet system container image to mount /etc/hosts * Fix kubelet port-forward on Google Cloud / Fedora Atomic * Mount the host's /etc/hosts in kubelet system containers * Problem: kubelet runc system containers on Atomic were not mounting the host's /etc/hosts, like rkt-fly does on Container Linux. `kubectl port-forward` calls socat with localhost. DNS servers on AWS, DO, and in many bare-metal environments resolve localhost to the caller as a convenience. Google Cloud notably does not nor is it required to do so and this surfaced the missing /etc/hosts in runc kubelet namespaces.	2018-04-21 18:46:56 -07:00
Dalton Hubble	2b74aba564	Add Google Cloud fedora-atomic module * Network load balancer for ingress doesn't work yet because Compute Engine packages are missing * port-forward / socat is broken	2018-04-21 18:46:56 -07:00
Dalton Hubble	24d230505a	Add cloud-metadata.service on AWS fedora-atomic	2018-04-21 18:46:56 -07:00
Dalton Hubble	cf22e70b46	Name ostree remote repo fedora-atomic across platforms	2018-04-21 18:46:56 -07:00
Dalton Hubble	b3cf9508b6	Update Fedora Atomic modules to Kubernetes v1.10.1	2018-04-21 18:46:56 -07:00
Dalton Hubble	5212684472	Temporarily pin Fedora Atomic AMI * Atomic has published AMI images that shutdown immediately after being powered on	2018-04-21 18:46:56 -07:00
Dalton Hubble	f990473cde	Update control plane manifests and add etcd metrics * Enable etcd v3.3 metrics to expose metrics for scraping by Prometheus * Use k8s.gcr.io instead of gcr.io/google_containers * Add flexvolume plugin mount to controller manager * Update kube-dns from v1.14.8 to v1.14.9	2018-04-21 18:46:56 -07:00
Dalton Hubble	8523a086e2	Fix kubelet system container to mount CNI plugins * Mount /opt/cni/bin in kubelet system container so CNI plugin binaries can be found. Before, flannel worked because the kubelet falls back to flannel plugin baked into the hyperkube (undesired) * Move the CNI bin install location later, since /opt changes may be lost between ostree rebases	2018-04-21 18:46:56 -07:00
Dalton Hubble	19bc5aea9e	Use kubelet system container on fedora-atomic * Use the upstream hyperkube image packaged with the required metadata to be usable as a system container under systemd * Fix port-forward since socat is included	2018-04-21 18:46:56 -07:00
Dalton Hubble	8d7cfc1a45	Use etcd system container on fedora-atomic * Use the upstream etcd image packaged with the required metadata to be usable as a system container (runc) under systemd	2018-04-21 18:46:56 -07:00
Dalton Hubble	9969c357da	Change AWS Fedora module to fedora-atomic	2018-04-21 18:46:56 -07:00
Dalton Hubble	4e43b2ff48	Change DO Fedora module to fedora-atomic	2018-04-21 18:46:56 -07:00
Dalton Hubble	ddc75e99ac	Add bare-metal Fedora Atomic module * Several known hacks and broken areas * Download v1.10 Kubelet from release tarball * Install flannel CNI binaries to /opt/cni * Switch SELinux to Permissive * Disable firewalld service * port-forward won't work, socat missing	2018-04-21 18:46:56 -07:00
Dalton Hubble	b80a2eb8a0	Sync fedora-cloud modules with Container Linux * Update manifests for Kubernetes v1.10.0 * Update etcd from v3.3.2 to v3.3.3 * Add disk_type optional variable on AWS * Remove redundant kubeconfig copy on AWS * Distribute etcd secres only to controllers * Organize module variables and ssh steps	2018-04-21 18:46:56 -07:00

... 18 19 20 21 22 ...

1318 Commits