typhoon

mirror of https://github.com/puppetmaster/typhoon.git synced 2025-10-06 04:04:39 +02:00

Author	SHA1	Message	Date
Dalton Hubble	598f707cbd	Update Kubernetes from v1.31.0 to v1.31.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.31.md#v1311	2024-09-20 14:43:39 -07:00
Dalton Hubble	3412060c3c	Use Cilium kube-proxy replacement when Cilium CNI is used * When using the Cilium component, disable bootstrapping the kube-proxy DaemonSet. Instead, configure Cilium to provide its kube-proxy replacement with BPF * Update the self-managed Cilium component to use kube-proxy replacement as well	2024-08-23 12:33:32 -07:00
Dalton Hubble	808b8a948f	aws: Switch EC2 instances to use resource-based hostnames * Use EC2 resource-based hostnames instead of IP-based hostnames. The Amazon DNS server can resolve A and AAAA queries to IPv4 and IPv6 node addresses * For example, nodes used to be named like `ip-10-11-12-13.us-east-1.compute.internal` but going forward use the instance id `i-0123456789abcdef.us-east-1.compute.internal` * Tag controller node EBS volumes with a name based on the controller node name	2024-08-22 20:02:53 -07:00
Dalton Hubble	10be34daa2	Update Kubernetes from v1.30.4 to v1.31.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.31.md#v1310	2024-08-17 08:32:35 -07:00
Dalton Hubble	320d76c934	Update Kubernetes from v1.30.3 to v1.30.4 * Update Cilium from v1.16.0 to v1.16.1	2024-08-16 08:27:07 -07:00
Dalton Hubble	0120b9f38d	Remove the cluster_domain_suffix variable * Drop support for `cluster_domain_suffix` customization and always use `cluster.local`. Many components in the Kubernetes ecosystem assume this default suffix and its very rare to be setting a special value here these days * Cleanup a few variables that are seldom used	2024-08-02 15:05:25 -07:00
Dalton Hubble	1104b4bf28	AWS: Add CPU pricing mode and controller/worker disk variables * Add `controller_disk_type`, `controller_disk_size`, and `controller_disk_iops` variables * Add `worker_disk_type`, `worker_disk_size`, and `worker_disk_iops` variables and fix propagation to worker nodes * Remove `disk_type`, `disk_size`, and `disk_iops` variables * Add `controller_cpu_credits` and `worker_cpu_credits` variables to set CPU pricing mode for burstable instance types	2024-07-31 15:02:28 -07:00
Dalton Hubble	0669d44026	Update Kubernetes from v1.30.2 to v1.30.3 * Update builtin Cilium manifests from v1.15.6 to v1.15.7 * Update builtin flannel manifests from v0.25.4 to v0.25.5	2024-07-20 11:04:32 -07:00
Dalton Hubble	931d6d18de	Update Kubernetes from v1.30.1 to v1.30.2 * Update CoreDNS from v1.9.4 to v1.11.1 * Update Cilium from v1.15.5 to v1.15.6 * Update flannel from v0.25.1 to v0.25.4	2024-06-17 08:20:03 -07:00
Dalton Hubble	563feacd29	Update Kubernetes from v1.30.0 to v1.30.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.30.md#v1301	2024-05-15 21:59:00 -07:00
dghubble-renovate[bot]	e8a42ae33e	Bump provider ct to v0.13.0	2024-05-04 09:01:19 -07:00
Dalton Hubble	6ac5a0222b	Update Kubernetes from v1.29.3 to v1.30.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.30.md#v1300	2024-04-23 20:51:54 -07:00
Dalton Hubble	8524aa00bc	Update Kubernetes from v1.29.2 to v1.29.3 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.29.md#v1293	2024-03-23 00:47:10 -07:00
Dalton Hubble	f2f625984e	Update Kubernetes from v1.29.1 to v1.29.2 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.29.md#v1292	2024-02-18 18:31:31 -08:00
Dalton Hubble	e247673a20	Update Kubernetes from v1.29.0 to v1.29.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.29.md#v1291	2024-02-04 10:47:42 -08:00
Dalton Hubble	808eafd178	Fix AWS launch template to retain support for IMDVv1 * AWS has recently started defaulting launch templates to IMDSv2 being "required". aws_launch_template is supposed to default to "optional" but it doesn't. * Requiring IMDSv2 sessions breaks a number of applications which don't use AWS SDKs and were never meant to be complex applications (e.g. shell scripts and the like)	2024-02-04 10:38:50 -08:00
Dalton Hubble	84e4f02917	Update Kubernetes from v1.28.4 to v1.29.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.29.md	2023-12-22 10:27:24 -08:00
Dalton Hubble	8254d8f3db	Update Kubernetes from v1.28.3 to v1.28.4 * https://github.com/kubernetes/kubernetes/releases/tag/v1.28.4	2023-11-21 06:16:58 -08:00
Dalton Hubble	005a1119f3	Update Kubernetes from v1.28.2 to v1.28.3 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.28.md#v1283	2023-10-22 18:43:54 -07:00
Dalton Hubble	f5bc1fb1fd	Update Kubernetes from v1.28.1 to v1.28.2 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.28.md#v1282	2023-09-14 13:01:33 -07:00
Dalton Hubble	126973082a	Update Kubernetes from v1.28.0 to v1.28.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.28.md#v1281	2023-08-26 13:29:48 -07:00
Dalton Hubble	81eed2e909	Update Kubernetes from v1.27.4 to v1.28.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.28.md#v1280	2023-08-20 15:41:23 -07:00
Dalton Hubble	0a6183f859	Update Kubernetes from v1.27.3 to v1.27.4 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.27.md#v1274	2023-07-21 08:00:50 -07:00
Dalton Hubble	7255f82d71	Update Kubernetes fromv 1.27.2 to v1.27.3 * Update Cilium v1.13.3 to v1.13.4 Rel: https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.27.md#v1273	2023-06-16 08:28:17 -07:00
Dalton Hubble	094811dc73	Relax aws Terraform Provider version constraint * aws provider v5.0+ works alright and should be permitted, relax the version constraint for the Typhoon AWS kubernetes module and worker module for Fedora CoreOS and Flatcar Linux	2023-06-11 19:46:01 -07:00
Dalton Hubble	8ebf31073c	Update Kubernetes from v1.27.1 to v1.27.2 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.27.md#v1272	2023-05-21 14:02:49 -07:00
Dalton Hubble	501e6d25e0	Update Kubernetes from v1.27.0 to v1.27.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.27.md#v1271	2023-04-15 23:16:51 -07:00
Dalton Hubble	4322857bec	Update Kubernetes from v1.26.3 to v1.27.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.27.md#v1270	2023-04-15 22:49:12 -07:00
Dalton Hubble	3670ec7ed7	Update Kubernetes from v1.26.2 to v1.26.3 * Update Cilium from v1.13.0 to v1.13.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.26.md#v1263	2023-03-21 18:18:19 -07:00
Dalton Hubble	76ebc08fd2	Update Kubernetes from v1.26.1 to v1.26.2 * https://github.com/poseidon/terraform-render-bootstrap/pull/345	2023-03-01 17:13:16 -08:00
Dalton Hubble	f2bf5ac3fb	Update Kubernetes from v1.26.0 to v1.26.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.26.md#v1261	2023-01-19 08:27:56 -08:00
Dalton Hubble	d6cbcf9f96	Update Kubernetes from v1.26.0-rc.1 to v1.26.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.26.md#v1260	2022-12-08 08:47:24 -08:00
Dalton Hubble	0dc8740c77	Update Kubernetes from v1.26.0-rc.0 to v1.26.0-rc.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.26.md#v1260-rc1	2022-12-05 09:31:45 -08:00
Dalton Hubble	a9b12b6bca	Update Kubernetes from v1.25.4 to v1.26.0-rc.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.26.md#v1260-rc0	2022-11-30 08:47:40 -08:00
Dalton Hubble	da76d32aba	Migrate AWS launch configurations to launch templates * Same features, but AWS will soon require launch templates * Starting Dec 31, 2022 AWS will not add new instance types (e.g. graviton 4) to launch configuration support Rel: https://aws.amazon.com/blogs/compute/amazon-ec2-auto-scaling-will-no-longer-add-support-for-new-ec2-features-to-launch-configurations/	2022-11-30 00:26:03 -08:00
Dalton Hubble	26dbc7e91d	Update Kubernetes from v1.25.3 to v1.25.4 * Update Calico from v3.24.3 to v3.24.5 * Update Prometheus and Grafana addons	2022-11-10 09:42:21 -08:00
Dalton Hubble	937acc4b5a	Re-enable Graceful Node Shutdown feature * Kubelet GracefulNodeShutdown works, but only partially handles gracefully stopping the Kubelet. The most noticeable drawback is that Completed Pods are left around * Use a project like poseidon/scuttle or a similar systemd unit as a snippet to add drain and/or delete behaviors if desired * This reverts commit `1786e34f33`. Rel: * https://www.psdn.io/posts/kubelet-graceful-shutdown/ * https://github.com/poseidon/scuttle	2022-11-02 20:49:01 -07:00
Dalton Hubble	0f38a6d405	Remove defunct delete-node.service from worker nodes * delete-node.service used to be used to remove nodes from the cluster on shutdown, but its long since it last worked properly * If there is still a desire for this concept, it can be added with a custom snippet and with a better systemd unit	2022-10-20 08:43:48 -07:00
Dalton Hubble	651151805d	Update Kubernetes v1.25.2 to v1.25.3 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.25.md#v1253	2022-10-13 21:02:39 -07:00
Dalton Hubble	3ee462a24c	Update Kubernetes from v1.25.1 to v1.25.2 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.25.md#v1252	2022-09-22 08:15:30 -07:00
Dalton Hubble	74d4d56dbd	Remove workaround for v1.25.0 ConfigMap rendering issue * LocalStorageCapacityIsolationFSQuotaMonitoring was reverted back to alpha in v1.25.1, so we don't need to explicitly disable it anymore Rel: https://github.com/kubernetes/kubernetes/issues/112081	2022-09-19 09:10:24 -07:00
Dalton Hubble	09751cc0e8	Update Kubernetes from v1.25.0 to v1.25.1 * https://github.com/kubernetes/kubernetes/releases/tag/v1.25.1	2022-09-15 08:23:22 -07:00
Dalton Hubble	1786e34f33	Revert Graceful Node Shutdown feature * Disable Kubelet Graceful Node Shutdown on worker nodes (enabled in Kubernetes v1.25.0 https://github.com/poseidon/typhoon/pull/1222) * Graceful node shutdown shutdown allows 30s for critical pods to shutdown and 15s for regular pods to shutdown before releasing the inhibitor lock to allow the host to shutdown * Unfortunately, both pods and the node are shutdown at the same time at the end of the 45s period without further configuration options. As a result, regular pods and the node are shutdown at the same time. In practice, enabling this feature leaves Error or Completed pods in kube-apiserver state until manually cleaned up. This feature is not ready for general use * Fix issue where Error/Completed pods are accumulating whenever any node restarts (or auto-updates), visible in kubectl get pods * This issue wasn't apparent in initial testing and seems to only affect non-critical pods (due to critical pods being killed earlier) But its very apparent on our real clusters Rel: https://github.com/kubernetes/kubernetes/issues/110755	2022-09-10 14:58:44 -07:00
Dalton Hubble	393a38deff	Configure Graceful Node Shutdown and lengthen max inhibitor delay * Configure Kubelet Graceful Node Shutdown to detect system shutdown events and stop running containers gracefully when possible * Allow up to 30s for critical pods to gracefully shutdown * Allow up to 15s for regular pods to gracefully shutdown * Node will be marked as NotReady promptly, instead of having to wait for health checks * Kubelet uses systemd inhibitor locks to delay shutdown for a limited number of seconds * Raise the default max inhibitor time from 5s to 45s Verify systemd inhibitor locks are present: ``` sudo systemd-inhibit --list WHO UID USER PID COMM WHAT WHY MODE kubelet 0 root 4581 kubelet shutdown Kubelet needs time to handle node shutdown delay ``` Tail journal logs and then shutdown a node via systemctl reboot or via the cloud console to watch container shutdown Rel: * https://kubernetes.io/blog/2021/04/21/graceful-node-shutdown-beta/ * https://kubernetes.io/docs/reference/config-api/kubelet-config.v1beta1/ * https://github.com/kubernetes/kubernetes/issues/107043 * https://github.com/coreos/fedora-coreos-tracker/issues/821 * https://www.freedesktop.org/software/systemd/man/systemd-inhibit.html * https://github.com/kubernetes/kubernetes/blob/release-1.24/pkg/kubelet/nodeshutdown/nodeshutdown_manager_linux.go * https://github.com/godbus/dbus/blob/master/conn.go	2022-08-28 10:37:33 -07:00
Dalton Hubble	275fc0f9e8	Disable LocalStorageCapacityIsolationFSQuotaMonitoring feature * Kubernetes v1.25.0 moved the LocalStorageCapacityIsolationFSQuotaMonitoring feature from alpha to beta, but it breaks Kubelet updating ConfigMaps in Pods, as shown by conformance tests * Kubernetes is rolling LocalStorageCapacityIsolationFSQuotaMonitoring back to alpha so its not enabled by default, but that will require a release * Disable the feature gate directly as a workaround for now to make Kubernetes v1.25.0 usable ``` FailedMount: MountVolume.SetUp failed for volume "configmap-volume" : requesting quota on existing directory /var/lib/kubelet/pods/f09fae17-ff16-4a05-aab3-7b897cb5b732/volumes/kubernetes.io~configmap/configmap-volume but different pod 673ad247-abf0-434e-99eb-1c3f57d7fdaa a4568e94-2b2d-438f-a4bd-c9edc814e478 ``` Rel: * https://github.com/kubernetes/kubernetes/pull/112076 * https://github.com/kubernetes/kubernetes/pull/107329	2022-08-27 09:49:35 -07:00
Dalton Hubble	3fb59a3289	Migrate most Kubelet flags to KubeletConfiguration file * Add a KubeletConfiguration file to replace most Kubelet flags, to prepare for upcoming changes * Pass Kubelet the --config flag to specify the location of the KubeletConfiguration * Remove flsgs / configuration where it matches the defaults * Remove --cgroups-per-qos, defaults to true * Remove --container-runtime, defaults to remote * Remove enforce-node-allocatable=pods, defaults to pods Rel: * https://kubernetes.io/docs/reference/command-line-tools-reference/kubelet/ * https://kubernetes.io/docs/reference/config-api/kubelet-config.v1beta1/	2022-08-27 09:28:15 -07:00
Dalton Hubble	a31dbceac6	Update Kubernetes from v1.24.4 to v1.25.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.25.md	2022-08-25 09:18:14 -07:00
Dalton Hubble	760b4cd5ee	Update Kubernetes from v1.24.3 to v1.24.4 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.24.md#v1244	2022-08-17 20:09:30 -07:00
Dalton Hubble	52427a4271	Refresh instances in autoscaling group when launch configuration changes * Changes to worker launch configurations start an autoscaling group instance refresh to replace instances * Instance refresh creates surge instances, waits for a warm-up period, then deletes old instances * Changing worker_type, disk_, worker_price, worker_target_groups, or Butane worker_snippets on existing worker nodes will replace instances New AMIs or changing `os_stream` will be ignored, to allow Fedora CoreOS or Flatcar Linux to keep themselves updated * Previously, new launch configurations were made in the same way, but not applied to instances unless manually replaced	2022-08-14 21:43:49 -07:00
Dalton Hubble	87a8278c9d	Improve AWS autoscaling group and launch config names * Rename launch configuration to use a name_prefix named after the cluster and worker to improve identifiability * Shorten AWS autoscaling group name to not include the launch config id. Years ago this used to be needed to update the ASG but the AWS provider detects changes to the launch configuration just fine	2022-08-08 20:46:08 -07:00

1 2 3

104 Commits