typhoon

mirror of https://github.com/puppetmaster/typhoon.git synced 2024-12-27 07:39:33 +01:00

Author	SHA1	Message	Date
Dalton Hubble	83f1bd2373	Update ARM64 cluster and hybrid cluster docs * Typhoon now supports arbitrary combinations of controller, worker, and worker pool architectures so we can drop the specific details of full-cluster vs hybrid cluster. Just pick the architecture for each group of nodes accordingly. * However, if a custom node taint is set, continue to configure the cluster's daemonsets accordingly with `daemonset_tolerations`	2024-08-02 20:34:23 -07:00
Dalton Hubble	0120b9f38d	Remove the cluster_domain_suffix variable * Drop support for `cluster_domain_suffix` customization and always use `cluster.local`. Many components in the Kubernetes ecosystem assume this default suffix and it's very rare to set a special value here these days * Clean up a few variables that are seldom used	2024-08-02 15:05:25 -07:00
Dalton Hubble	af27661432	Configure controller and worker node architecture separately * On platforms that support ARM64 instances, configure controller and worker node host architectures separately * For example, you can run arm64 controllers and amd64 workers * Add `controller_arch` and `worker_arch` variables * Remove `arch` variable	2024-08-02 15:04:57 -07:00
Dalton Hubble	516786d7bb	google: Configure controller and worker disk sizes * Add `controller_disk_size` and `worker_disk_size` variables * Remove `disk_size` variable	2024-08-02 13:07:41 -07:00
Dalton Hubble	1104b4bf28	AWS: Add CPU pricing mode and controller/worker disk variables * Add `controller_disk_type`, `controller_disk_size`, and `controller_disk_iops` variables * Add `worker_disk_type`, `worker_disk_size`, and `worker_disk_iops` variables and fix propagation to worker nodes * Remove `disk_type`, `disk_size`, and `disk_iops` variables * Add `controller_cpu_credits` and `worker_cpu_credits` variables to set CPU pricing mode for burstable instance types	2024-07-31 15:02:28 -07:00
Dalton Hubble	0669d44026	Update Kubernetes from v1.30.2 to v1.30.3 * Update builtin Cilium manifests from v1.15.6 to v1.15.7 * Update builtin flannel manifests from v0.25.4 to v0.25.5	2024-07-20 11:04:32 -07:00
Dalton Hubble	0d10d180f8	Change worker node pools from uniform to flexible orchestration mode * Use flexible orchestration mode. Azure has started to recommend this mode because it allows interacting with VMSS instances like regular VMs via the CLI or via the Azure Portal * Add options to allow worker nodes to use ephemeral local disks * Add `controller_disk_type` and `controller_disk_size` variables * Add `worker_disk_type`, `worker_disk_size`, and `worker_ephemeral_disk` variables	2024-07-14 11:58:15 -07:00
Dalton Hubble	a4fab61066	Remove an IPv4 address from Azure clusters * Consolidate load balancer frontend IPs to just the minimal IPv4 and IPv6 addresses that are needed per load balancer. apiserver and ingress use separate ports, so there is not a true need for a separate public IPv4 address just for apiserver * Some might prefer a separate IP just because it slightly hides the apiserver, but these are public hosted endpoints that can be discovered * Reduce the cost of an Azure cluster since IPv4 public IPs are billed ($3.60/mo/cluster)	2024-07-10 22:29:43 -07:00
Dalton Hubble	24b7f31c55	Rename Azure cluster region variable to location * Rename the region variable to location to align with Azure platform conventions, where resources are created within Azure locations, which are themselves part of broader geographical regions	2024-07-09 07:56:58 -07:00
Dalton Hubble	48d4973957	Add IPv6 support for Typhoon Azure clusters * Define a dual-stack virtual network with both IPv4 and IPv6 private address space. Change `host_cidr` variable (string) to a `network_cidr` variable (object) with "ipv4" and "ipv6" fields that list CIDR strings. * Define dual-stack controller and worker subnets. Disable Azure default outbound access (a deprecated fallback mechanism) * Enable dual-stack load balancing to Kubernetes Ingress by adding a public IPv6 frontend IP and LB rule to the load balancer. * Enable worker outbound IPv6 connectivity through load balancer SNAT by adding an IPv6 frontend IP and outbound rule * Configure controller nodes with a public IPv6 address to provide direct outbound IPv6 connectivity * Add an IPv6 worker backend pool. Azure requires separate IPv4 and IPv6 backend pools, though the health probe can be shared * Extend network security group rules for IPv6 source/destinations Checklist: Access to controller and worker nodes via IPv6 addresses: * SSH access to controller nodes via public IPv6 address * SSH access to worker nodes via (private) IPv6 address (via controller) Outbound IPv6 connectivity from controller and worker nodes: ``` nc -6 -zv ipv6.google.com 80 Ncat: Version 7.94 ( https://nmap.org/ncat ) Ncat: Connected to [2607:f8b0:4001:c16::66]:80. Ncat: 0 bytes sent, 0 bytes received in 0.02 seconds. ``` Serving Ingress traffic via IPv4 or IPv6 just requires setting up A and AAAA records and running the ingress controller with `hostNetwork: true`, since hostPort only forwards IPv4 traffic	2024-07-09 07:55:00 -07:00
Dalton Hubble	7b8a51070f	Add Terraform modules for CoreDNS, Cilium, and flannel * With the new component system, these components can be managed independently of the cluster and rolled or edited in advanced ways	2024-05-19 17:00:10 -07:00
Dalton Hubble	533ace7011	Update Cilium from v1.15.4 to v1.15.5 * https://github.com/cilium/cilium/releases/tag/v1.15.5	2024-05-19 16:38:08 -07:00
Dalton Hubble	b3c384fbc0	Introduce the component system for managing pre-installed addons * Previously: Typhoon provisions clusters with kube-system components like CoreDNS, kube-proxy, and a chosen CNI provider (among flannel, Calico, or Cilium) pre-installed. This is convenient since clusters come with "batteries included". But it also means upgrading these components is generally done in lock-step, by upgrading to a new Typhoon / Kubernetes release * It can be valuable to manage these components with a separate plan/apply process or through automations and deploy systems. For example, this allows managing CoreDNS separately from the cluster's lifecycle. * These "components" will continue to be pre-installed by default, but a new `components` variable allows them to be disabled and managed as "addons", components you apply after cluster creation and manage on a rolling basis. For some of these, we may provide Terraform modules to aid in managing these components. ``` module "cluster" { # defaults components = { enable = true coredns = { enable = true } kube_proxy = { enable = true } # Only the CNI set in var.networking will be installed flannel = { enable = true } calico = { enable = true } cilium = { enable = true } } } ``` An earlier variable `install_container_networking = true/false` has been removed, since it can now be achieved with this more extensible and general components mechanism by setting the chosen networking provider enable field to false.	2024-05-19 16:33:57 -07:00
Dalton Hubble	563feacd29	Update Kubernetes from v1.30.0 to v1.30.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.30.md#v1301	2024-05-15 21:59:00 -07:00
Dalton Hubble	3f34e047f1	azure: Add controller security group and subnet outputs * Output the network security group name and address prefixes for controller nodes, to allow adding custom network security rules that apply specifically to controller nodes	2024-05-14 21:34:31 -07:00
Dalton Hubble	cc80ec9b98	Add firewall and security rules for Cilium/Hubble metrics * Add firewall or security rules to allow node-to-node traffic on ports 9962-9965 for Cilium and Hubble metrics. Cilium runs with host network, so these require cloud firewall changes	2024-05-13 21:27:38 -07:00
Dalton Hubble	d08cd317d9	Allow CoreDNS and kube-proxy to be optional components * Allow for more minimal base cluster setups, that manage CoreDNS or kube-proxy as applications, with rolling updates, or deploy systems. Or in the case of kube-proxy, it's becoming more common to not install it and instead use Cilium * Add a `components` pass-through variable to configure pre-installed components like kube-proxy and CoreDNS. These components can be disabled (individually or together) to allow for managing components with separate plan/apply processes or automations * terraform-render-bootstrap manifest assets are now structured as manifests/{coredns,kube-proxy,network} so adapt the controller layout scripts accordingly * This is similar to some changes in v1.29.2 that allowed for the container networking provider manifests to be skipped Related: https://github.com/poseidon/typhoon/pull/1419, https://github.com/poseidon/typhoon/pull/1421	2024-05-12 21:20:27 -07:00
Dalton Hubble	78d5100181	Update Cilium and flannel container images * Update Cilium from v1.15.3 to v1.15.4 * Update flannel from v0.24.4 to v0.25.1	2024-05-12 08:27:27 -07:00
Dalton Hubble	6ac5a0222b	Update Kubernetes from v1.29.3 to v1.30.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.30.md#v1300	2024-04-23 20:51:54 -07:00
Dalton Hubble	cafcdbc3e7	Update etcd from v3.5.12 to v3.5.13 and bump Calico/Cilium * Update Cilium from v1.15.2 to v1.15.3 * Update Calico from v3.27.2 to v3.27.3	2024-04-03 22:51:07 -07:00
Dalton Hubble	fbe36b8b16	Update Cilium and flannel container image versions * https://github.com/cilium/cilium/releases/tag/v1.15.2 * https://github.com/flannel-io/flannel/releases/tag/v0.24.4	2024-03-22 11:19:49 -07:00
Dalton Hubble	41907a0ba6	Update Calico from v3.26.3 to v3.27.2 * Update fixes Calico incompatibility with Fedora CoreOS Rel: https://github.com/projectcalico/calico/issues/8372	2024-02-25 12:11:56 -08:00
Dalton Hubble	2325a503e1	Add an `install_container_networking` variable (default `true`) * When `true`, the chosen container `networking` provider is installed during cluster bootstrap * Set `false` to self-manage the container networking provider. This allows flannel, Calico, or Cilium to be managed via Terraform (like any other Kubernetes resources). Nodes will be NotReady until you apply the self-managed container networking provider. This may become the default in future.	2024-02-24 18:49:38 -08:00
Dalton Hubble	7a46eb03ae	Update Cilium from v1.14.3 to v1.15.1 * https://github.com/cilium/cilium/releases/tag/v1.15.1	2024-02-23 22:59:31 -08:00
Dalton Hubble	0e7977694f	Allow CNI networking to be set to none * Set CNI networking to "none" to skip installing any CNI provider (i.e. no flannel, Calico, or Cilium). In this mode, cluster nodes will be NotReady until you add your own CNI stack * Motivation: I now tend to manage CNI components as addon modules just like other applications overlaid onto a cluster. It allows for faster iteration and may eventually become the recommendation	2024-02-23 22:57:47 -08:00
Dalton Hubble	f2f625984e	Update Kubernetes from v1.29.1 to v1.29.2 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.29.md#v1292	2024-02-18 18:31:31 -08:00
Dalton Hubble	e247673a20	Update Kubernetes from v1.29.0 to v1.29.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.29.md#v1291	2024-02-04 10:47:42 -08:00
Dalton Hubble	84e4f02917	Update Kubernetes from v1.28.4 to v1.29.0 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.29.md	2023-12-22 10:27:24 -08:00
Dalton Hubble	0d997def31	Add release note for v1.28.4	2023-12-10 21:02:21 -08:00
Dalton Hubble	435fa196da	Relax the provider version constraint for Google Cloud * Allow upgrading to the v5.x Google Cloud Terraform Provider * Relax the version constraint to ease future compatibility, though it does allow users to upgrade prematurely	2023-10-30 09:05:06 -07:00
Dalton Hubble	39af942f4d	Update etcd from v3.5.9 to v3.5.10 * https://github.com/etcd-io/etcd/releases/tag/v3.5.10	2023-10-29 18:21:40 -07:00
Dalton Hubble	4c8bfa4615	Update Calico from v3.26.1 to v3.26.3	2023-10-29 18:19:10 -07:00
Dalton Hubble	386a004072	Update Cilium from v1.14.2 to v1.14.3	2023-10-29 18:17:55 -07:00
Dalton Hubble	291107e4c9	Workaround problems in Cilium v1.14 partial kube-proxy replacement * With Cilium v1.14, Cilium's kube-proxy partial mode changed to either be enabled or disabled (not partial). This sometimes leaves Cilium (and the host) unable to reach the kube-apiserver via the in-cluster Kubernetes Service IP, until the host is rebooted * As a workaround, configure Cilium to rely on external DNS resolvers to find the IP address of the apiserver. This is less portable and less "clean" than using in-cluster discovery, but also what Cilium wants users to do. Revert this when the upstream issue https://github.com/cilium/cilium/issues/27982 is resolved	2023-10-29 16:16:56 -07:00
Dalton Hubble	005a1119f3	Update Kubernetes from v1.28.2 to v1.28.3 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.28.md#v1283	2023-10-22 18:43:54 -07:00
Dalton Hubble	0ce8dfbb95	Workaround to allow use of ed25519 keys on Azure * Allow passing a dummy RSA key to Azure to satisfy its obtuse requirements (recommend deleting the corresponding private key) * Then `ssh_authorized_key` can be used to provide Fedora CoreOS or Flatcar Linux with a modern ed25519 public key to set in the authorized_keys via Ignition	2023-09-17 23:21:42 +02:00
Dalton Hubble	8cbcaa5fc6	Update Cilium from v1.14.1 to v1.14.2 * https://github.com/cilium/cilium/releases/tag/v1.14.2	2023-09-16 17:10:07 +02:00
Dalton Hubble	f5bc1fb1fd	Update Kubernetes from v1.28.1 to v1.28.2 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.28.md#v1282	2023-09-14 13:01:33 -07:00
Dalton Hubble	126973082a	Update Kubernetes from v1.28.0 to v1.28.1 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.28.md#v1281	2023-08-26 13:29:48 -07:00
Dalton Hubble	c259142c28	Update Cilium from v1.14.0 to v1.14.1	2023-08-20 16:09:22 -07:00
Dalton Hubble	d29e6e3de1	Upgrade Cilium from v1.13.4 to v1.14.0 * https://github.com/poseidon/terraform-render-bootstrap/pull/360 * Also update flannel from v0.22.0 to v0.22.1	2023-07-30 09:36:23 -07:00
Dalton Hubble	0a6183f859	Update Kubernetes from v1.27.3 to v1.27.4 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.27.md#v1274	2023-07-21 08:00:50 -07:00
Dalton Hubble	9a28fe79a1	Upgrade Calico from v3.25.1 to v3.26.1 * Add new CRD bgpfilters and new ClusterRoles calico-cni-plugin Rel: https://github.com/poseidon/terraform-render-bootstrap/pull/358	2023-06-19 12:28:53 -07:00
Dalton Hubble	7255f82d71	Update Kubernetes from v1.27.2 to v1.27.3 * Update Cilium v1.13.3 to v1.13.4 Rel: https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.27.md#v1273	2023-06-16 08:28:17 -07:00
Dalton Hubble	6f4b4cc508	Update Cilium from v1.13.2 to v1.13.3 * Also update flannel v0.21.2 to v0.22.0 Rel: https://github.com/poseidon/terraform-render-bootstrap/pull/355	2023-06-11 19:59:10 -07:00
Dalton Hubble	094811dc73	Relax aws Terraform Provider version constraint * aws provider v5.0+ works alright and should be permitted, relax the version constraint for the Typhoon AWS kubernetes module and worker module for Fedora CoreOS and Flatcar Linux	2023-06-11 19:46:01 -07:00
Dalton Hubble	2a5a43f3a4	Update etcd from v3.5.8 to v3.5.9 * https://github.com/etcd-io/etcd/releases/tag/v3.5.9	2023-06-11 19:28:23 -07:00
Dalton Hubble	784f60f624	Enable boot diagnostics for Azure controller and worker VMs * When invalid Ignition snippets are provided to Typhoon, it can be useful to view Azure's boot logs for the instance, which requires boot diagnostics be enabled	2023-06-11 19:24:09 -07:00
Dalton Hubble	8ebf31073c	Update Kubernetes from v1.27.1 to v1.27.2 * https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG/CHANGELOG-1.27.md#v1272	2023-05-21 14:02:49 -07:00
Dalton Hubble	fc444d25f8	Update poseidon/ct provider and Butane Config version * Update Fedora CoreOS Butane configs from v1.4.0 to v1.5.0 * Require Fedora CoreOS Butane snippets update to v1.1.0 * Require poseidon/ct Terraform provider v0.13 or newer * Use Ignition v3.4.0 spec for all node provisioning	2023-04-21 08:58:20 -07:00
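
Several of the commits above split or replace module variables: `arch` became `controller_arch` and `worker_arch` (af27661432), and `install_container_networking` gave way to `components` (b3c384fbc0). As a rough sketch of how a cluster definition might look afterwards, assuming a platform with ARM64 support: the module name `example`, the omitted `source`, the chosen values, and the assumption that unset `components` fields keep their defaults are all illustrative and not taken from the Typhoon docs.

```hcl
module "example" {
  # Hypothetical fragment: `source` and the platform's required variables are omitted.

  # Per-group architectures replace the single `arch` variable.
  controller_arch = "arm64"
  worker_arch     = "amd64"

  # Keep pre-installed components overall, but skip CoreDNS so it can be
  # applied after cluster creation and managed separately as an addon.
  # Assumes fields left unset here fall back to the defaults shown in b3c384fbc0.
  components = {
    enable  = true
    coredns = { enable = false }
  }
}
```

Per 83f1bd2373, a cluster that also sets a custom node taint would still need `daemonset_tolerations` configured to match.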

971 Commits