Elastic Kubernetes Service¶
Info
Updated 9/12/2023
Amazon EKS is a fully managed service to run Kubernetes. It is highly available: AWS manages the control plane, while the user defines the worker nodes and the workloads deployed on them. A typical architecture looks like this (a minimal `eksctl` sketch of this topology follows the list below):
- Three public subnets and three private subnets are created across three availability zones. The EKS node group is provisioned in the private subnets and communicates with the EKS control plane in private cluster mode, through the private endpoint.
- Because the Pods inside the private EKS cluster may need to access internet resources, NAT gateways are set up in the public subnets.
- An Application Load Balancer is used as the ingress to route traffic to the application deployed in the private subnets. Only the ALB can access the EKS cluster, by referencing the ALB security group from the EKS data plane security group. EKS is integrated with VPC for isolation, IAM for authentication, ELB for load distribution, and ECR as the container image registry.
- Worker nodes can be EC2 instances in a Managed Node Group (EKS managed) or Fargate worker nodes defined as part of a Fargate profile.
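As a sketch of the topology described above (assuming `eksctl` is installed; the cluster name, region, and zones are placeholders to adapt), a cluster with private-only nodes across three AZs can be created with:

```sh
# Hypothetical example: managed nodes in private subnets across 3 AZs,
# with NAT gateways created by eksctl for outbound internet access.
eksctl create cluster \
  --name my-private-cluster \
  --region us-west-2 \
  --zones us-west-2a,us-west-2b,us-west-2c \
  --nodegroup-name private-ng \
  --nodes 3 \
  --node-private-networking
```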
Major characteristics¶
- Scales the K8s control plane across multiple AZs.
- No need to install, operate, and maintain the k8s cluster.
- Automatically scales control plane instances based on load, and detects and replaces unhealthy control plane instances.
- Supports EC2 instances as worker nodes, Fargate for serverless containers, or nodes on AWS Outposts.
- Fully conformant with CNCF Kubernetes, so existing Kubernetes tooling and workloads remain compatible.
- Can be deployed on-premises with the Amazon EKS Distro (EKS-D).
The EKS node types are:
- Managed node groups: automate the provisioning and lifecycle management of EC2 instances (On-Demand or Spot). Nodes run the latest Amazon EKS optimized AMIs. This is the recommended way to allocate capacity. The EC2 instances are managed by AWS and assigned to an ASG managed by EKS. Nodes are tagged for auto-discovery by the k8s cluster autoscaler. A launch template can be configured to use a custom AMI or custom user data. A classical multi-AZ deployment uses one ASG/MNG for stateless applications, while for stateful apps we need to consider one MNG/ASG per AZ.
- Self-managed nodes: nodes are managed by the user and attached to the EKS cluster through an ASG. The user provides their own AMI-based EC2 instances, which are part of the Auto Scaling group and serve as worker nodes for the cluster. Spot instances can be used to reduce cost.
- AWS Fargate: a serverless, cost-optimized option for EKS worker capacity. Each time a Pod is created, it is scheduled onto Fargate-managed capacity and runs within its own right-sized microVM; there are no EC2 worker nodes to manage. It works with ALB.
Data volumes (EBS, EFS, FSx) are defined with a StorageClass and require a Container Storage Interface (CSI) compliant driver.
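For example, assuming the EBS CSI driver is installed in the cluster, a gp3 StorageClass could be declared as in this sketch (the class name is a placeholder):

```sh
# Minimal StorageClass sketch backed by the EBS CSI driver
# (provisioner is the standard ebs.csi.aws.com)
cat <<'EOF' | kubectl apply -f -
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: gp3-csi
provisioner: ebs.csi.aws.com
parameters:
  type: gp3
volumeBindingMode: WaitForFirstConsumer
EOF
```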
See the Pricing calculator: we pay for the cluster control plane, and for the EC2 instances, Fargate, or AWS Outposts capacity.
Cluster management¶
- EKS runs a single-tenant Kubernetes control plane for each cluster: 3 `etcd` instances across 3 AZs within one region.
- A set of add-ons is installed by default: CoreDNS, kube-proxy, VPC CNI. Others can be added, like the metrics server, cluster autoscaler, EFS CSI driver, EBS CSI driver, and Fluent Bit (a logging and metrics processor and forwarder).
- EKS uses IAM to provide authentication to the Kubernetes cluster, and k8s RBAC for authorization.
- EKS hosts a public OIDC discovery endpoint per cluster, containing the signing keys for the `ProjectedServiceAccountToken` JSON web tokens, so external systems, like IAM, can validate and accept the Kubernetes-issued OIDC tokens. OIDC federation allows a user or an app to assume IAM roles via the Security Token Service (STS): the caller authenticates with the OIDC provider, receives a JSON Web Token (JWT), which in turn can be used to assume an IAM role. The cluster's OIDC issuer URL can be retrieved as shown below.
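A hedged example of retrieving the issuer URL with the AWS CLI (the cluster name is a placeholder):

```sh
# Print the OIDC issuer URL of the cluster
aws eks describe-cluster --name myeks \
  --query "cluster.identity.oidc.issuer" --output text
```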
IAM role for service account¶
IAM roles for service accounts (IRSA) let us define an IAM role for the Pods themselves, rather than for the EC2 instances running the Pods. This provides fine-grained access control.
To use IAM roles for service accounts in the cluster, we must create an IAM OIDC identity provider, as sketched below.
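A minimal sketch with `eksctl` (the cluster name, service account name, and attached policy are placeholders):

```sh
# Register the cluster's OIDC issuer as an IAM identity provider
eksctl utils associate-iam-oidc-provider --cluster myeks --approve

# Create a Kubernetes service account bound to a new IAM role
eksctl create iamserviceaccount \
  --cluster myeks \
  --namespace default \
  --name my-app-sa \
  --attach-policy-arn arn:aws:iam::aws:policy/AmazonS3ReadOnlyAccess \
  --approve
```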
Fargate profile¶
A Fargate profile specifies which Pods use Fargate. EKS uses a dedicated controller to schedule Pods onto Fargate. Pods must match a Fargate profile at the time they're scheduled to run on Fargate, and they run only in private subnets. Each Pod is isolated within a microVM (Firecracker). A profile can be created as sketched below.
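For example, a profile matching a namespace could be created with `eksctl` (cluster, profile, and namespace names are placeholders):

```sh
# Pods created in the my-app namespace will be scheduled onto Fargate
eksctl create fargateprofile \
  --cluster myeks \
  --name my-fargate-profile \
  --namespace my-app
```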
ECS comparisons¶
- An EC2 instance with the ECS agent installed and configured is called a container instance. In Amazon EKS, it is called a worker node.
- In ECS, the unit that groups one or more containers is called a task. In Amazon EKS, it is called a pod.
- While Amazon ECS runs on AWS native technology, Amazon EKS runs Kubernetes.
- ECS scales to thousands of tasks; EKS is still limited in the number of nodes and total pods per cluster.
What to do the first time¶
- Install Kubernetes tools like `kubectl` and the AWS CLI:

```sh
sudo curl --silent --location -o /usr/local/bin/kubectl https://s3.us-west-2.amazonaws.com/amazon-eks/1.23.7/2022-06-29/bin/linux/amd64/kubectl
sudo chmod +x /usr/local/bin/kubectl
curl "https://awscli.amazonaws.com/awscli-exe-linux-x86_64.zip" -o "awscliv2.zip"
unzip awscliv2.zip
sudo ./aws/install
```
- Install `jq`, `gettext`, and other utilities:

```sh
sudo yum -y install jq gettext bash-completion moreutils
# Verify the path
for command in kubectl jq envsubst aws
do
  which $command &>/dev/null && echo "$command in path" || echo "$command NOT FOUND"
done
# Enable kubectl bash_completion
kubectl completion bash >> ~/.bash_completion
. /etc/profile.d/bash_completion.sh
. ~/.bash_completion
```
- Download `eksctl` (eksctl.io). (It also installs `kubectl`.)

```sh
brew tap weaveworks/tap
brew install weaveworks/tap/eksctl
# Verify it
eksctl version
```
- Create an IAM role and attach the required Amazon EKS managed IAM policy to it. Kubernetes clusters managed by Amazon EKS make calls to other AWS services on our behalf to manage the resources that we use with the service.

```sh
# under the labs/eks folder
aws iam create-role \
  --role-name myAmazonEKSClusterRole \
  --assume-role-policy-document file://"eks-cluster-role-trust-policy.json"
# Attach the required Amazon EKS managed IAM policy to the role.
aws iam attach-role-policy \
  --policy-arn arn:aws:iam::aws:policy/AmazonEKSClusterPolicy \
  --role-name myAmazonEKSClusterRole
```
Working with cluster¶
To create an EKS cluster with EC2 or Fargate, within its own VPC, in private subnets, with security groups, we can use different approaches:
- AWS Console.
- Using the `eksctl` command line and a yaml config file defining the cluster.
- Using CDK to create the infrastructure and then the EKS cluster itself. See examples in the product doc, and a CDK example in the `labs/cdk/eks-single` folder or the `labs/eks/eks-cdk` folder for a Python-based deployment.
- Using CloudFormation for the VPC infrastructure with a predefined stack.
eksctl approach¶
Based on the EKS workshop, using Cloud9, here is a quick summary of the steps:
- Create workspace in Cloud9 and install dependencies (See previous section).
- Create an IAM role named `eks-admin` with AdministratorAccess, and attach it to the Cloud9 EC2 instance via: Actions > Security > Modify IAM Role.
- Update the Cloud9 workspace to prevent Cloud9 from managing IAM credentials dynamically (this is not compatible with the EKS IAM authentication): Gear > AWS Settings.
- Save the region and account as environment variables and configure the AWS CLI:

```sh
echo "export ACCOUNT_ID=${ACCOUNT_ID}" | tee -a ~/.bash_profile
echo "export AWS_REGION=${AWS_REGION}" | tee -a ~/.bash_profile
aws configure set default.region ${AWS_REGION}
aws configure get default.region
# validate that the Cloud9 IDE is using the correct IAM role
aws sts get-caller-identity --query Arn | grep eks-admin -q && echo "IAM role valid" || echo "IAM role NOT valid"
```
- Use `eksctl create cluster`, which creates CloudFormation stacks to deploy an EKS cluster with managed nodes (a minimal sketch of such a config file follows):

```sh
eksctl create cluster -f eks-cluster.yaml
```

See the schema definition here. The cluster credentials are added to `~/.kube/config`.
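As an illustration only (the repository's actual `eks-cluster.yaml` is not reproduced here), a minimal ClusterConfig for a managed node group might look like the following, with all names and sizes as placeholders:

```sh
# Write a minimal eksctl ClusterConfig (hypothetical values)
cat > eks-cluster.yaml <<'EOF'
apiVersion: eksctl.io/v1alpha5
kind: ClusterConfig
metadata:
  name: myeks
  region: us-west-2
managedNodeGroups:
  - name: managed-ng
    instanceType: m5.large
    desiredCapacity: 3
    privateNetworking: true
EOF
```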
- Update kubeconfig to interact with the cluster:

```sh
aws eks update-kubeconfig --name MyEKS --region ${AWS_REGION}
```

- Verify kubectl works with the cluster:

```sh
kubectl get nodes
```
- Export the worker role name:

```sh
STACK_NAME=$(eksctl get nodegroup --cluster myeks -o json | jq -r '.[].StackName')
ROLE_NAME=$(aws cloudformation describe-stack-resources --stack-name $STACK_NAME | jq -r '.StackResources[] | select(.ResourceType=="AWS::IAM::Role") | .PhysicalResourceId')
echo "export ROLE_NAME=${ROLE_NAME}" | tee -a ~/.bash_profile
```
- Create an eks-admin service account and cluster role binding:

```sh
kubectl apply -f eks-admin-service-account.yaml
```
- Deploy Prometheus (be sure to have the helm CLI):

```sh
kubectl create namespace prometheus
helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm upgrade -i prometheus prometheus-community/prometheus \
  --namespace prometheus \
  --set alertmanager.persistentVolume.storageClass="gp2",server.persistentVolume.storageClass="gp2"
kubectl get pods -n prometheus
```

To access the Prometheus web app:

```sh
kubectl --namespace=prometheus port-forward deploy/prometheus-server 8080:9090
```
- For administration, deploy the Kubernetes dashboard:

```sh
kubectl apply -f https://raw.githubusercontent.com/kubernetes/dashboard/v2.7.0/aio/deploy/recommended.yaml
# Connect to the dashboard by first getting the admin user secret
kubectl -n kube-system describe secret $(kubectl -n kube-system get secret | grep eks-admin | awk '{print $1}')
# the following could also be used
aws eks get-token --cluster-name myeks | jq -r '.status.token'
# Then local proxy
kubectl proxy --port=8080 --disable-filter=true
# append the api to the exported URL:
# api/v1/namespaces/kubernetes-dashboard/services/https:kubernetes-dashboard:/proxy/?#!/login
```
- For monitoring, it is interesting to deploy the AWS Distro for OpenTelemetry to collect metrics and logs into AWS CloudWatch, and use the Container Insights feature of CloudWatch to see application performance. Deploy the AWS Distro for OpenTelemetry collector as a DaemonSet with the following command:

```sh
curl https://raw.githubusercontent.com/aws-observability/aws-otel-collector/main/deployment-template/eks/otel-container-insights-infra.yaml | kubectl apply -f -
# Verify the running pods
kubectl get pods -l name=aws-otel-eks-ci -n aws-otel-eks
```

After a few minutes, CloudWatch should show 3 log groups created by the OTEL DaemonSet.
- Verify nodes and pods:

```sh
kubectl get nodes -o wide
# across namespaces
kubectl get pods --all-namespaces -o wide
```
- Deploy an application using a deployment, services, and a service account. It is important to use IAM roles for service accounts to control which AWS services each Pod can access, instead of IAM roles and policies set at the EC2 level, since all Pods on the same EC2 instance would otherwise share the same policies. This applies the principle of least privilege.
- Expose the application to the internet. To do so we need to deploy the AWS Load Balancer Controller inside the EKS cluster (an example Ingress is sketched after this list). Using Helm we can do:

```sh
helm repo add eks https://aws.github.io/eks-charts
# Install the TargetGroupBinding CRDs
kubectl apply -k "github.com/aws/eks-charts/stable/aws-load-balancer-controller//crds?ref=master"
# install controller
helm install aws-load-balancer-controller eks/aws-load-balancer-controller -n kube-system \
  --set clusterName=web-host-on-eks \
  --set serviceAccount.create=false \
  --set serviceAccount.name=aws-load-balancer-controller
```
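Once the controller is running, an internet-facing ALB can be requested by an Ingress like this hedged sketch (the service name `my-web-svc` and its port are placeholders for the deployed application):

```sh
# Ingress handled by the AWS Load Balancer Controller (ingressClassName: alb)
cat <<'EOF' | kubectl apply -f -
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: my-web-ingress
  annotations:
    alb.ingress.kubernetes.io/scheme: internet-facing
    alb.ingress.kubernetes.io/target-type: ip
spec:
  ingressClassName: alb
  rules:
    - http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: my-web-svc
                port:
                  number: 80
EOF
```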
Delete cluster¶
- List all services:

```sh
kubectl get svc --all-namespaces
```

- Delete any services that have an associated EXTERNAL-IP value. These services are fronted by an ELB load balancer, and we must delete them in Kubernetes to allow the load balancer and associated resources to be properly released:

```sh
kubectl delete svc <service-name>
```

- Delete the cluster:

```sh
eksctl delete cluster --name <cluster name>
```
EKS with Fargate node using CDK¶
AWS Fargate is a technology that provides on-demand, right-sized compute capacity for containers. We control which pods start on Fargate and how they run with Fargate Profiles.
- See the EKS Fargate getting started instructions.
- Fargate profiles are associated with namespaces.
- Only private subnets are supported for pods that are running on Fargate.
- Pods that match a selector are scheduled on Fargate.
- Kubernetes affinity/anti-affinity rules do not apply and aren't necessary with Amazon EKS Fargate pods.
Here is an example of a CDK declaration for a Fargate cluster. See the local CDK file in `labs/eks/cdk-fargate`:

```python
# EKS cluster that only uses Fargate capacity
aws_eks.FargateCluster(self, "MyEKS",
    version=aws_eks.KubernetesVersion.V1_24)
```
To add a FargateProfile to an existing cluster:

```python
cluster = aws_eks.Cluster(self, 'demo-cluster',
    masters_role=self.eks_admin_role,
    vpc=self.vpc,
    default_capacity=0,
    vpc_subnets=[aws_ec2.SubnetSelection(subnet_type=aws_ec2.SubnetType.PRIVATE_WITH_EGRESS)],
    version=aws_eks.KubernetesVersion.V1_24,
    output_cluster_name=True
)
cluster.add_fargate_profile("MyProfile",
    selectors=[aws_eks.Selector(namespace="default")]
)
```
See Python cdk eks documentation.
If we do not specify a VPC while defining the cluster, one will be created on our behalf, which we can then access via `cluster.vpc`.
The Fargate cluster declaration above, for example, will create a CloudFormation template with:
- VPC, two public subnets, route table for each public subnet, with default route to internet gateway
- An internet gateway attached to the VPC
- An Elastic IP address per public subnet (used by the NAT gateways)
- NAT gateway for each public subnet
- Two private subnets, with route table and route with outbound to NAT gateway
- An IAM role to allow access to EKS principal with cluster access
- An IAM role to be able to create EKS cluster
- An IAM role for the EKS master admin user
- An IAM role for fargate pod execution
- Security group for the control plane
- Custom Kubernetes resources for authentication
EKS Blueprint¶
EKS Blueprints is an open-source development framework that abstracts the complexities of cloud infrastructure from developers.
Concepts¶
- A blueprint combines clusters, add-ons, and teams into a cohesive object that can be deployed as a whole.
- A team is a logical grouping of IAM identities that have access to one or more Kubernetes namespaces, or cluster administrative access, depending on the team type.
- Once a blueprint is configured, it can be easily deployed across any number of AWS accounts and regions.
- Blueprints also leverage GitOps tooling to facilitate cluster bootstrapping and workload onboarding.
EKS with CDK Hands-on¶
This is a summary of the steps to get a running demonstration of creating an EKS cluster and Day 2 add-ons.
Single Cluster¶
- Using CDK TypeScript, here are the commands:

```sh
mkdir my-eks-blueprints
cd my-eks-blueprints
cdk init app --language typescript
npm i typescript@~4.8.4
npm i @aws-quickstart/eks-blueprints
```
- If not done before, bootstrap CDK (the following command bootstraps CDK in 3 regions):

```sh
cdk bootstrap --trust=$ACCOUNT_ID \
  --cloudformation-execution-policies arn:aws:iam::aws:policy/AdministratorAccess \
  aws://$ACCOUNT_ID/$AWS_REGION aws://$ACCOUNT_ID/us-east-2 aws://$ACCOUNT_ID/us-east-1
```

See the code in `labs/cdk/eks-single`.
- Create a cluster using the eks-blueprints package, which is published as an npm module:

```typescript
import * as cdk from 'aws-cdk-lib';
import { Construct } from 'constructs';
import * as blueprints from '@aws-quickstart/eks-blueprints';

export default class ClusterConstruct extends Construct {
  constructor(scope: Construct, id: string, props?: cdk.StackProps) {
    super(scope, id);

    const account = props?.env?.account!;
    const region = props?.env?.region!;

    const blueprint = blueprints.EksBlueprint.builder()
      .account(account)
      .region(region)
      .addOns()
      .teams()
      .build(scope, id + '-stack');
  }
}
```
And in the app, instantiate the cluster construct:

```typescript
#!/usr/bin/env node
import 'source-map-support/register';
import * as cdk from 'aws-cdk-lib';
import ClusterConstruct from '../lib/my-eks-blueprints-stack';

const app = new cdk.App();
const account = process.env.CDK_DEFAULT_ACCOUNT!;
const region = process.env.CDK_DEFAULT_REGION;
const env = { account, region };

new ClusterConstruct(app, 'cluster', { env });
```
- Deploy the cluster with `cdk deploy cluster-stack`, then configure kubectl:

```sh
export KUBE_CONFIG=$(aws cloudformation describe-stacks --stack-name cluster-stack | jq -r '.Stacks[0].Outputs[] | select(.OutputKey|match("ConfigCommand"))| .OutputValue')
$KUBE_CONFIG
kubectl get svc
```
Onboard teams¶
We want two teams: platform and application teams.
```sh
mkdir teams && cd teams && mkdir platform-team && mkdir application-team
aws iam create-user --user-name platform
aws iam create-user --user-name application
```
Under `platform-team`, create an `init.ts` file and add an IAM Principal to grant users access to the platform team using their IAM credentials:
```typescript
import { ArnPrincipal } from "aws-cdk-lib/aws-iam";
import { PlatformTeam } from '@aws-quickstart/eks-blueprints';

export class TeamPlatform extends PlatformTeam {
  constructor(accountID: string) {
    super({
      name: "platform",
      users: [new ArnPrincipal(`arn:aws:iam::${accountID}:user/platform`)]
    })
  }
}
```
Do the same for the application team. Then modify the cluster definition to add the team instances; `cdk deploy cluster-stack` will create a new namespace for the application team.
```typescript
import { TeamPlatform, TeamApplication } from '../teams';
...
.teams(new TeamPlatform(account), new TeamApplication('burnham', account))
```
A command like `kubectl describe role -n team-burnham` gives information on the role and the actions its members can perform.
Using Kubernetes constructs such as namespaces, quotas, and network policies, we can prevent applications deployed in different namespaces from communicating with each other (see the NetworkPolicy sketch below).
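As a hedged illustration (it assumes a CNI that enforces NetworkPolicy, and reuses the `team-burnham` namespace from the example above), a policy that only allows ingress traffic from Pods in the same namespace could look like:

```sh
# Deny ingress from other namespaces: only Pods of team-burnham may connect
cat <<'EOF' | kubectl apply -f -
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-same-namespace-only
  namespace: team-burnham
spec:
  podSelector: {}
  policyTypes:
    - Ingress
  ingress:
    - from:
        - podSelector: {}
EOF
```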
- To see the application user's access limitations, log in to the console in incognito mode with the account ID and `application` as the user (be sure to have set a password in IAM for the `application` user). Once logged in, assume the role `cluster-stack-teamburnhamAccessRole3...`, then go to the EKS console. We should see an error message that the Team Burnham user is NOT allowed to list deployments in all namespaces, but after selecting the `team-burnham` namespace we should see pods and other elements.
- The `platform` user, with the role `cluster-stack-templatformAccessRole5...`, can access the cluster as admin and see all namespaces.
Adding add-ons¶
See the list of supported add-ons. To add them, use the `addOns()` function in the blueprint:
```typescript
const blueprint = blueprints.EksBlueprint.builder()
  .account(account)
  .region(region)
  .addOns(new blueprints.ClusterAutoScalerAddOn)
  .teams(new TeamPlatform(account), new TeamApplication('burnham', account))
  .build(scope, id + '-stack');
```
Autoscaling the cluster¶
- Horizontal Pod Autoscaler (HPA) scales the pods in a deployment or replica set. It is implemented as a K8s API resource and a controller. The controller manager queries resource utilization against the metrics specified in each HorizontalPodAutoscaler definition. It obtains the metrics from either the resource metrics API (for per-pod resource metrics) or the custom metrics API (for all other metrics).

```sh
# Example of setting HPA on a deployment
kubectl autoscale deployment php-apache \
  --cpu-percent=50 `# the target average CPU utilization` \
  --min=1 `# the lower limit for the number of pods that can be set by the autoscaler` \
  --max=10
kubectl get hpa
```
- Cluster Autoscaler (CAS) automatically adjusts the size of a Kubernetes cluster so that all pods have a place to run and there are no unneeded nodes. Cluster Autoscaler typically runs as a Deployment in our cluster. It delegates to the EC2 Auto Scaling Groups to manage node groups, so be sure to configure the ASG with min, max, and desired EC2 instance counts. With Fargate we do not need the Cluster Autoscaler. When creating a managed node group, we can choose either the On-Demand or Spot capacity type. The allocation strategy to provision Spot capacity is set to capacity-optimized to ensure that the Spot nodes are provisioned in the optimal Spot capacity pools.
- Karpenter observes the aggregate resource requests of unscheduled pods and makes decisions to launch and terminate nodes, to minimize scheduling latencies and infrastructure cost. It is installed into the EKS cluster using Helm and configured with CRDs. Karpenter scales up nodes in a group-less approach: it selects which nodes to scale based on the number of pending pods and the Provisioner configuration, determines what the best instances for the workload look like, and then provisions those instances. Karpenter uses cordon and drain best practices to terminate nodes. When an empty node is terminated can be controlled with `ttlSecondsAfterEmpty` (a minimal Provisioner sketch follows the note below).
See support note: How do I install Karpenter in my Amazon EKS cluster.
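A hedged Provisioner sketch, assuming the older `karpenter.sh/v1alpha5` API generation that uses `ttlSecondsAfterEmpty` (all values are placeholders; newer Karpenter releases replace Provisioners with the NodePool API):

```sh
cat <<'EOF' | kubectl apply -f -
apiVersion: karpenter.sh/v1alpha5
kind: Provisioner
metadata:
  name: default
spec:
  requirements:
    - key: karpenter.sh/capacity-type
      operator: In
      values: ["spot", "on-demand"]
  limits:
    resources:
      cpu: "100"              # cap the total vCPU Karpenter may provision
  providerRef:
    name: default             # assumes an AWSNodeTemplate with subnet/SG selectors
  ttlSecondsAfterEmpty: 30    # terminate a node 30s after it becomes empty
EOF
```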
Deeper Dive¶
- Product documentation - Elastic Kubernetes Service
- EKS workshops.
- Web Application Hosts on EKS Workshop uses terraform to create the VPC, and EKS cluster + deploy a web app as an example.
- EKS Blueprints for CDK Workshop.
- Getting started with Amazon EKS – eksctl.
- EKS Best Practices Guides in github.
- EKS Blueprint bootstrap.
- Amazon EKS Blueprints for Terraform.
- EKS SaaS workshop.
- kOps open source project for running k8s cluster on any cloud provider.
- Cluster Autoscaler CAS FAQs.
- EKS improves control plane scaling and update speed by up to 4x.
- Eliminate Kubernetes node scaling lag with pod priority and over-provisioning.
- How H2O.ai optimized and secured their AI/ML infrastructure with Karpenter and Bottlerocket.