Using taints and tolerations in EKS to run machine learning pods on GPU nodes

One of our clients uses EKS and they had a problem: running machine learning pods on a special Kubernetes node group that has GPU nodes. We solved this problem using Kubernetes taints, tolerations, labels, and node selectors. This is how to configure it: Choose an Amazon EKS-optimized Arm Amazon Linux AMI and install […]