EKS News 031

  • Understanding and Cost Optimizing Amazon EKS Control Plane Logs
    • After describing the different information captured in the control plane logs, this blog proposes several ways you can optimize the cost of retaining and analyzing those logs.
      • Disable or filter control plane logging in non-production environments.
      • CloudWatch Logs retains logs indefinitely by default. To lower your costs, impose a retention policy.
      • Export your logs to S3 for long-term archival and/or use Athena to analyze the logs in S3.
      • Enable GuardDuty Kubernetes audit log (KAL) monitoring.
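
The first two suggestions above (turning off log types you do not need and imposing a retention policy) could be scripted roughly as follows. This is a minimal sketch, not code from the blog: the helper names are hypothetical, the clients are assumed to be boto3 `logs` and `eks` clients (or anything exposing the same two methods), and the list of accepted retention values is taken from the CloudWatch Logs API reference as of this writing, so verify it before relying on it.

```python
# Retention values accepted by CloudWatch Logs PutRetentionPolicy
# (assumption: copied from the API reference; check for updates).
ALLOWED_RETENTION_DAYS = {
    1, 3, 5, 7, 14, 30, 60, 90, 120, 150, 180, 365, 400, 545, 731,
    1096, 1827, 2192, 2557, 2922, 3288, 3653,
}

# The five EKS control plane log types.
CONTROL_PLANE_LOG_TYPES = (
    "api", "audit", "authenticator", "controllerManager", "scheduler",
)


def control_plane_log_group(cluster_name):
    """EKS writes control plane logs to /aws/eks/<cluster-name>/cluster."""
    return f"/aws/eks/{cluster_name}/cluster"


def set_retention(logs_client, cluster_name, days):
    """Replace the default 'never expire' setting with a retention policy."""
    if days not in ALLOWED_RETENTION_DAYS:
        raise ValueError(f"{days} is not an accepted retention period")
    log_group = control_plane_log_group(cluster_name)
    logs_client.put_retention_policy(
        logGroupName=log_group, retentionInDays=days
    )
    return log_group


def disable_log_types(eks_client, cluster_name, types=CONTROL_PLANE_LOG_TYPES):
    """Turn off the given control plane log types for one cluster."""
    unknown = set(types) - set(CONTROL_PLANE_LOG_TYPES)
    if unknown:
        raise ValueError(f"unknown log types: {sorted(unknown)}")
    eks_client.update_cluster_config(
        name=cluster_name,
        logging={"clusterLogging": [{"types": list(types), "enabled": False}]},
    )
```

With boto3 installed and credentials configured, the calls would look like `set_retention(boto3.client("logs"), "dev-cluster", 30)` and `disable_log_types(boto3.client("eks"), "dev-cluster", ["scheduler"])`, where `dev-cluster` is a placeholder cluster name. Passing the clients in as arguments keeps the helpers easy to exercise with a stub before pointing them at a real account.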
  • Introducing vended logs for Amazon Managed Service for Prometheus
    • This blog demonstrates how you can use vended logs to troubleshoot your alert manager pipeline and correct common problems, such as missing permissions, an invalid alert manager template, and rule evaluation failures.
    • Vended logs are basically Prometheus logs that are routed to CloudWatch Logs.
  • How to Apply GitOps to Everything Using Amazon Elastic Kubernetes Service (Amazon EKS), Crossplane, and Flux
    • Walks through how to use a GitOps approach to vend workload clusters and then deploy composite applications, e.g. an application that interfaces with an RDS database, onto those clusters using Crossplane and Flux.
  • Scaling AI and Machine Learning Workloads with Ray on AWS
    • Describes the AWS contributions to the Ray community to enable enterprise-scale AI and machine learning deployments with Ray on AWS. These contributions and AWS service integrations allow AWS customers to scale their Ray-based workloads utilizing secure, cost-efficient, and enterprise-ready AWS services across the complete end-to-end AI and machine learning pipeline with both CPUs and GPUs.
    • EKS supports Ray on Kubernetes through the KubeRay EKS Blueprint, contributed by the Amazon EKS team, which quickly deploys a scalable and observable Ray cluster on your Amazon EKS cluster. As compute demand increases or decreases, Ray works with the Kubernetes Cluster Autoscaler (CAS) to resize the Amazon EKS cluster as needed.