Production Kubernetes experience (deployments, scaling, upgrades, debugging, networking).
Experience managing cluster autoscaling/capacity, ideally with Karpenter.
CI/CD expertise using Jenkins and GitHub Actions (pipeline/workflow as code, reusable templates).
Experience running ArgoCD / GitOps for continuous delivery.
Production experience with service mesh (mTLS, traffic management, routing, troubleshooting), ideally with Istio.
Strong Infrastructure-as-Code experience using Terraform ecosystem (Terraform + Terragrunt + Atlantis) for multi-environment provisioning, orchestration, and PR-based plan/apply workflows.
Observability experience with Datadog, Grafana/Prometheus, and ELK (Elasticsearch/Kibana).
Strong Linux + networking fundamentals (VPC, LB, DNS, TLS, TCP/IP).
Experience running production data and messaging systems, such as: RDS PostgreSQL, MongoDB, RabbitMQ, and MQTT (monitoring, backup/recovery, performance, troubleshooting).
Strong experience with GitHub-based workflows (branching, PR reviews, release/versioning practices).
Familiarity with Sonar/SonarQube and Dependabot (or equivalents).
NICE-TO-HAVE
Azure experience (App Service/Container App).
Experience in mobile app (Android/IOS) CI/CD.
Experience with HashiCorp Vault for secrets management (policies, auth methods, rotation).
Experience with Kubernetes packaging/releases using Helm charts (authoring, templating, versioning).
Multi-cluster Kubernetes or cross-region production experience.