Search by job, company or skills

Glints

Site Reliability Engineer

4-6 Years
Save
  • Posted 17 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

We're looking for a Site Reliability Engineer (SRE) to help build and maintain a highly reliable, scalable, and secure production environment. You'll work closely with engineering teams to improve system availability, automate operations, and respond to production incidents.

What You'll Do

  • Monitor and maintain production systems to ensure high availability and performance.
  • Respond to incidents, troubleshoot production issues, and drive root cause analysis.
  • Improve system reliability through automation, monitoring, and observability.
  • Design and implement deployment, rollback, and disaster recovery strategies.
  • Build and maintain monitoring, alerting, and health check solutions.
  • Collaborate with Software Engineers, Platform Engineers, AI Engineers, and Security teams to improve platform reliability.
  • Develop operational runbooks and continuously improve production processes.

What We're Looking For

  • 4+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or Production Engineering.
  • Strong knowledge of Linux, networking, and container technologies (Docker or Podman).
  • Experience with monitoring and observability tools such as Prometheus, Grafana, Alertmanager, or OpenTelemetry.
  • Experience with relational databases, distributed systems, and production troubleshooting.
  • Familiarity with Python, CI/CD pipelines, infrastructure automation, and configuration management tools such as Ansible.
  • Experience participating in on-call rotations and handling production incidents.

Nice to Have

  • Experience with HAProxy or Nginx.
  • Knowledge of PostgreSQL and systemd.
  • Experience working in cloud or containerized production environments.

Linux • Python • Docker/Podman • PostgreSQL • Prometheus • Grafana • Alertmanager • OpenTelemetry • Nginx • HAProxy • Git • CI/CD • Ansible

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 150599785

Similar Jobs

Indonesia

Skills:

KibanaPostgreSQLPrometheusGrafanaRedisUbuntuJenkinsGcpLinuxDockerTerraformMySQLElasticsearchCentosMongoDBKubernetesAWSGKEGithub actionEKSElastic

Indonesia

Skills:

NginxGcpDockerTerraformMySQLPostgreSQLPulumiKubernetesRedisAWSOpenTelemetry

Indonesia

Skills:

KibanaPostgreSQLPrometheusKafkaBashGrafanaRedisRabbitmqGitLinux OsDockerTerraformMySQLElasticsearchMongoDBPythonAWSKarpenterK8SGo

Indonesia

Skills:

PrometheusBashGrafanaGroovyJenkinsTerraformDockerAnsibleKubernetesPythonGitOpsArgoCD

Indonesia

Skills:

Docker SwarmVpcIp SubnettingWindowsBash ScriptingFirewallGcpVpnDockerTerraformLinuxAnsibleKubernetesAWSnetworking fundamentalsAlibaba Cloud