Senior Machine Learning Engineer

TyrAds

Bali, Kuta, Indonesia

5-7 Years

Save

Posted 18 hours ago
Be among the first 10 applicants

Early Applicant

Job Description

Job Description

Location: Bali

About Us

TyrAds is a leading tech-driven loyalty and rewards platform that partners with businesses to create meaningful and rewarding experiences for their customers. With millions of users across our platforms, we specialize in innovative AdTech solutions powered by big data, machine learning, and deep learning technologies.

Our systems process over 10,000 events per second, delivering real-time insights at scale. We build seamless user experiences across web, Android, and iOS, while leveraging ML/AI to power personalization and advertising effectiveness.

We are a team of 90+ employees worldwide and growing, guided by values of transparency, ownership, learning from mistakes, and respect for diverse cultures. Our product development follows Agile methodologies with weekly releases and collaboration tools such as GitHub, Jira, and Slack.

We're looking for a Senior ML Engineer to build real-time ML systems for our mobile advertising platform. You'll own the full lifecycle - feature engineering, model training, deployment, and production monitoring.

Job Requirements

5+ years Python, 3+ years ML Engineering
Strong MLFlow experience (Feature Store, MLflow, Unity Catalog)
AWS production experience (ECS, S3, CloudWatch)
Built real-time ML systems with latency requirements

Nice To Have

AdTech, fraud detection, or recommendation systems experience
Kafka, Redis, streaming pipelines
Grafana, Prometheus, Sentry

Tech Stack: Databricks, PySpark, PyTorch, LightGBM, FastAPI, AWS ECS, Redis, Kafka, MLflow, Grafana, Prometheus, Sentry

Job Responsibilities

What You'll Build:

ML Serving Infrastructure

Real-time, low-latency model serving
Deep learning (PyTorch) and tree-based models (LightGBM/XGBoost) behind FastAPI on AWS ECS

MLOps Pipelines

Model promotion workflows: local staging production
Automated validation gates before promotion (offline eval metrics, data quality checks)
Retraining pipelines via Databricks Workflows (scheduled + drift-triggered)
CI/CD for both code and model artifacts
Blue-green or canary deployments for safe rollouts

Monitoring & Observability

Model quality tracking: accuracy, precision, recall, AUC over time
Data drift and feature drift detection with automated alerts
Feature freshness monitoring (detecting feature store lag before it impacts predictions)
Inference latency: p50, p95, p99 via Grafana dashboards
Sentry for error tracking, CloudWatch + Prometheus for infra metrics
A/B testing infrastructure for model experiments in production

You Will