Senior Site Reliability Engineer (Kannur)
Kannur, India
Posted: a week ago
Description
Role Overview
We are looking for a skilled and proactive Site Reliability Engineer (SRE) to take end-to-end ownership of production reliability, observability, and performance engineering across MyOperator’s AI-powered communication infrastructure.
This role is not operational-only — it requires strong system design thinking, deep troubleshooting ability, and a production ownership mindset. You will define reliability standards, build observability frameworks, lead incident response, and drive SLO-based engineering practices across distributed AWS and Kubernetes environments.
About MyOperator
MyOperator is a Business AI Operator platform that enables businesses, teams, and AI agents to work together seamlessly for customer operations such as Sales, Support, Escalations, Feedback, and Refund processes. With 12,000+ businesses using our platform, we operate at meaningful scale and power mission-critical communication workflows including voice bots, WhatsApp automation, and intelligent call routing. We are building for reliability, speed, and impact. MyOperator values ownership, critical thinking, and execution. This is a high-expectation, high-learning workplace where engineers are empowered to solve complex problems and build systems that directly affect customer outcomes.
Key Responsibilities
- Own production reliability, uptime, latency, and error budgets across critical services.
- Design and manage production-grade monitoring using Grafana, VictoriaMetrics (Prometheus), and AWS CloudWatch.
- Define and enforce SLIs, SLOs, and SLA thresholds for AI communication systems (voice bots, WhatsApp APIs, call routing).
- Build real-time operational dashboards for incident response, capacity planning, and leadership visibility.
- Implement end-to-end distributed tracing using OpenTelemetry (OTEL Collector).
- Design and maintain centralized logging with strong correlation between logs, metrics, and traces.
- Create SLO-based alerting systems with minimal noise and fa...
Apply on Kit Job: kitjob.in/job/4lb5s4
Highlights
Company name: MyOperator
Job position: Senior Site Reliability Engineer (Kannur)
More info about this ad
Senior Site Reliability Engineer (Kannur) has been posted in the Pāppinisseri Engineering category on Locanto.