Snapp!

Infrastructure Observability Engineer

Snapp!  •  Tehran, IR (Onsite)  •  14 days ago

Job Description

Our Journey So Far

At Snapp, we’re redefining how cities move. Our ride-hailing and mobility platform connects millions of riders and drivers every day, delivering safe, reliable, and efficient transport solutions. Powered by real-time data and robust infrastructure, we make urban travel faster, simpler, and more sustainable.

We operate with the mindset of a global tech leader and the agility of a startup, building services that scale across markets while staying responsive to local needs.

Your Impact

As an Infrastructure Observability Engineer within the Platform team, you will work across observability platforms, infrastructure monitoring, and DevOps automation to ensure comprehensive visibility and high system reliability. You will maintain and enhance monitoring and logging stacks, analyze infrastructure events, and drive proactive improvements that strengthen performance and resilience. This highly technical role emphasizes automation and continuous optimization rather than reactive support.

What You’ll Drive Forward

  • Build, operate, and optimize monitoring and logging systems (Prometheus, Grafana, ELK, Zabbix, etc.).

  • Ensure full observability coverage for infrastructure, networks, and services.

  • Maintain alerting rules, dashboards, SLO/SLA metrics, and anomaly detection.

  • Analyze logs and metrics to identify patterns and potential risks.

  • Monitor infrastructure health across compute, storage, virtualization, and network layers.

  • Perform root cause analysis of network-related incidents (routing/switching, load balancing, DNS, firewalls).

  • Collaborate with network and datacenter teams on incident follow-ups.

  • Maintain knowledge of network topologies, protocols, and traffic flows.

  • Support improvement of infrastructure reliability and performance.

  • Work with CI/CD pipelines to ensure reliable delivery and deployment processes.

  • Develop automation for observability, monitoring, and operational workflows.

  • Maintain Linux-based systems and automate routine infrastructure tasks.

  • Contribute to reliability engineering initiatives (IaC, Docker, GitOps, auto-remediation, etc.).

What Powers Your Drive

  • At least 2 years of experience in NOC/IOC, SRE, infrastructure operations, DevOps, or a similar technical role.

  • Strong hands-on experience with monitoring & logging stacks (Prometheus, Grafana, ELK, Zabbix, etc.).

  • Solid understanding of networking fundamentals at CCNA level (routing, switching, VLANs, BGP, OSPF, load balancing).

  • Strong Linux administration background.

  • Familiarity with CI/CD tools (GitLab CI, ArgoCD, Jenkins, GitHub Actions, etc.).

  • Hands-on experience with containerization (Docker) and service mesh tools.

  • Practical knowledge of automation using Bash, Python, or similar scripting languages.

  • Ability to read and interpret logs, metrics, traces, and alerts.

  • Strong communication and documentation skills, especially in technical reporting.

Preferred Qualifications

  • Experience designing observability architecture for large-scale infrastructure.

  • Experience contributing to reliability engineering initiatives (Terraform, Ansible, Docker, GitOps, auto-remediation, etc.).

  • Knowledge of ITIL Incident/Problem Management practices.

  • Experience with cloud infrastructure or private cloud platforms.

  • Experience with Kubernetes (cluster operation, troubleshooting, manifests, Helm, etc.).

Ready to Get on Board?

Help us shape the future of ride-hailing and urban mobility. Submit your CV and let’s build smarter cities together.

About Snapp!

Snapp is your all-in-one solution for transportation, delivery services, and more in Iran. The Snapp super app is more than a collection of services: it is a commitment to transforming the way you travel, explore, and connect. At Snapp, we believe in providing reliable, convenient, and affordable options that connect you with the places and services you need, right at your fingertips.

Established in 2014, Snapp has set a record of more than 4 million rides a day and is the largest and fastest-growing internet company in the Middle East. The Snapp super app currently offers 20 different services to over 60 million users in 287 cities. Our commitment to innovation and customer satisfaction has driven us to create an ecosystem that seamlessly integrates various services to meet the diverse needs of our users.

With the Snapp super app, you have access to a range of services tailored to enhance your daily life. Whether you're looking for a comfortable ride to your destination, a quick delivery of food or groceries, or even planning a trip to another country, we've got you covered.

At Snapp, we value your time and understand the importance of reliability. That's why we strive to provide prompt and efficient services, aiming to exceed your expectations every time. We continuously invest in cutting-edge technology and constantly improve our systems to deliver a seamless experience that caters to your needs.

We are proud to be a part of Iran's tech ecosystem, contributing to the growth and development of our nation's infrastructure.

Want to join us on this journey? We always welcome talent from all backgrounds to our international team.

Industry: IT & Software
Company Size: 1,001-5,000 employees
Headquarters: IR
Year Founded: 2014
Website: snapp.ir