METRO AG

Site Reliability Engineer

METRO AG  •  Pune, IN (Onsite)  •  18 days ago

Job Description

About us: 

Metro Global Solution Center (MGSC) is the internal solution partner for METRO, a €29.8 billion international wholesaler with operations in 32 countries through 625 stores and a team of 91,000 people globally. METRO operates in a further 10 countries with its Food Service Distribution (FSD) business and is thus active in a total of 34 countries.

MGSC is present in Pune (India), Düsseldorf (Germany) and Szczecin (Poland). We provide HR, Finance, IT and Business operations support to 31 countries, speak 24+ languages and process over 18,000 transactions a day. We are setting tomorrow’s standards for customer focus, digital solutions, and sustainable business models. For over 10 years, we have been providing services and solutions from our two locations in Pune and Szczecin. This has given us extensive experience in how best to serve our internal customers with high quality and passion. We believe that we can add value, drive efficiency, and satisfy our customers.

Website: https://www.metro-gsc.in

Company Size: 600-650

Headquarters: Pune, Maharashtra, India

Type: Privately Held

Inception: 2011



We are seeking a Senior Site Reliability Engineer with strong experience in building and maintaining scalable, resilient systems. The ideal candidate will have hands-on expertise in cloud-native technologies, infrastructure as code, observability, and automation, with a focus on Google Cloud Platform (GCP).

Key Responsibilities

  • Ensure the stability and reliability of cloud-native applications deployed on GCP, containerized with Docker and orchestrated via Kubernetes.
  • Define, implement, and monitor SLOs, SLAs, and SLIs to measure system performance and user experience.
  • Automate infrastructure provisioning using Terraform and manage Kubernetes configurations with Kustomize and Helm.
  • Develop and maintain monitoring and alerting systems using Datadog and GCP-native tools.
  • Conduct incident analysis and postmortems to drive continuous improvement.
  • Collaborate with development teams to integrate reliability practices into CI/CD pipelines using GitHub Actions.
  • Manage and troubleshoot database systems, particularly PostgreSQL and Cassandra.
  • Apply networking knowledge and Linux system administration skills to troubleshoot and optimize system connectivity and performance.

Qualifications

Must-Have Qualifications

Education

  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or equivalent practical experience.

Work Experience & Skills

  • 6+ years of experience in Site Reliability Engineering.
  • Proven experience designing and operating elastic, resilient systems in cloud environments.
  • Strong understanding of GCP, Kubernetes, and container orchestration.
  • Proficiency in infrastructure as code and configuration management tools (Terraform, Helm, Kustomize).
  • Experience with monitoring and observability tools (Datadog, GCP Monitoring).
  • Solid scripting skills in Bash and familiarity with automation frameworks.
  • Experience with CI/CD pipelines, especially using GitHub Actions.
  • Familiarity with networking fundamentals and troubleshooting.
  • Strong coding skills and ability to develop reliability-focused tooling.
  • Excellent communication skills in English (written and spoken).

Other Requirements

  • Strong problem-solving skills and a process-oriented mindset.
  • Ability to work independently and collaboratively in a fast-paced environment.
  • Passion for clean code, automation, and continuous improvement.

Nice-to-Have

  • Familiarity with monitoring tools (e.g., Datadog, Prometheus, GCP Monitoring).
  • Experience working in Agile/Scrum teams.

About METRO AG

METRO is a leading international food wholesaler which specialises in serving the needs of hotels, restaurants, and caterers (HoReCa) as well as independent merchants (Traders). Around the world, METRO has approx. 15 million customers who benefit from the wholesale company’s unique multichannel mix: customers can purchase their goods in one of the large stores in their area as well as by delivery (Food Service Distribution, FSD) – all digitally supported and connected. In parallel, METRO MARKETS is being developed as an international online marketplace for the needs of professional customers which has been growing and expanding continuously since 2019. Acting sustainably is one of the company principles of METRO which has been listed in various sustainability indices and rankings, including MSCI, Sustainalytics and CDP. METRO operates in more than 30 countries and employs over 85,000 people worldwide. In financial year 2023/24, METRO generated sales of €31 billion. More information can be found at MPULSE.de, our online magazine.

Data protection notice:

https://www.metroag.de/en/data-privacy/social-media

Imprint:

https://www.metroag.de/en/imprint

Industry: Wholesale & Distribution

Company Size: 1,001-5,000 employees

Headquarters: Düsseldorf, DE

Year Founded: 1964