Veeam Software

Senior Software Engineer, Reliabilty

Veeam Software  •  Bengaluru, IN (Hybrid)  •  3 months ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI at scale. As the market leader in both data resilience and data security posture management, Veeam is built for the convergence of identity, data, security, and AI risk. Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, who trust Veeam to keep their businesses running. Join us as we go fearlessly forward together, growing, learning, and making a real impact for some of the world’s biggest brands.

We are looking for a Senior Software Engineer, Reliability, you will serve as a hands-on technical leader within the SRE team, guiding senior engineers, influencing product development teams, and ensuring the systems we operate are built to be reliable, scalable, and observable from the ground up.

You will drive strategic initiatives, mentor others in the practice of SRE, and help define architectural best practices across our platform. This role is pivotal in aligning teams, enforcing high standards, and scaling SRE principles globally within Veeam.

Yours tasks will include

Reliability Engineering & Resilience

  • Design and evolve infrastructure to be highly available, fault tolerant, and scalable across public clouds (initially Azure, with future expansion plans to other providers).

  • Establish and maintain SLIs, SLOs, and error budgets that define and enforce reliability objectives.

  • Lead incident response, analysis, blameless postmortems, and sharing sessions in order to maximize learning across our entire engineering team and driving changes to the entire socio-technical engineering system.

Observability & Operational Excellence

  • Drive adoption of deep observability practices, ensuring telemetry, logs, metrics, and tracing are comprehensive and actionable.

  • Develop automation and self-healing tools to reduce toil and support Veeam’s fleet management strategy.

  • Participate in on-call rotations and lead operational excellence across the stack.

Engineering at Scale

  • Contribute to infrastructure as code (IaC), CI/CD systems, deployment automation, and scalable config management.

  • Integrate and extend monitoring and chaos engineering tools to validate reliability assumptions under load and failure conditions.

  • Implement testing strategies, canary deployments, and release validation pipelines to protect production environments and allow teams to safely deliver new features as quickly as possible.

Collaboration & Culture

  • Embed within product and platform teams to champion reliability from design through delivery.

  • Contribute to a learning culture focused on continuous improvement and proactive risk management.

  • Mentor engineers and advocate for DevOps/SRE best practices across global teams.

What we expect from you:

  • 5+ years of hands-on experience in a Software Engineering role with at least 2 years in Site Reliability, Platform Engineering, or similar.
  • Deep experience building systems on public cloud providers (Azure preferred)

  • Strong programming skills in JS, Node, Typescript, Go, Java, C#, or similar.

  • Proven track record in delivering monitoring, alerting, and observability tooling (e.g., Prometheus, Grafana, OpenTelemetry).

  • Experience with IaC tools like Terraform/Pulumi, and container orchestration (e.g., Kubernetes).

  • Solid understanding of distributed systems, cloud networking, and cloud-native system design.

  • Excellent communication and collaboration skills across geographies and disciplines.

Will be an added advantage

  • Experience working on large-scale B2B SaaS platforms.

  • Background in chaos engineering, resilience testing, performance testing, load testing, or incident learning programs.

  • Familiarity with compliance frameworks (e.g., ISO, SOC 2, GDPR, FEDRAMP/ CMMC).

We offer:

  • 18 paid vacation days, plus 4 extra global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
    Private medical coverage for you and up to four dependents
  • Life, accident, and disability insurance with enhanced coverage
  • Annual flexible wellbeing allowance for physical and mental wellness
  • Free confidential counselling and coaching via Employee Assistance Program (EAP), including legal and financial advice
  • Meal, fuel, and transportation benefits based on work arrangement
  • Daycare reimbursement and safe cab facility for eligible employees
  • Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops, and learning events like our annual Global Day of Learning

Please note: If the applicant is permanently located outside India, Veeam reserves the right to decline the application.


#LI-KP1
#Hybrid

Veeam Software is an equal opportunity employer and does not tolerate discrimination in any form on the basis of race, color, religion, gender, age, national origin, citizenship, disability, veteran status or any other classification protected by federal, state or local law. All your information will be kept confidential.

Please note that any personal data collected from you during the recruitment process will be processed in accordance with our Recruiting Privacy Notice

The Privacy Notice sets out the basis on which the personal data collected from you, or that you provide to us, will be processed by us in connection with our recruitment processes.

By applying for this position, you consent to the processing of your personal data in accordance with our Recruiting Privacy Notice

By submitting your application, you acknowledge that the information provided in your job application and any supporting documents is complete and accurate to the best of your knowledge. Any misrepresentation, omission, or falsification of information may result in disqualification from consideration for employment or, if discovered after employment begins, termination of employment.

Veeam Software

About Veeam Software

Welcome to Veeam’s LinkedIn page.

Follow us here for company news, product updates, events and more.

Veeam®, the #1 global market leader in data resilience, believes every business should be able to bounce forward after a disruption with the confidence and control of all their data whenever and wherever they need it. Veeam calls this radical resilience, and we’re obsessed with creating innovative ways to help our customers achieve it.

With Veeam, organizations achieve radical resilience through data security, data recovery, and data freedom for their hybrid cloud.

Veeam solutions are purpose-built for powering data resilience by providing data backup, data recovery, data freedom, data security, and data intelligence. With Veeam, IT and security leaders rest easy knowing that their apps and data are protected and always available across their cloud, virtual, physical, SaaS, and Kubernetes environments.

Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, including 67% of the Global 2000, that trust Veeam to keep their businesses running.

Radical resilience starts with Veeam.

Learn more at www.veeam.com or follow Veeam on X @veeam.

Industry
IT & Software
Company Size
5,001-10,000 employees
Headquarters
Seattle, WA
Year Founded
Unknown
Website
veeam.com
Social Media