Saviynt

Senior Site Reliability Engineer

Saviynt  •  Bengaluru, IN (Onsite)  •  1 day ago

Job Description

Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely accelerate their deployment and usage of AI. Saviynt is recognized as the leader in identity security, with solutions that protect and empower the world’s leading brands, Fortune 500 companies, and government institutions. For more information, please visit www.saviynt.com.

We’re a fast-moving AI Security Company building AI-native infrastructure and applications powered by LLMs and autonomous agents. Our stack is deeply integrated with AWS, Kubernetes, and OpenAI-based systems, and we’re rethinking reliability in a world where software can reason, adapt, and self-heal.

We’re hiring a Senior Site Reliability Engineer to own reliability across our cloud-native and AI-driven platform. You’ll work at the intersection of distributed systems, Kubernetes operations, and LLM-powered automation, building systems that don’t just scale but also think and fix themselves.

WHAT YOU BRING

    • 5+ years in SRE / DevOps / Platform Engineering.
    • Strong hands-on experience with:
      • AWS infrastructure at scale
      • Kubernetes (production-grade clusters)
    • Proven ability to debug complex distributed systems under pressure.
    • Strong coding skills (Python or Go): you build internal platforms and tools.
    • Experience implementing monitoring, alerting, and incident management systems.

Bonus (AI / LLM Focus)

    • Experience working with LLM APIs such as the OpenAI API.
    • Familiarity with agent frameworks like:
      • LangChain
      • AutoGen
    • Built or experimented with:
      • AI agents for DevOps / SRE workflows
      • Retrieval-Augmented Generation (RAG) systems
      • Vector databases (Pinecone, Weaviate, etc.)
    • Exposure to AIOps or intelligent automation systems.

WHAT YOU WILL BE DOING

    • Own uptime, reliability, and performance of services running on AWS + Kubernetes (EKS).
    • Design and implement self-healing infrastructure using automation and AI agents.
    • Build LLM-powered operational tooling using APIs such as the OpenAI API for:
      • Intelligent alert triage
      • Incident summarization
      • Root cause analysis
      • Runbook automation
    • Manage and scale Kubernetes workloads:
      • Deployments, autoscaling, resource optimization
      • Cluster reliability and cost efficiency
    • Build and evolve observability systems:
      • Metrics (Prometheus), dashboards (Grafana)
      • Logs (ELK / OpenSearch)
      • Tracing (OpenTelemetry)
    • Define and enforce SLOs, SLAs, and error budgets tied to business metrics.
    • Automate infrastructure using Terraform and CI/CD pipelines.
    • Lead incident response, postmortems, and continuous reliability improvements.
    • Introduce chaos engineering practices to proactively test system resilience.

If required for this role, you will:

    • Complete security & privacy literacy and awareness training during onboarding and annually thereafter.
    • Review (initially and annually thereafter), understand, and adhere to Information Security/Privacy Policies and Procedures such as (but not limited to):
      • Data Classification, Retention & Handling Policy
      • Incident Response Policy/Procedures
      • Business Continuity/Disaster Recovery Policy/Procedures
      • Mobile Device Policy
      • Account Management Policy
      • Access Control Policy
      • Personnel Security Policy
      • Privacy Policy

Saviynt is an amazing place to work. We are a high-growth, Platform as a Service company focused on Identity Authority to power and protect the world at work. You will experience tremendous growth and learning opportunities through challenging yet rewarding work which directly impacts our customers, all within a welcoming and positive work environment. If you're resilient and enjoy working in a dynamic environment, you belong with us!
Saviynt is an equal opportunity employer and we welcome everyone to our team. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.

About Saviynt

At Saviynt, we are pioneers in intelligent identity security solutions, dedicated to empowering enterprises to safeguard their digital environments. We aim to transform identity governance and administration (IGA) by delivering innovative, cloud-first solutions that ensure security, compliance, & risk management across diverse IT landscapes, including multi-cloud, hybrid, & on-premises environments.

Our Values

Innovation: We continuously enhance our solutions to meet the evolving needs of the modern enterprise.

Customer Focus: Our customers are at the heart of everything we do. We strive to provide exceptional service & solutions that deliver real value.

Accountability: We take responsibility for our actions & deliver on our promises, ensuring excellence in every aspect of our work.

Collaboration: We believe in the power of working together & fostering an inclusive environment where ideas & innovation can flourish.

Integrity: We operate with the highest standards of ethics & transparency, building trust with our customers, partners, & team members.

Our Mission

Saviynt’s mission is to provide intelligent, cloud-first identity governance & access management solutions that enable organizations to achieve Zero-Trust security. We aim to simplify the complexity of identity security by providing deep visibility & seamless integration across all IT environments.

Our Goals

Enhance Security: We help organizations protect their most critical assets from cyber threats by leveraging advanced identity governance & access management solutions.

Ensure Compliance: Our solutions meet stringent regulatory requirements, helping organizations maintain compliance effortlessly.

Drive Efficiency: We enable organizations to streamline their identity management processes through automation & intelligent analytics, reducing costs & improving productivity.

Foster Innovation: We are committed to staying at the forefront of technology, continually evolving our solutions to meet the demands of the digital age.

Industry: IT & Software

Company Size: 1,001-5,000 employees

Headquarters: El Segundo, California

Year Founded: 2010