CloudRaft

Site Reliability Engineer(SRE)

CloudRaft  •  Bengaluru, IN (Onsite)  •  1 month ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Immediate joiners or candidates who can join within 10 days only to apply.If you have already applied to CloudRaft in the last 90 days, we already have your CV/resume on file. Multiple applications from the same candidate will not be considered.
About CloudRaftCloudRaft is a dynamic company specializing in advanced AI and cloud-native solutions. We foster creativity, collaboration, and innovation, enabling our team to address complex challenges and deliver exceptional results. Join us to contribute to an organization that prioritizes professional growth, operational excellence, and technological advancement.
We seek an experienced Site Reliability Engineer (SRE) to join our team. In this role, you will scale our operations, design and maintain resilient infrastructure and apply best practices for reliability and efficiency within our cloud-native environment.
Responsibilities
  • Manage and maintain Kubernetes clusters across cloud platforms, including OpenShift, Amazon EKS, Azure AKS, and Google GKE.
  • Implement and manage CI/CD pipelines using tools such as Jenkins, GitHub Actions, Argo CD, or GitLab CI/CD.
  • Design and maintain observability stacks with tools including Prometheus, Grafana, Loki, OpenTelemetry, and related technologies.
  • Optimize system performance and resolve production issues.
  • Implement SRE principles, including Service Level Indicators (SLIs) and Service Level Objectives (SLOs), to uphold system reliability.
  • Automate infrastructure and operational tasks using programming languages such as Go or Python, and Infrastructure as Code (IaC) tools like Terraform.
  • Apply AI skills like Vibe Coding for engineering tasks, AIOps and automation, understanding of Large Language Models (LLMs) and AI Agents, and proficiency in Prompt Engineering.
  • Remain current with emerging technologies, including AI, MLOps, and Edge Computing.
  • Contribute to knowledge sharing through technical writing and presentations.

Qualifications
  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 2-5 years of experience in SRE, Platform Engineering, or DevOps roles.
  • Strong expertise in Kubernetes, cloud-native technologies, and major cloud platforms (AWS, Azure, GCP).
  • Proficiency in programming languages such as Python or Go or Node.js.
  • Familiarity with CI/CD tools and contemporary deployment practices.
  • Knowledge of observability tools and Infrastructure as Code.
  • AI skills, including experience with Vibe Coding, AIOps and automation, understanding of LLMs and AI Agents, and Prompt Engineering.
  • CKA Certified (Brownie points!)
  • Excellent problem-solving abilities and communication skills.
  • Inclination toward open-source contributions is advantageous.

Benefits : - Competitive salary- Premium health insurance and various health & wellness benefits- Opportunity to work on cutting-edge technologies- Collaborative and supportive work environment- Chance to make a real impact on the company's success
CloudRaft

About CloudRaft

CloudRaft is a trusted problem solver for startups and Fortune 500 companies. Our team crafts cutting-edge AI Cloud, GPU Cloud, and cloud native solutions. We specialize in DevOps & cloud consulting, observability, and enterprise-grade support for open source technologies like PostgreSQL and Clickhouse. With our expertise, businesses can confidently navigate their digital transformation journey.

Our Specialization:

- AI Cloud, GPU Cloud, AI Infrastructure, Enterprise AI, and Generative AI: Empowering businesses with advanced AI capabilities that enhance decision-making and operational efficiency.

- Cloud Native Solutions, Kubernetes Consulting: Crafting scalable, resilient cloud environments that adapt to your business needs.

- DevOps & Cloud Consulting: Streamlining development and operations through best practices in DevOps and cloud strategies.

- DevSecOps & Security: Ensuring robust security measures are integrated seamlessly into every stage of development.

- Observability: Providing deep insights into system performance to ensure optimal functionality and quick resolution of issues.

- Enterprise-grade Support for Open Source Technologies: Offering expert support for tools like Thanos, Prometheus, ArgoCD, PostgreSQL, and Clickhouse, ensuring your open-source projects thrive.

We are committed to helping businesses navigate their digital transformation journey with confidence.

Visit us at www.cloudraft.io

Industry
IT & Software
Company Size
11-50 employees
Headquarters
Indore, IN
Year Founded
2022
Social Media