Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a System Reliability Engineer in India.
This role is centered on ensuring the stability, scalability, and performance of large-scale hybrid infrastructure environments spanning global data centers and cloud platforms. You will play a key part in maintaining and optimizing thousands of systems through monitoring, automation, and proactive incident management. The position combines hands-on systems administration with DevOps practices, focusing on reducing operational complexity through tooling, scripting, and infrastructure automation. You will work in a fast-paced, engineering-driven environment where reliability, uptime, and user experience are top priorities. The role involves close collaboration with global teams to troubleshoot complex issues, manage deployments, and enhance system efficiency. This is a high-impact position suited for engineers who thrive in operational excellence and continuous improvement environments.
Accountabilities
You will be responsible for maintaining, automating, and improving large-scale infrastructure systems while ensuring reliability, security, and performance across global environments.
- Install, configure, monitor, and maintain systems across hybrid cloud and on-premise data centers
- Manage patching, upgrades, and lifecycle operations for thousands of physical and virtual systems
- Develop automation scripts and tools using Python and Shell to reduce manual operational tasks
- Troubleshoot and resolve complex system, network, and performance issues across distributed environments
- Support CI/CD pipelines and version control workflows using tools such as Jenkins, GitLab CI, and Git
- Manage infrastructure provisioning and configuration using tools like Terraform, Ansible, and related IaC frameworks
- Handle Linux system administration tasks, primarily in Ubuntu environments
- Participate in on-call rotations and incident response activities, including off-hours support when required
- Manage user access, permissions, firewalls, and system security configurations
- Support bare metal infrastructure operations and deployment tooling (e.g., Foreman, Cobbler, MAAS)
Requirements
This role requires strong systems engineering experience, hands-on DevOps capabilities, and the ability to operate in high-availability, distributed infrastructure environments.
- 2–5 years of experience in Site Reliability Engineering, DevOps, or production operations roles
- Bachelor’s or Master’s degree in Computer Science or a related field preferred
- Strong proficiency in scripting languages such as Python and Shell
- Solid experience in Linux system administration, preferably Ubuntu-based environments
- Hands-on experience with CI/CD tools such as Jenkins or GitLab CI
- Familiarity with infrastructure-as-code tools like Terraform, Packer, or Ansible
- Experience working with cloud platforms and hybrid infrastructure environments
- Understanding of incident management and on-call operational processes
- Strong analytical and troubleshooting skills for complex distributed systems
- Bonus: experience with Kubernetes or Linux certifications
Benefits
- Competitive compensation aligned with experience and market standards
- Comprehensive health and wellness benefits supporting physical and mental well-being
- Opportunities for continuous learning, certifications, and professional development
- Exposure to large-scale global infrastructure and cutting-edge collaboration technologies
- Career growth opportunities in a fast-scaling, engineering-driven organization
- Supportive work environment focused on innovation, automation, and operational excellence
- Inclusive culture with strong emphasis on collaboration and employee well-being
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
#LI-CL1