Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.
We are seeking a skilled and driven Senior Site Reliability Engineer (SRE) to join our growing infrastructure and platform engineering team. The ideal candidate will have hands-on experience in Amazon Web Services (AWS), strong troubleshooting capabilities, and a passion for building scalable, observable, and resilient systems using modern Infrastructure as Code (IaC) and automation tools.
REQUIREMENTS:
Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
Minimum 3 years of experience in SRE, DevOps, cloud infrastructure, or system administration roles.
Hands-on expertise with AWS Cloud Services, including:
Compute & Containerization: EC2, Lambda, ECS, EKS, Auto Scaling
Networking: Load Balancers, VPC, Route 53, Security Groups, Firewalls
Storage & Databases: RDS, ElastiCache, Athena, S3
Messaging: SQS, SES
Deep understanding of Infrastructure as Code (IaC) tools such as Terraform and CloudFormation.
Proficiency in at least one programming/scripting language: Python, Node.js, Bash, Ruby, or related.
Experience operating and troubleshooting across Linux, Windows, and container-based environments.
Strong understanding of distributed systems, cloud networking (routers, switches), firewalls, DNS, and HTTP/TLS.
Experience implementing monitoring and alerting systems and working with incident management processes.
Experience with Zero Downtime Deployments, blue/green or canary deployments.
Familiarity with cost optimization and right-sizing AWS resources.
Exposure to multi-region, multi-account AWS architecture.
Understanding of API gateway, or edge networking (e.g., Akamai, CloudFront).
Design, develop, and maintain Infrastructure as Code (IaC) using tools like Terraform or AWS CloudFormation, leveraging AI coding assistants to accelerate development and enforce best practices.
Implement and operate reliable, scalable cloud infrastructure primarily on AWS (e.g., EC2, ECS, RDS, S3, Lambda, ElastiCache, SQS, SES, Auto Scaling, Load Balancers)
Lead and participate in architecture reviews focusing on reliability, scalability, security, performance, and the cost-efficiency of infrastructure.
Develop and manage robust monitoring, alerting, and logging solutions (e.g., CloudWatch, Prometheus, Grafana, ELK), incorporating AIOps tools for predictive alerting, anomaly detection, and reducing alert fatigue.
Perform incident management, postmortems, root cause analysis, and implement continuous improvement strategies, utilizing AI-driven analytics to rapidly summarize logs and traces during outages.
Collaborate with software engineering teams to improve CI/CD pipelines, deployment automation, release management, and the deployment lifecycles of machine learning models.
Automate infrastructure operations, reduce manual toil, and improve reliability using scripting (Python, Bash, Node.js, or Ruby) and AI-powered workflow automation.
Maintain and troubleshoot environments involving web servers, databases, firewalls, DNS, load balancers, networking.
Ensure systems are compliant with security standards, including patching, hardening, secure access policies, and data privacy constraints specific to AI training data.
Provide on-call support, participate in incident rotations.
Monitor and maintain service-level objectives (SLOs), SLAs, and error budgets to ensure reliability targets are met.
Provide support and solution handling to incidents and tickets assigned.
Razer is proud to be an Equal Opportunity Employer. We believe that diverse teams drive better ideas, better products, and a stronger culture. We are committed to providing an inclusive, respectful, and fair workplace for every employee across all the countries we operate in. We do not discriminate on the basis of race, ethnicity, colour, nationality, ancestry, religion, age, sex, sexual orientation, gender identity or expression, disability, marital status, or any other characteristic protected under local laws. Where needed, we provide reasonable accommodations - including for disability or religious practices - to ensure every team member can perform and contribute at their best.
Are you game?

Razer™ is the world’s leading lifestyle brand for gamers.
The triple-headed snake trademark of Razer is one of the most recognized logos in the global gaming and esports communities.
With a fan base that spans every continent, the company has designed and built the world’s largest gamer-focused ecosystem of hardware, software and services.
Razer’s award-winning hardware includes high-performance gaming peripherals and Blade gaming laptops. Razer’s software platform, with over 70 million users, includes Razer Synapse (an Internet of Things platform), Razer Chroma™ (a proprietary RGB lighting technology system), and Razer Cortex (a game optimizer and launcher).
In services, Razer Gold is one of the world’s largest virtual credit services for gamers, and Razer Fintech is one of the largest online-to-offline digital payment networks in SE Asia.
Founded in 2005 and dual-headquartered in Irvine and Singapore, Razer has 18 offices worldwide and is recognized as the leading brand for gamers in the USA, Europe and China.