Akamai Technologies

Manager Site Reliability Engineering

Akamai Technologies  •  Republic of India (Onsite)  •  1 day ago

Job Description

Are you intrigued by planetary-scale, distributed, intelligent systems?

Do you like collaborating across teams to solve complex problems?

Join our SRE team!

Our Site Reliability Engineering team collaborates across our organization to ensure the performance and reliability of our cloud platform and services. We establish Key Performance Indicators, define Service Level Objectives, and monitor compliance with them. Partnering with multiple teams, we address difficult problems that have no simple answers.

Partner with the best

As a Site Reliability Engineering Manager, you will be a hands-on leader, both contributing to and managing a team of APJ-based site reliability engineers. Your team will collaborate across the organization to define, maintain, and improve the operation and customer supportability of the Compute products. You will face some of the most complex challenges in distributed systems at scale.

As a Site Reliability Engineering Manager, you will be responsible for:

  • Leading a team of site reliability engineers in developing and executing product operation and customer supportability plans
  • Aggregating the customer signal to drive product improvements
  • Managing on-call duty for the APJ-based team as part of the global REAct team
  • Shaping cross-team communication and collaboration to improve product operation and customer supportability
  • Identifying opportunities to automate operational processes and reduce toil

Do what you love

To be successful in this role you will:

  • Have 8 years of relevant experience and a Bachelor's degree in Engineering, Computer Science, or related discipline, or its equivalent
  • Demonstrate a good track record of managing high-performance teams, delivering solutions, resolving issues, and developing engineers
  • Have experience in operations, analysis, and troubleshooting of large distributed systems, and with corresponding monitoring/logging/performance analytics tools
  • Have experience with tools like SaltStack, Ansible, Chef, or Puppet for managing infrastructure at scale
  • Be familiar with software development practices and product life cycles from concept to deployment and operations

About us

At Akamai, we make life better for billions of people, trillions of times a day.

Whether you're streaming live events, scrolling social media, watching your favorite series, or managing your savings, we're the engine behind the scenes. We provide the world's most distributed platform from Cloud to Edge to help the giants of the digital world work faster and stay more secure, making the internet a better experience for everyone.

Our focus is simple:
Cloud and Edge: Running apps closer to users for instant performance.
Security: Neutralizing threats before they ever reach your data.
Content Delivery: Scaling the world's biggest moments without a glitch.
AI: Enabling our customers to build, secure, and scale AI apps on the world's most distributed cloud platform.

At Akamai, we don't just support the internet; we power and protect it, because behind every great digital experience is a massive hidden challenge. And we're the ones who solve it. When millions of people hit play or pay, Akamai ensures it just works.

Benefits at Akamai: We support your health, well-being, finances, and life beyond work. See our benefits.

FlexBase adapts to your job's needs

Akamai's FlexBase program is yet another way we show our commitment to providing employees with an exceptional workplace experience. It's not about telling employees where to work; it's about supporting employees to do their best work.

We trust our incredible employees to work in ways that suit them best: at home, in an office, or a combination of both.


Connect with us on social and see what life at Akamai is like!

About Akamai Technologies

At Akamai, we make life better for billions of people, billions of times a day.

Every day, billions of people around the world connect with their favorite brands to shop online, play the latest video games, log into mobile banking apps, learn remotely, share videos with friends, and so much more. They may not know it, but Akamai is there, powering and protecting life online.

Over 20 years ago, we set out to solve the toughest challenge of the early internet: the “World Wide Wait.” And we’ve been solving the internet’s toughest challenges ever since, working toward our vision of a safer and more connected world.

With the world’s most distributed compute platform — from cloud to edge — we make it easy for businesses to develop and run applications, while we keep experiences closer to users and threats farther away. That’s why innovative companies worldwide choose Akamai to build, deliver, and secure their digital experiences.

Our leading security, compute, and delivery solutions are helping global companies make life better for billions of people, billions of times a day.

Devoted, determined problem-solvers who share a passion for technology, we’re always pushing ground-breaking ideas and driving innovation.

Want to power and protect life online by solving the toughest challenges?

Be part of an amazing team.

Let’s connect:

LinkedIn: https://www.linkedin.com/company/akamai-technologies

Twitter: https://twitter.com/Akamai

Blog: https://www.akamai.com/blog

Industry: IT & Software
Company Size: 10,000+ employees
Headquarters: Cambridge, MA