Atera

Senior SRE

Atera  •  Tel Aviv, IL (Hybrid)  •  3 months ago

Job Description

About Atera

Atera is inventing a new way of managing IT end-to-end for IT professionals and teams worldwide.

By creating an AI-powered IT platform, Atera's all-in-one Remote Monitoring and Management (RMM), Helpdesk, Ticketing, and Reporting solution helps more than 23,000 IT pros achieve 10X operational efficiency, cut down time-to-resolution, and deliver better outcomes faster. Located in the heart of Tel Aviv, our team of passionate, like-minded individuals is driven by a shared mission to unleash everyone's potential and constantly innovate. We create an open, transparent, and supportive environment that gives our teams the autonomy, resources, and freedom to thrive.

This is a full-time hybrid role based at our Tel Aviv office.

Atera is looking for a motivated senior site reliability engineer to join us and build the framework that will allow engineering operations to scale.

Responsibilities:

  • Build tools and automation to monitor system health, performance, and reliability, ensuring quick detection and resolution of any anomalies or issues.
  • Write high-quality infrastructure-as-code that automates provisioning, deployment, and scaling, and delivers effective monitoring, alerting, and logging solutions.
  • Work with other engineers to ensure that new services are well-designed, properly monitored, and have well-defined SLIs and achievable SLOs.
  • Build and maintain observability pipelines using tools like Prometheus, Grafana, OpenTelemetry, and distributed tracing systems.
  • Proactively track our capacity, quotas, and other performance limits to plan for growth.
  • Participate in a 24x7 on-call rotation to handle product availability issues as well as urgent customer support escalations.
  • Investigate and resolve incidents and outages, performing root cause analysis to identify systemic issues and implement preventive measures.
  • Develop and maintain disaster recovery plans and perform regular testing to ensure data integrity and business continuity.

Requirements

  • 3+ years of experience as an SRE in large-scale, cloud-based production environments
  • Strong experience in designing, implementing, and managing monitoring processes.
  • Familiarity with observability tools (Prometheus, Grafana, ELK, Datadog, OpenTelemetry)
  • Experience in at least one scripting language (Python, Ruby, Perl, Bash) and infrastructure as code technologies (e.g., Terraform, CloudFormation)
  • Strong abilities to lead, design, and execute cross-organization projects
  • Experience in managing container and infrastructure orchestration tools (e.g., Kubernetes, Terraform)
  • Hands-on experience administering public clouds
  • Background in high-scale, high-throughput telemetry or data ingestion systems (an advantage)
  • Experience designing SLO frameworks from

A bit about our benefits

Atera is highly collaborative and, yes, fun! To support you at work (and play), we offer some fantastic perks: ample time to learn from your teammates and contemporaries, time off to relax and recharge, community volunteer days, an annual budget to support your learning & growth, a company-paid trip, and lots more.

About Atera

Atera is leading the future of IT with the world’s first patented Autonomous IT platform powered by a digital fleet of self-learning AI agents that transform how IT is done. By cutting up to 40% of routine IT workload, Atera empowers IT teams and MSPs to shift from reactive and routine automation to proactive and preemptive action, delivering 24/7 support with zero downtime.

At the core of Atera’s platform are two powerful AI agents: IT Autopilot is a personal IT tech for every employee that doesn’t just assist but decides, acts, and resolves issues instantly and autonomously. AI Copilot is an intelligent IT companion that supports technicians with real-time device diagnostics, smart recommendations, and instant, context-aware actions. Together, they create a tireless digital workforce that adapts to your environment, preempts disruptions, and frees IT teams to focus on what matters most.

Atera’s all-in-one IT management platform consolidates RMM, helpdesk, ticketing, and automation, empowering IT teams and MSPs to efficiently manage and protect infrastructure, automate tasks, and boost service quality by reducing downtime and improving SLAs.

Trusted by over 13K customers in 120+ countries, Atera offers a scalable solution enabling organizations to drive sustainable growth and maximize organizational efficiency.

Try Atera for free for 30 days at www.atera.com

Industry: IT & Software
Company Size: 201-500 employees
Headquarters: Tel Aviv, IL
Year Founded: 2011
Website: atera.com