F5

Site Reliability Engineer – UDF

F5  •  $138k - $206k/yr  •  United States (Hybrid)  •  30 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.

Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.

This role will be a new member of our UnifiedDemo Framework(UDF)platformteamsupporting thelaunch and managementoftheF5 Guardrails andRedteamproduct linesinto UDFTherole will focus on designing, deploying, and supportingKubernetes environmentsthat support a wide variety of usecases across many F5 teams As a technical expert, the SRE will work closely with cross-functional teams to instantiate AI features,optimizesystem performance, and ensure reliability in production environments.

The ideal candidate will have deepexpertiseinKubernetesorchestration, containerized architectures,andbuilds and runs systems with an operational excellence mindset This individual will play a critical role in advancing the operational maturity and scalabilityof the UDF platform andensureour abilityto incorporate new F5 product lines and features

Key Responsibilities

Kubernetes Orchestration and Management

  • Design, deploy, and manage Kubernetes clusters and ensure efficient container orchestration to support AI workloads.

  • Implement andmaintainKubernetes-based deployment pipelines

  • Optimizeresource allocation within Kubernetesclusters,while reducing costs and maximizing performance.

  • Develop andmaintainhigh-availability and fault-tolerant Kubernetes architectures to ensure service continuity

Observability and Monitoring

  • Design and implement observability pipelines for real-time monitoring ofKubernetes clusters, including metrics collection forscaling, resourceutilization, and system health.

  • Leverage tools such asCloudwatch,DataDog,Grafana, or similar platforms to ensure visibility into Kubernetes-managedworkloads

  • Establish logging, tracing, and alerting strategies to enable proactive identification and resolution of performance or reliability issues.

Automation and Scalability

  • Automate infrastructure management tasks to support the efficient deployment and operation of AI functionalities, including upgrades, scaling, and provisioning.

  • Support Infrastructure-as-Code (IaC) methodologies for the provisioning and configuration of environments,leveragingtools such as Terraform or Helm.

  • Contribute to the development of CI/CD workflows tailoredfor automatic scaling and effective change management practices

Collaboration and Process Improvement

  • Collaborate withproduct teams and sales engineering to integrate F5 products into the UDF platform and ensure effectiveutilizationbythe sales organization

  • Support root cause analysis (RCA) processes for issues affectingthe UDF platform, driving long-term corrective actions to improve system reliability.

  • Provide technicalexpertiseto designoperational workflows and procedures that improve the agility and stability ofthe UDF platform

Required Qualifications

  • EducationBachelor’s degree in Computer Science, Software Engineering, or a related technical field (or equivalent experience).

  • Experience

  • 4+ years of experience in Site Reliability Engineering (SRE), DevOps, or similar roles with a focus oncontainer management and AWS usage

  • Strongexpertisein managing Kubernetes clusters and containerized workloads in production environments.

  • Hands-on experience deploying and managingKubernetes environments in AWS, especially using EKS, as well asin self-hosted ecosystems such ason-premisedatacenters.

  • Proficient in monitoring and observability tools, includingCloudWatch, Grafana,Fluentd,DataDog, or equivalent platforms.

  • Expertisewith Infrastructure-as-Code (IaC) tools such as Terraform,Helm, orCloudFormation, and CI/CD frameworks.

  • Solid understanding of networking, storage, andcomputeinfrastructure within containerized environments.

  • Proficiencyin coding and scripting languages, including Python, Go, or Bash, withfocuson automation and system integration.

  • Expertisein applying security best practices to Kubernetes environments, including data protection and resource access controls.

  • Familiarity with GPU-based workloads in Kubernetes environments and optimization strategies for AIbased workloads

  • Experience with orchestrating, troubleshooting,best practices,andoptimizing complex network environments in AWSand GCP VPCs.

  • Experience working with hypervisors in GCP VPCs

Preferred Qualifications

  • Certifications

  • Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD).

  • Relevant cloud certifications, such as AWS Certified Solutions Architect orGCP Cloud Architectcertifications.

  • Familiarity with advanced Kubernetes tools and techniques such as service mesh technologies (Istio,Linkerd) or Kubernetes operators for machine learning workflows.

  • Knowledge of distributed computing concepts and experience supporting large-scale AI workloads.

  • Practical experience integrating observability and monitoring into pipelines for inference engines and machine learning models.

#LI-Hybrid #LI-EM1

The Job Description is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.

The annual base pay for this position is: $137,600.00 - $206,400.00

F5 maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, geographic locations, and market conditions, as well as to reflect F5’s differing products, industries, and lines of business. The pay range referenced is as of the time of the job posting and is subject to change.

You may also be offered incentive compensation, bonus, restricted stock units, and benefits. More details about F5’s benefits can be found at the following link: https://www.f5.com/company/careers/benefits F5 reserves the right to change or terminate any benefit plan without notice.

Please note that F5 only contacts candidates through F5 email address (ending with @f5.com) or auto email notification from Workday (ending with f5.com or @myworkday.com)

Equal Employment Opportunity

It is the policy of F5 to provide equal employment opportunities to all employees and employment applicants without regard to unlawful considerations of race, religion, color, national origin, sex, sexual orientation, gender identity or expression, age, sensory, physical, or mental disability, marital status, veteran or military status, genetic information, or any other classification protected by applicable local, state, or federal laws. This policy applies to all aspects of employment, including, but not limited to, hiring, job assignment, compensation, promotion, benefits, training, discipline, and termination. F5 offers a variety of reasonable accommodations for candidates Requesting an accommodation is completely voluntary. F5 will assess the need for accommodations in the application process separately from those that may be needed to perform the job. Request by contacting accommodations@f5.com

F5

About F5

F5, Inc. (NASDAQ: FFIV) is the global leader that delivers and secures every app. Backed by three decades of expertise, F5 has built the industry’s premier platform—F5 Application Delivery and Security Platform (ADSP) —to deliver and secure every app, every API, anywhere: on-premises, in the cloud, at the edge, and across hybrid, multicloud environments. F5 is committed to innovating and partnering with the world’s largest and most advanced organizations to deliver fast, available, and secure digital experiences. Together, we help each other thrive and bring a better digital world to life.

Industry
IT & Software
Company Size
5,001-10,000 employees
Headquarters
Seattle, Washington
Year Founded
Unknown
Website
f5.com
Social Media