
Work Schedule
Standard (Mon-Fri)
Environmental Conditions
Office
When you’re part of the team at Thermo Fisher Scientific, you’ll do important work. Surrounded by collaborative colleagues, you’ll have the support and opportunities that only a global leader can give you. Our respected, growing organization has an exceptional strategy for the near term and beyond. Take your place on our strong team and help us make significant contributions to the world.
Responsibilities
Serve as a senior technical SME for enterprise infrastructure and application observability, supporting proactive detection, incident response, and service reliability.
Design, implement, and operate observability platforms including Zabbix, Prometheus, Grafana, and BigPanda across on-premises and cloud environments.
Own and lead integrations between observability tools, event management platforms, and ITSM systems (ServiceNow) to enable automated incident and workflow management.
Define and maintain standardized alerting strategies, dashboards, service health models, SLIs/SLOs, and operational reporting.
Act as a senior escalation point for complex observability and event management issues, leading root cause analysis and corrective actions.
Develop and maintain automation and auto-remediation workflows using StackStorm and supporting scripts.
Maintain observability-related documentation, including runbooks, dashboards, and procedures.
Provide technical leadership and mentorship to junior engineers.
Participate in on-call rotations and provide off-hours support as required.
Qualifications
Bachelor’s degree in Information Technology, Computer Science, Engineering, or a related discipline (or equivalent practical experience).
7+ years of experience in enterprise infrastructure and application observability, monitoring platforms, and IT operations.
Strong hands-on experience with observability and monitoring tools such as Zabbix, Prometheus, Grafana, Playwright, and BigPanda, including metrics, alerting, dashboards, event correlation, and reporting.
Proven experience integrating observability platforms with ITSM systems (ServiceNow or equivalent) and event management workflows.
Hands-on experience building automation and auto-remediation workflows, including scripting with Playwright, Python, and Shell, and API-based integrations (e.g., StackStorm or similar tools).
Solid understanding of infrastructure, cloud, and application architectures, with the ability to troubleshoot complex cross-domain issues and act as a senior technical escalation point.
Nice to Have
Experience with logging platforms and log aggregation pipelines (e.g., ELK, OpenSearch, Splunk).
Exposure to cloud-native observability, SRE practices, or reliability engineering concepts.

About Thermo Fisher Scientific
Thermo Fisher Scientific Inc. is the world leader in serving science, with annual revenue of approximately $40 billion. Our Mission is to enable our customers to make the world healthier, cleaner and safer. Whether our customers are accelerating life sciences research, solving complex analytical challenges, increasing productivity in their laboratories, improving patient health through diagnostics or the development and manufacture of life-changing therapies, we are here to support them. Our global team delivers an unrivaled combination of innovative technologies, purchasing convenience and pharmaceutical services through our industry-leading brands, including Thermo Scientific, Applied Biosystems, Invitrogen, Fisher Scientific, Unity Lab Services, Patheon and PPD.
For more information, please visit www.thermofisher.com.