Farfetch is a leading global marketplace for the luxury fashion industry. The Farfetch Marketplace connects customers in over 190 countries and territories with items from more than 50 countries and over 1,400 of the world’s best brands, boutiques, and department stores, delivering a truly unique shopping experience and access to the most extensive selection of luxury on a global marketplace.
TECHNOLOGY
We're on a mission to build end-to-end products and technology that powers the an incredible e-commerce experience for luxury customers everywhere, understanding the motivations and needs of our customers and partners, to designing and testing hypotheses, to creating industry-leading experiences for luxury customers.
PORTO
Our office is near Porto, in the north of Portugal, and is located in a vibrant business hub. It offers a dynamic and welcoming environment where our employees can connect and network with a large community of tech professionals.
THE ROLE
As a Senior Observability Engineer in the Tech Platform team, you will play a critical role in building and maintaining the core of our telemetry ecosystem. Your mission is to provide a scalable and reliable platform for analysis, detection, and investigation, empowering users across FARFETCH Business Units through self-service and comprehensive coverage of telemetry data.
In this role, you will be a key technical contributor dedicated to evolving our observability infrastructure. You will focus on scaling data pipelines and storage solutions for Logs, Metrics, Traces, Profiling, and RUM, reducing the cognitive load for developers while ensuring our telemetry systems maintain the highest levels of performance, resilience, and operational excellence.
Design and build robust observability infrastructure, focusing on the scalability, high availability, and performance of our core telemetry platforms.
Manage large-scale data pipelines and storage, ensuring the reliable ingestion, processing, and querying of Logs, Metrics, and Traces using technologies like OpenTelemetry (OTEL), Fluentbit, the Victoria ecosystem (Metrics, Logs, Traces).
Drive the creation of self-service capabilities, empowering engineering teams and FARFETCH Business Units to be autonomous in their analysis, detection, and investigation efforts.
Ensure the reliability and performance of our internal observability stack, making sure systems like Grafana, and various exporters remain robust even during high-traffic events.
Contribute to technical strategy and roadmaps, identifying opportunities to optimize cost efficiency, storage, and reliability through modern infrastructure patterns.
Collaborate and support the integration of Observability Frameworks (Audit, Logging, Monitoring, etc), ensuring they fit seamlessly into the broader platform architecture.
Provide technical mentorship and guidance to other engineers, fostering a culture of technical excellence, psychological safety, and continuous learning within the team.
You have a proven track record of designing, building, and maintaining complex observability platforms and distributed systems at scale.
You possess a strong technical background in telemetry infrastructure, with hands-on experience managing tools like OpenTelemetry, Fluentbit, Victoria Metrics/Logs, Grafana.
You have solid experience with cloud infrastructure and container orchestration (Kubernetes), allowing you to make and guide complex architectural decisions regarding the observability stack.
You understand the software development lifecycle deeply and have experience with development languages (e.g., Golang or Python) to effectively build tools, exporters, or automations.
You have strong analytical and problem-solving skills, with a relentless focus on continuous system improvement, infrastructure as code, and automation.
You have good interpersonal and communication skills, with the ability to collaborate effectively and influence technical stakeholders across different Business Units.
You prioritize data security, compliance, and cost-efficiency as core pillars of a reliable observability platform.
