Job Description
About the Team:
The Media Network team is building an industry-leading real-time media distributed system that plays a critical role in TikTok’s live streaming and RTC scenarios. Serving billions of users worldwide, our network is among the largest real-time media networks globally.Join the Media Network team to work on large-scale distributed systems and help deliver an exceptional real-time media experience to TikTok users.
Responsibilities:
1. Stability Enhancement, including:Design and implement resilient, decoupled systems to minimize single points of failure and improve overall system reliability; Implement automated recovery and failover mechanisms to ensure continuous service availability during incidents; Establish and enforce change management processes to reduce risks associated with configuration changes and prevent outages.
2. Cost Optimization, including:Analyze infrastructure usage and spending to identify key cost drivers and opportunities for savings; Lead initiatives to optimize resource allocation, including storage and compute efficiency, to maximize efficiency and reduce expenses;Monitor and control infrastructure costs to prevent waste and ensure budget adherence.
3. Quality Improvement, including: Identify and address issues impacting service quality and user experience, such as playback performance and content lifecycle management; Implement security measures to protect systems from common threats and vulnerabilities; Collaborate with teams to monitor quality metrics and drive continuous improvements.
4. Cross-Team Collaboration, including: Partner with multiple teams to align objectives and deliver integrated solutions; Develop and maintain standard operating procedures for deployments, disaster recovery, and operational best practices; Facilitate process improvements to enhance coordination, efficiency, and accountability across teams.