Cloudera

Staff Software Engineer - Replication Manager

Cloudera  •  Bengaluru, IN (Hybrid)  •  1 day ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

Business Area:

Engineering

Seniority Level:

Mid-Senior level

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance.

As a Staff Software Engineer of the Replication Manager team, you will be responsible for designing, developing, and maintaining enterprise-grade data replication solutions that enable seamless data movement across hybrid and multi-cloud environments. You'll work on critical infrastructure that helps Fortune 500 companies manage their data lifecycle and migration strategies.

As the Staff Software Engineer you will lead: (5-7 bullets)

  • Data Replication Engineering — Design and implement scalable replication services across HDFS, Hive, HBase, Apache Iceberg, and other big data technologies, with strong data consistency and minimal downtime.

  • Cloud Migration — Lead complex data migration initiatives between on-premises clusters and cloud environments including AWS S3 and Azure ADLS Gen2.

  • API & Microservices Development — Build robust APIs and microservices for the Replication Manager platform, along with advanced features like bandwidth throttling, scheduling, and policy management.

  • Distributed Systems Architecture — Design fault-tolerant, petabyte-scale distributed systems with comprehensive monitoring, alerting, and observability capabilities.

  • Security & Governance — Ensure data security and governance compliance during movement operations, leveraging Apache Atlas for metadata lineage and data discovery.

  • Product & Innovation — Drive technical decisions for new features, evaluate emerging replication and cloud technologies, and translate business requirements into technical specifications.

  • Cross-functional Collaboration — Partner with CDP, SRE, and field engineering teams to integrate replication capabilities, resolve customer escalations, and improve system reliability.

  • Technical Mentorship — Guide junior engineers on best practices, conduct code reviews, and contribute to technical documentation and customer-facing materials.

We’re excited about you if you have: (Minimum Qualifications)

  • 8+ years in software engineering with strong proficiency in Java, Scala, or Python, and deep hands-on experience with the Apache Hadoop ecosystem (HDFS, Hive, HBase, YARN).

  • Solid experience with modern data formats including Apache Iceberg, Delta Lake, and Hive tables with ACID support, alongside streaming technologies like Kafka and Pulsar.

  • Practical experience across AWS, Azure, and GCP storage services, with working knowledge of containerization tools like Docker and Kubernetes.

  • Proven ability to architect large-scale distributed systems with a strong grasp of data consistency models, CAP theorem, microservices, and API design.

  • Familiarity with security protocols and data governance frameworks to ensure compliant and trustworthy data operations.

  • Well-versed in agile SDLC, CI/CD pipelines, automated testing, Git-based code review workflows, and observability tooling including Prometheus, Grafana, and the ELK stack.

You may also have: (Preferred Qualifications)

  • Experience with Apache Ranger and Apache Atlas for data governance and metadata management

  • Understanding of Apache Iceberg table format and its replication challenges in hybrid cloud environments

  • Knowledge of enterprise backup and disaster recovery solutions

  • Previous experience in data migration or ETL pipeline development

  • Contributions to open-source big data projects

  • Experience in customer-facing roles or supporting enterprise customers

  • Advanced degree in Computer Science, Engineering, or a related field

What you can expect from us:

  • Generous PTO Policy

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy

  • Mental & Physical Wellness programs

  • Phone and Internet Reimbursement program

  • Access to Continued Career Development

  • Comprehensive Benefits and Competitive Packages

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-SV1

Cloudera

About Cloudera

Cloudera is the only data and AI platform company that brings AI to data anywhere: in clouds, data centers, and at the edge. Cloudera delivers 100% of data in all forms–whether it is in Cloudera or anywhere in the entire data estate. The world’s largest organizations rely on Cloudera to fuel insights that boost bottom lines, safeguard against threats, and save lives. Learn more at Cloudera.com.

---------------------------------------------------------------------------------

Recruitment Fraud Alert

It has come to our attention that job seekers have been contacted about fake job opportunities with Cloudera from individuals fraudulently posing as Cloudera employees. These recruiting fraud schemes often include requests for personal information and payments.

Be aware that Cloudera will never request a payment as part of its recruitment process. Additionally, Cloudera will never make a job offer without conducting an interview process. Any information submitted to Cloudera in relation to a job application should only be through our official career portal (https://www.cloudera.com/careers.html). Email communications from Cloudera will come from an email address ending in @cloudera.com.

If you are the target of a recruiting scam, consider filing a report with law enforcement authorities. Cloudera is not responsible for fraudulent job offers and/or any claims, damages, expenses, or other inconvenience connected to recruiting scams.

Industry
IT & Software
Company Size
1,001-5,000 employees
Headquarters
Santa Clara, California
Year Founded
Unknown
Social Media