Job Description

Location: Gurugram, Haryana, India

About Us:-

We turn customer challenges into growth opportunities. Material is a global strategy partner to the world’s most recognizable brands and innovative companies. Our people around the globe thrive by helping organizations design and deliver rewarding customer experiences.

We use deep human insights, design innovation and data to create experiences powered by modern technology. Our approaches speed engagement and growth for the companies we work with and transform relationships between businesses and the people they serve.

Srijan, a Material company, is a renowned global digital engineering firm with a reputation for solving complex technology problems using its deep technology expertise and strategic partnerships with top-tier technology partners.

Be a part of an Awesome Tribe

We are seeking a highly skilled and experienced Data Engineering Lead with strong expertise in AWS data services and retail data ecosystems. In this role, you will lead the design, development, and optimization of scalable data pipelines responsible for ingesting and transforming data from MMS (Merchandise Management Systems) and POS (Point of Sale) systems into a centralized Operational Data Store (ODS) to support downstream applications and analytics use cases.

You will play a critical role in building a robust, high-performance data platform using AWS-native services, ensuring data quality, reliability, and real-time or near real-time availability for business operations.

Responsibilities:-

Design and implement scalable ETL/ELT pipelines to ingest data from MMS / POS or third-party systems into AWS-based data platforms.

Build and maintain a centralized Operational Data Store (ODS) using Couchbase or similar NoSQL technologies, optimized for low-latency application access.

Develop and optimize data processing workflows using Apache Spark / PySpark and AWS services such as AWS Glue and Amazon EMR.

Create denormalized, API-ready data models aligned with downstream microservices and application consumption patterns.

Implement idempotent processing, CDC merge (upsert) strategies, and data reconciliation mechanisms to ensure consistency across batch and streaming pipelines.

Leverage Amazon S3, Amazon Redshift, or Amazon Aurora for efficient storage and querying of structured and semi-structured data.

Implement data ingestion patterns (batch and streaming) using tools like Amazon Kinesis or AWS Lambda where applicable.

Apply performance tuning and optimization techniques to improve pipeline efficiency, scalability, and cost-effectiveness.

Define and enforce data governance, data quality, and metadata management standards across the data platform.

Collaborate with DevOps teams to design and maintain CI/CD pipelines using tools like AWS CodePipeline, GitLab, or similar.

Conduct peer reviews and provide technical leadership and mentorship to the data engineering team.

Collaborate with microservices and application teams to ensure seamless integration with the ODS via APIs and event streams.

Requirements:-

5+ years of experience in data engineering, with at least 2 years in a lead role.

Strong hands-on experience with AWS data services such as AWS Glue, Amazon S3, Amazon Redshift, and Amazon EMR.

Proficiency in PySpark / Spark for large-scale data processing and optimization.

Experience designing and implementing ODS layers using NoSQL databases, preferably Couchbase, or similar (DynamoDB, MongoDB).

Strong expertise in ETL/ELT design patterns, data ingestion, and transformation pipelines.

Experience working with MMS, POS, or retail transaction data is highly preferred.

Hands-on experience with CI/CD pipelines (GitLab, AWS CodePipeline, or similar).

Good understanding of data governance, data quality, and metadata frameworks.

Experience supporting microservices architectures with data platforms.

Strong problem-solving skills and ability to optimize complex data workflows.

Excellent communication and stakeholder management skills.

Ability to work in a fast-paced, agile environment and manage multiple priorities.

About Srijan: Now Material

Srijan, a Material+ company, is a global engineering firm that builds transformative digital paths to better futures for organizations all over the world, from Fortune 500 enterprises to nonprofits. We bring advanced engineering capabilities and agile practices to some of the biggest names across FMCG, Aviation, Telecom, Technology, and others. With the AWS Select Tier Services partnership, a services relationship with VMware, and registered partnerships with Confluent, Cloudinary, Yugabyte and Searchstax, Srijan has built a reputation over the past two decades for solving some of the most complex business and technology problems. As a Drupal Enterprise Partner, Diamond Certified Contributor and Acquia Gold Level Partner, Srijan leads in Drupal with 400+ Drupal engineers and 90+ Acquia certifications. Srijan has been certified as a Great Place to Work™️ for the 6th time in a row.

With our new identity as a Material+ company, we have further strengthened our capability footprint by offering highly differentiated customer experience transformation through Material’s “Science and Systems” approach - a way of shaping customer experience by combining strategy, insights, design, and technology. For more information visit: https://www.materialplus.io/

Industry

IT & Software

Company Size

201-500 employees

Headquarters

Manasquan, New Jersey

Year Founded

Unknown

Website

srijan.net

Lead Data Engineer - ETL