qode.world

Data Engineer

qode.world  •  Socialist Republic of Vietnam (Remote)  •  7 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description

About the roleWe are looking for a Data Engineer to join our Data Platform team, focusing on building scalable data pipelines and enabling analytics across the organization.In this role, you will work with modern data stack tools like Databricks, AWS, Airflow, Airbyte, and dbt to design and maintain data workflows that support reporting, analytics, and data-driven decisions.This is a good fit if you enjoy working with large-scale data systems, building reliable pipelines, and optimizing performance in a cloud-based environment.
Your Responsibilities
  • Design and build scalable ETL/ELT pipelines using both batch and streaming approaches
  • Develop ingestion workflows from multiple sources such as databases, APIs, and event streams
  • Implement ingestion strategies including full load, incremental load, and CDC
  • Orchestrate data workflows using Apache Airflow
  • Manage data connectors using Airbyte
  • Work with Databricks Lakehouse to build and optimize data processing pipelines
  • Write and optimize complex SQL queries for analytics and transformation
  • Build modular and testable data models using dbt (staging → intermediate → marts)
  • Maintain data quality, observability, and reliability across the platform
  • Work with AWS services such as S3, Lambda, EC2, IAM
  • Containerize data services using Docker and Kubernetes (EKS) when needed
  • Document pipelines, data models, and data dictionaries for long-term maintainability

Requirements
  • At least 6 years of experience in Data Engineering
  • Strong understanding of data architectures such as Data Lake, Data Warehouse, and Lakehouse
  • Hands-on experience with ETL/ELT pipelines, including batch and streaming processing
  • Familiar with ingestion patterns: full load, incremental, CDC, event-driven
  • Experience working with Databricks (Delta Live Tables, Jobs, Notebooks)
  • Strong skills in PySpark or Spark SQL for large-scale data processing
  • Solid understanding of Delta Lake (ACID, time travel, schema evolution)
  • Experience with Apache Airflow (DAGs, scheduling, monitoring)
  • Experience with Airbyte or similar ingestion tools
  • Strong SQL skills (CTEs, joins, window functions, query optimization)
  • Experience with dbt for transformation, testing, and documentation
  • Hands-on experience with AWS (S3, Lambda, IAM, etc.)
  • Be proficient in English communication skills (at least C1 level)
Nice to Have
  • Experience with Docker, Kubernetes (EKS)
  • Experience running Airflow or Airbyte on Kubernetes
  • Familiar with data quality tools such as Great Expectations or Soda
  • Experience with Terraform or Infrastructure as Code
  • Exposure to data governance or catalog tools (e.g., Databricks Catalog)
  • Experience with CI/CD pipelines (e.g., GitHub Actions)
  • Strong Python skills for automation and pipeline scripting

👉 Our Benefit Packages:
  • Attractive salary range and we are open to negotiate if you're a strong fit.
  • Hybrid/Remote-friendly culture, work where you grow best!
  • Flexible hours, async teamwork (we respect your focus time)
  • Work equipment support
  • Allowance for Certification & Skill Development
  • Year-end bonus & performance-based rewards
  • 22 paid leaves from your 5th year - take a full month off
  • Career growth with personal coaching sessions
  • Open, collaborative team culture - no micromanagement, only trust
  • Tools & AI-powered workflows that make remote work easier

About CoderPushCoderPush is a remote-first technology company that partners with startups and global businesses to build scalable, high-quality software products. We focus on long-term collaboration, clear communication, and delivering real impact through strong engineering and product thinking.Please find more at: https://coderpush.com/
qode.world

About qode.world

We revolutionize how talent finds meaningful careers by harnessing the power of data and automation. Our platform utilizes LLMs to parse resumes and reconstruct queries, transforming unstructured data into actionable insights. This enables us to build robust data moats, such as creating 'Private Talent Pools' for recruiters where autonomous agents enrich candidate profiles.

By automating high-volume recruiting workflows, we reduce the marginal cost of work to zero. Agents match profiles to job descriptions, find contact information, and send personalized messages and schedule interviews automatically, significantly decreasing the time to close. Additionally, we transcribe the interviews and make the data searchable, making hiring decisions more objective.

We drive confidence by raising the quality bar for job seekers. We automate technical exercises such as coding tests, evaluate candidates on merit, providing recruiters with pass/fail scores and qualitative feedback.

We also provide Exclusive or Retained Recruitment services, offering specialized recruitment with no upfront cost or a retained model with a partial fee, ensuring exclusivity and dedicated support throughout the hiring process.

Our Fractional Head of People and HR Advisory services offer flexible, strategic support through part-time or interim roles, as well as comprehensive advisory services to guide crucial HR decision-making.

Lastly, our HR Due Diligence process provides thorough insights into the HR frameworks of target companies, helping mitigate risks across the board.

How do you envision the future of recruiting with the integration of such advanced technologies?

Industry
IT & Software
Company Size
51-200 employees
Headquarters
Singapore , SG
Year Founded
2023
Social Media