Endpoint Clinical

Data Architect

Endpoint Clinical  •  Bengaluru, IN (Onsite)  •  2 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.
71
AI Success™

Job Description

We are seeking a Data Solutions Architect with 15+ years of experience in designing enterprise-scale data platforms. This role focuses on building Azure + Databricks Lakehouse solutions for clinical trial and life sciences data, enabling advanced analytics, machine learning workflows, and regulatory compliance.

Responsibilities

  • Architect Lakehouse solutions leveraging Azure Data Lake (ADLS Gen2), Databricks, and Delta Lake
  • Design data models (star, snowflake, data vault), ingestion pipelines, and CDC strategies with schema evolution and performance tuning.
  • Implement data governance, security, and compliance aligned with GxP, HIPAA, and 21 CFR Part 11
  • Enable data science and ML workflows using MLflow, Feature Store, and curated datasets.
  • Collaborate with clinical operations and biometrics teams to deliver business-aligned solutions.

Experience

  • 15+ years in data architecture/engineering; 5+ years with Azure; 3+ years with Databricks.
  • Azure Expertise: ADLS Gen2, Data Factory/Fabric pipelines, Synapse/SQL, Event Hubs, Functions, Key Vault, Private Endpoints, VNets.
  • Databricks Expertise: Spark (PySpark/SQL), Unity Catalog, Delta Live Tables (DLT), Workflows, MLflow, Feature Store.
  • Data Modeling: Star/snowflake, data vault, CDC, schema evolution, performance tuning.
  • Programming: PySpark, SQL; bonus: Python (pandas), Scala, dbt.
  • Governance & Security: IAM/RBAC/ABAC, row/column-level security, encryption, masking/tokenization, secrets, audit.
  • Observability & Reliability: Monitoring, lineage, alerting, CI/CD (GitHub Actions/Azure DevOps), automated testing/validation.

Skills

  • Clinical trial data standards (CDISC: CDASH, SDTM, ADaM) and systems (EDC, CTMS, IRT).
  • Familiarity with decentralized trials, real-world data (RWD/RWE), and regulatory compliance frameworks
  • Strong stakeholder management and communication skills.
  • Ability to translate complex technical concepts into business value.
  • Leadership in cross-functional teams and mentoring engineers.

Education

  • Bachelor’s/Master’s in Computer Science, Data Engineering, or related field.
  • Certifications: Azure Data Engineer (DP-203), Databricks Architect, Azure Solutions Architect (AZ-305), GxP/CSV.
Endpoint Clinical

About Endpoint Clinical

Clinical trials are designed around endpoints, but it’s the beginning that sets them up for success—including selecting the best RTSM partner for continuous and flexible support throughout the duration of a trial.

With decades of experience, Endpoint Clinical is obsessed with proactive problem-solving, ensuring that issues are anticipated and resolved before they escalate, and any changes are easily incorporated. From day one, we become an extension of your team, seamlessly integrating into your trial operations. We minimize friction, reduce risk, and enable your clinical teams to focus where it matters most—on trial success.

Endpoint is headquartered in Raleigh, NC, with offices across the US, EU, and Asia.

Industry
Chemicals & Materials
Company Size
501-1,000 employees
Headquarters
Raleigh, North Carolina
Year Founded
2009
Social Media