Datavail

Technical Specialist - Databricks

Datavail  •  Mumbai, IN (Onsite)  •  2 months ago

Job Description

Title: Technical Specialist

Location: Mumbai

Education: Bachelor’s Degree

  • Build scalable ETL/ELT pipelines using Databricks (PySpark, SQL, Spark Streaming).
  • Develop and optimize Delta Lake tables, making use of ACID transactions, schema evolution, and time travel.
  • Implement Unity Catalog, data governance, and access control.
  • Optimize cluster configurations, job workflows, and performance tuning in Databricks.
  • Design and implement batch and streaming pipelines using Spark Structured Streaming.
  • Integrate Databricks with multiple data sources (RDBMS, APIs, cloud storage, message queues).
  • Develop reusable, modular, and automated data processing frameworks.
  • Implement CI/CD pipelines for Databricks using GitHub Actions / Azure DevOps / GitLab.
  • Automate cluster management and job orchestration using Databricks REST APIs.
  • Maintain code quality, unit tests, and documentation.
  • Write and optimize complex SQL queries and statements to ensure high performance and efficient data retrieval.
  • Apply strong database design skills, including normalization, data modelling, and relational schema creation.
  • Conduct performance analysis, troubleshoot database issues such as slow queries or deadlocks, and implement solutions.
  • Design and implement database structures, including tables, schemas, views, stored procedures, functions, and triggers.
  • Optimize database performance through query tuning, indexing, and performance analysis.
  • Ensure data integrity, security, and compliance with standards.
  • Bring strong Python skills combined with expertise in Apache Spark for large-scale data processing. Core abilities include building efficient ETL pipelines, optimizing distributed jobs, and handling large-scale data transformations.
  • Expertise in Python programming, Spark APIs, and parallel processing.
  • Proficiency in Python (including Pandas, NumPy) for data manipulation and scripting.
  • Deep knowledge of PySpark APIs such as DataFrames, RDDs, and Spark SQL for querying and processing.
  • Familiarity with RESTful APIs, batch processing, CI/CD, and monitoring data jobs.
  • Optimize Spark jobs for performance, troubleshoot issues, and ensure data quality across systems.
  • Collaborate with data engineers and scientists to implement workflows, conduct code reviews, and integrate with cloud platforms like AWS or Azure.
  • Design, develop, and maintain scalable data pipelines and ETL processes using Azure Databricks.
  • Build data transformation workflows using Python or Scala.
  • Work with data lakes using Delta Lake.
  • Integrate data from multiple sources such as APIs, databases, and cloud storage.
  • Monitor and optimize data workflows for performance and reliability.
  • Collaborate with data scientists, analysts, and business teams.


Datavail is a leading provider of data management, application development, analytics, and cloud services, with more than 1,000 professionals helping clients build and manage applications and data via a world-class tech-enabled delivery platform and software solutions across all leading technologies. For more than 17 years, Datavail has worked with thousands of companies spanning different industries and sizes, and is an AWS Advanced Tier Consulting Partner, a Microsoft Solutions Partner for Data & AI and Digital & App Innovation (Azure), an Oracle Partner, and a MySQL Partner.

About Datavail

Datavail | Data, Cloud & AI—Built for Real Business Outcomes

Datavail is a data, cloud, and AI consultancy that helps organizations turn complex technology environments into clear, measurable business outcomes.

We partner with data, technology, and IT leaders to make enterprise data more usable, systems more adaptable, and decisions more informed. Our work sits at the intersection of data management, cloud modernization, enterprise applications, and AI—bringing these disciplines together so they support the business, not slow it down.

In a landscape full of tools, platforms, and transformation promises, Datavail focuses on what actually drives progress:

• Trusted, well-managed data that teams can rely on

• Cloud environments without unnecessary cost or complexity

• Enterprise applications that evolve with the business

• Practical, responsible AI that delivers value—not experiments

We help organizations:

• Improve data quality, accessibility, and governance

• Turn analytics and AI into everyday decision-making tools

• Modernize and optimize cloud and application environments

• Reduce operational risk while increasing agility and performance

Our Core Capabilities:

• Data Management & AI: Data foundations, analytics, AI and machine learning that support real-world decisions

• Cloud Services: Cloud modernization, optimization, SRE services, and license optimization

• Enterprise Applications: Managed services, upgrades & integrations, digital transformation, and implementation services

At Datavail, we believe data only creates value when it’s well managed, well understood, and actively used. Our role is to help organizations move from complexity to clarity—and from data to action.

Industry: IT & Software

Company Size: 501-1,000 employees

Headquarters: Boulder, Colorado

Year Founded: 2007