Title : Databricks and AWS Focused Data Engineer
Location : Columbus, OH - Onsite
Overview :
We are seeking an experienced data engineer to deliver high-quality, scalable data solutions on Databricks and AWS for one of our Big Four clients. You will build and optimize pipelines, implement medallion architecture, integrate streaming and batch sources, and enforce strong governance and access controls to support analytics and ML use cases.
Key Responsibilities :
- Build and Maintain Data Pipelines : Develop scalable data pipelines using PySpark and Spark within the Databricks environment (see the ingestion sketch after this list).
- Implement Medallion Architecture : Design workflows using raw, trusted, and refined layers to drive reliable data processing.
- Integrate Diverse Data Sources : Connect data from Kafka streams, file-based extracts, and APIs.
- Data Cataloging and Governance : Model and register datasets in enterprise data catalogs, ensuring robust governance and accessibility.
- Access Control : Manage secure, role-based access patterns to support analytics, AI, and ML needs.
- Team Collaboration : Work closely with peers to achieve required code coverage and deliver high-quality, well-tested solutions.
- Optimize and Operationalize : Tune Spark jobs (partitioning, caching, broadcast joins, AQE), manage Delta Lake performance (Z-Ordering, OPTIMIZE, VACUUM), and implement cost and reliability best practices on AWS (see the tuning sketch after this list).
- Data Quality and Testing : Implement data quality checks and validations (e.g., Great Expectations, custom PySpark checks), unit / integration tests, and CI / CD for Databricks Jobs / Workflows (see the data quality sketch after this list).
- Infrastructure as Code : Provision and manage Databricks and AWS resources using Terraform (workspaces, clusters, jobs, secret scopes, Unity Catalog objects, S3, IAM).
- Monitoring and Observability : Set up logging, metrics, and alerts (CloudWatch, Datadog, Databricks audit logs) for pipelines and jobs.
- Documentation : Produce clear technical documentation, runbooks, and data lineage for governed datasets.
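To make the pipeline and medallion responsibilities above concrete, here is a minimal ingestion sketch: it lands a Kafka topic into a bronze (raw) Delta table with Structured Streaming, then builds a trusted (silver) table in batch. All names are illustrative assumptions, not the client's actual objects: the topic "orders", the broker address, the S3 checkpoint path, the tables demo.bronze_orders / demo.silver_orders, and the parsed schema.

```python
# Minimal bronze -> silver sketch; all names (topic, broker, bucket, tables, schema)
# are illustrative placeholders.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()  # on Databricks this session already exists

# Bronze (raw): land the Kafka payload as-is with ingestion metadata.
bronze_stream = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")     # placeholder broker
    .option("subscribe", "orders")                        # placeholder topic
    .option("startingOffsets", "earliest")
    .load()
    .select(
        F.col("key").cast("string"),
        F.col("value").cast("string").alias("raw_payload"),
        F.col("timestamp").alias("ingest_ts"),
    )
)

(
    bronze_stream.writeStream.format("delta")
    .option("checkpointLocation", "s3://my-bucket/checkpoints/bronze_orders")  # placeholder path
    .trigger(availableNow=True)    # process what is currently available, then stop
    .outputMode("append")
    .toTable("demo.bronze_orders")
    .awaitTermination()
)

# Silver (trusted): parse, validate, and deduplicate in batch.
silver = (
    spark.read.table("demo.bronze_orders")
    .withColumn(
        "order",
        F.from_json("raw_payload", "order_id STRING, customer_id STRING, amount DOUBLE, ts TIMESTAMP"),
    )
    .select("order.*", "ingest_ts")
    .filter(F.col("order_id").isNotNull())
    .dropDuplicates(["order_id"])
)
silver.write.format("delta").mode("overwrite").saveAsTable("demo.silver_orders")
```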
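The tuning responsibility spans several techniques; this sketch shows a representative subset only: enabling AQE, broadcasting a small dimension table, and running OPTIMIZE / ZORDER and VACUUM on a Delta table. The tables demo.dim_customer and demo.gold_orders_enriched and the join column customer_id are assumptions for illustration, and the SQL maintenance commands presume a Databricks runtime with Delta Lake.

```python
# Representative tuning and maintenance steps; table and column names are assumed.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()  # assumes a Databricks runtime with Delta Lake

# Adaptive query execution (on by default in recent runtimes, shown explicitly here).
spark.conf.set("spark.sql.adaptive.enabled", "true")

# Broadcast the small dimension table so the large fact table avoids a shuffle join.
orders = spark.read.table("demo.silver_orders")
customers = spark.read.table("demo.dim_customer")
enriched = orders.join(F.broadcast(customers), "customer_id", "left")
enriched.write.format("delta").mode("overwrite").saveAsTable("demo.gold_orders_enriched")

# Compact small files, co-locate rows on a frequent filter column, and clean up
# files outside the default retention window.
spark.sql("OPTIMIZE demo.silver_orders ZORDER BY (order_id)")
spark.sql("VACUUM demo.silver_orders")
```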
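For the data quality and testing responsibility, the sketch below implements a small custom PySpark null-rate check with pytest-style unit tests. The function names and thresholds are hypothetical, not a prescribed framework; Great Expectations could serve the same role.

```python
# A custom null-rate check plus pytest-style tests; names and thresholds are hypothetical.
import pytest
from pyspark.sql import DataFrame, SparkSession
from pyspark.sql import functions as F


def null_rate(df: DataFrame, column: str) -> float:
    """Return the fraction of rows where `column` is null."""
    total = df.count()
    if total == 0:
        return 0.0
    return df.filter(F.col(column).isNull()).count() / total


def assert_null_rate_below(df: DataFrame, column: str, threshold: float) -> None:
    """Fail the run when nulls in `column` exceed the allowed threshold."""
    rate = null_rate(df, column)
    if rate > threshold:
        raise ValueError(f"{column}: null rate {rate:.2%} exceeds threshold {threshold:.2%}")


def _local_spark() -> SparkSession:
    return SparkSession.builder.master("local[1]").appName("dq-tests").getOrCreate()


def test_null_rate_counts_null_fraction():
    df = _local_spark().createDataFrame([("a", 1), ("b", None), ("c", None)], ["id", "amount"])
    assert null_rate(df, "amount") == pytest.approx(2 / 3)


def test_assert_null_rate_below_raises_on_dirty_data():
    df = _local_spark().createDataFrame([("a", 1), ("b", None), ("c", None)], ["id", "amount"])
    with pytest.raises(ValueError):
        assert_null_rate_below(df, "amount", threshold=0.1)
```

In a pipeline notebook, a call such as assert_null_rate_below(silver, "order_id", 0.0) could gate publication of the trusted table; the exact gates and coverage targets would follow the team's CI / CD conventions.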
Required Skills & Qualifications :
- Databricks : 6-9 years of experience with expert-level proficiency
- PySpark / Spark : 6-9 years of advanced hands-on experience
- AWS : 6-9 years of experience with strong competency, including S3 and Terraform for infrastructure-as-code
- Data Architecture : Solid knowledge of the medallion pattern and data warehousing best practices
- Data Pipelines : Proven ability to build, optimize, and govern enterprise data pipelines
- Delta Lake and Unity Catalog : Expertise in Delta Lake internals, time travel, schema evolution / enforcement, and Unity Catalog RBAC / ABAC
- Streaming : Hands-on experience with Spark Structured Streaming, Kafka, checkpointing, exactly-once semantics, and late-arriving data handling
- CI / CD : Experience with Git-based workflows and CI / CD for Databricks (e.g., Databricks Repos, dbx, GitHub Actions, Azure DevOps, or Jenkins)
- Security and Compliance : Experience with IAM, KMS, encryption, secrets management, token / credential rotation, and PII governance
- Performance and Cost : Demonstrated ability to tune Spark jobs and optimize Databricks cluster configurations and AWS usage for cost and throughput
- Collaboration : Experience working in Agile / Scrum teams, peer reviews, and achieving code coverage targets
Preferred Skills & Qualifications :
- Certifications : Databricks Data Engineer Professional, AWS Solutions Architect / Developer, HashiCorp Terraform Associate
- Data Catalogs : Experience with enterprise catalogs such as Collibra or Alation, and lineage tooling such as OpenLineage
- Orchestration : Databricks Workflows and / or Airflow
- Additional AWS : Glue, Lambda, Step Functions, CloudWatch, Secrets Manager
- Testing : pytest, chispa, Great Expectations, dbx test
- Domain Experience : Analytics and ML feature pipelines, MLOps integrations