Talent.com
Databricks Data Engineer

Winaxis • Columbus, OH, United States
5 days ago
Job type
  • Full-time
Job description

Title : Databricks- and AWS-Focused Data Engineer

Location : Columbus, OH - Onsite

Overview :

We are seeking an experienced data engineer to deliver high-quality, scalable data solutions on Databricks and AWS for one of our Big Four clients. You will build and optimize pipelines, implement medallion architecture, integrate streaming and batch sources, and enforce strong governance and access controls to support analytics and ML use cases.

Key Responsibilities :

  • Build and Maintain Data Pipelines : Develop scalable data pipelines using PySpark and Spark within the Databricks environment.
  • Implement Medallion Architecture : Design workflows using raw, trusted, and refined layers to drive reliable data processing.
  • Integrate Diverse Data Sources : Ingest data from Kafka streams, extract channels, and APIs.
  • Data Cataloging and Governance : Model and register datasets in enterprise data catalogs, ensuring robust governance and accessibility.
  • Access Control : Manage secure, role-based access patterns to support analytics, AI, and ML needs.
  • Team Collaboration : Work closely with peers to achieve required code coverage and deliver high-quality, well-tested solutions.
  • Optimize and Operationalize : Tune Spark jobs (partitioning, caching, broadcast joins, AQE), manage Delta Lake performance (Z-Ordering, OPTIMIZE, VACUUM), and implement cost and reliability best practices on AWS.
  • Data Quality and Testing : Implement data quality checks and validations (e.g., Great Expectations, custom PySpark checks), unit / integration tests, and CI / CD for Databricks Jobs / Workflows.
  • Infrastructure as Code : Provision and manage Databricks and AWS resources using Terraform (workspaces, clusters, jobs, secret scopes, Unity Catalog objects, S3, IAM).
  • Monitoring and Observability : Set up logging, metrics, and alerts (CloudWatch, Datadog, Databricks audit logs) for pipelines and jobs.
  • Documentation : Produce clear technical documentation, runbooks, and data lineage for governed datasets.

Required Skills & Qualifications :

  • Databricks : 6-9 years of experience with expert-level proficiency
  • PySpark / Spark : 6-9 years of advanced hands-on experience
  • AWS : 6-9 years of experience with strong competency, including S3 and Terraform for infrastructure-as-code
  • Data Architecture : Solid knowledge of the medallion pattern and data warehousing best practices
  • Data Pipelines : Proven ability to build, optimize, and govern enterprise data pipelines
  • Delta Lake and Unity Catalog : Expertise in Delta Lake internals, time travel, schema evolution / enforcement, and Unity Catalog RBAC / ABAC
  • Streaming : Hands-on experience with Spark Structured Streaming, Kafka, checkpointing, exactly-once semantics, and late-arriving data handling
  • CI / CD : Experience with Git-based workflows and CI / CD for Databricks (e.g., Databricks Repos, dbx, GitHub Actions, Azure DevOps, or Jenkins)
  • Security and Compliance : Experience with IAM, KMS, encryption, secrets management, token / credential rotation, and PII governance
  • Performance and Cost : Demonstrated ability to tune Spark jobs and optimize Databricks cluster configurations and AWS usage for cost and throughput
  • Collaboration : Experience working in Agile / Scrum teams, peer reviews, and achieving code coverage targets

Preferred Skills & Qualifications :

  • Certifications : Databricks Data Engineer Professional, AWS Solutions Architect / Developer, HashiCorp Terraform Associate
  • Data Catalogs : Experience with enterprise catalogs such as Collibra or Alation, and lineage tooling such as OpenLineage
  • Orchestration : Databricks Workflows and / or Airflow
  • Additional AWS : Glue, Lambda, Step Functions, CloudWatch, Secrets Manager
  • Testing : pytest, chispa, Great Expectations, dbx test
  • Domain Experience : Analytics and ML feature pipelines, MLOps integrations