Talent.com
Data Engineer

Data Engineer

Techcliff GroupColumbus, OH, United States
5 days ago
Job type
  • Full-time
  • Quick Apply
Job description

Must have : Date of birth

Full resume

full updated linkedin

Overview :

We are seeking an experienced data engineer to deliver high-quality, scalable data solutions on Databricks and AWS for one of our Big Four clients. You will build and optimize pipelines, implement medallion architecture, integrate streaming and batch sources, and enforce strong governance and access controls to support analytics and ML use cases.

Key Responsibilities :

  • Build and Maintain Data Pipelines : Develop scalable data pipelines using PySpark and Spark within the Databricks environment.
  • Implement Medallion Architecture : Design workflows using raw, trusted, and refined layers to drive reliable data processing.
  • Integrate Diverse Data Sources : Connect data from Kafka streams, extract channels, and APIs.
  • Data Cataloging and Governance : Model and register datasets in enterprise data catalogs, ensuring robust governance and accessibility.
  • Access Control : Manage secure, role-based access patterns to support analytics, AI, and ML needs.
  • Team Collaboration : Work closely with peers to achieve required code coverage and deliver high-quality, well-tested solutions.
  • Optimize and Operationalize : Tune Spark jobs (partitioning, caching, broadcast joins, AQE), manage Delta Lake performance (Z-Ordering, OPTIMIZE, VACUUM), and implement cost and reliability best practices on AWS.
  • Data Quality and Testing : Implement data quality checks and validations (e.g., Great Expectations, custom PySpark checks), unit / integration tests, and CI / CD for Databricks Jobs / Workflows.
  • Infrastructure as Code : Provision and manage Databricks and AWS resources using Terraform (workspaces, clusters, jobs, secret scopes, Unity Catalog objects, S3, IAM).
  • Monitoring and Observability : Set up logging, metrics, and alerts (CloudWatch, Datadog, Databricks audit logs) for pipelines and jobs.
  • Documentation : Produce clear technical documentation, runbooks, and data lineage for governed datasets.

Required Skills & Qualifications :

  • Databricks : 6-9 years of experience with expert-level proficiency
  • PySpark / Spark : 6-9 years of advanced hands-on experience
  • AWS : 6-9 years of experience with strong competency, including S3 and Terraform for infrastructure-as-code
  • Data Architecture : Solid knowledge of the medallion pattern and data warehousing best practices
  • Data Pipelines : Proven ability to build, optimize, and govern enterprise data pipelines
  • Delta Lake and Unity Catalog : Expertise in Delta Lake internals, time travel, schema evolution / enforcement, and Unity Catalog RBAC / ABAC
  • Streaming : Hands-on experience with Spark Structured Streaming, Kafka, checkpointing, exactly-once semantics, and late-arriving data handling
  • CI / CD : Experience with Git-based workflows and CI / CD for Databricks (e.g., Databricks Repos, dbx, GitHub Actions, Azure DevOps, or Jenkins)
  • Security and Compliance : Experience with IAM, KMS, encryption, secrets management, token / credential rotation, and PII governance
  • Performance and Cost : Demonstrated ability to tune Spark jobs and optimize Databricks cluster configurations and AWS usage for cost and throughput
  • Collaboration : Experience working in Agile / Scrum teams, peer reviews, and achieving code coverage targets
  • Preferred Skills & Qualifications :

  • Certifications : Databricks Data Engineer Professional, AWS Solutions Architect / Developer, HashiCorp Terraform Associate
  • Data Catalogs : Experience with enterprise catalogs such as Collibra or Alation, and lineage tooling such as OpenLineage
  • Orchestration : Databricks Workflows and / or Airflow
  • Additional AWS : Glue, Lambda, Step Functions, CloudWatch, Secrets Manager
  • Testing : pytest, chispa, Great Expectations, dbx test
  • Domain Experience : Analytics and ML feature pipelines, MLOps integrations
  • Create a job alert for this search

    Data Engineer • Columbus, OH, United States

    Related jobs
    • Promoted
    Data Engineer II

    Data Engineer II

    VirtualVocationsColumbus, Ohio, United States
    Full-time
    A company is looking for a Data Engineer II.Key Responsibilities Produce high-quality data models and maintain data integrity for analytics products Develop scalable ELT pipelines and business i...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer -Columbus, OH (5 Days Onsite)

    Data Engineer -Columbus, OH (5 Days Onsite)

    Career Mentors LLCColumbus, OH, United States
    Full-time
    W2 Only (Local / Nearby Candidates Preferred).Privileged Access Management (PAM).Privileged Access Management (PAM).The ideal candidate will bring deep expertise in ETL and database systems, along wi...Show moreLast updated: 3 days ago
    • Promoted
    Principal Data Engineer

    Principal Data Engineer

    VirtualVocationsColumbus, Ohio, United States
    Full-time
    A company is looking for a Principal Platform Data Engineer (Databricks Platform).Key Responsibilities Collaborate with stakeholders to translate business requirements into technical specificatio...Show moreLast updated: 30+ days ago
    • Promoted
    Databricks Data Engineer

    Databricks Data Engineer

    Inizio PartnersColumbus, OH, United States
    Full-time
    About the job Databricks Data Engineer.Title : Databricks Data Engineer (Senior Manager / AVP).Location : Columbus, OH (1st choice), Remote (2nd choice). Experience Required : 7 12 years.We are seeking...Show moreLast updated: 3 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Purple DriveColumbus, OH, United States
    Full-time
    Skills : Python, Py Spark, Snowflake.We are seeking a highly skilled and motivated Data Engineer to join our innovative team. As a Data Engineer, you will be responsible for designing, building, and ...Show moreLast updated: 3 days ago
    • Promoted
    Lead Data Engineer

    Lead Data Engineer

    VirtualVocationsColumbus, Ohio, United States
    Full-time
    A company is looking for a Lead Data Engineer to design, build, and manage enterprise-grade data pipelines.Key Responsibilities Design, develop, and optimize metadata-driven data pipelines in Fab...Show moreLast updated: 30+ days ago
    • Promoted
    Data Platform Engineer

    Data Platform Engineer

    VirtualVocationsColumbus, Ohio, United States
    Full-time
    A company is looking for a Data Platform Engineer, Data Capture.Key Responsibilities Expand developer tools for capturing business events and operational data into the Lakehouse Enhance self-ser...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer- Columbus OH

    Data Engineer- Columbus OH

    Georgia IT IncColumbus, OH, United States
    Full-time
    Data Engineering tools and techniques (any ETL / ELT skills are good to have ).Proficiency in Linux / Unix environments.Experience building applications for public cloud environments (AWS preferred).E...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    InterSourcesColumbus, OH, United States
    Full-time
    Certified Diverse Supplier, was founded in 2007 and offers innovative solutions to help clients with Digital Transformations across various domains and industries. Our history spans over 16 years an...Show moreLast updated: 3 days ago
    • Promoted
    Data Engineer Location : Columbus OH (Onsite - 5 Days a Week)

    Data Engineer Location : Columbus OH (Onsite - 5 Days a Week)

    Career Mentors LLCColumbus, OH, United States
    Full-time
    Columbus, OH (Onsite - 5 Days a Week).W2 Only (Local Candidates Preferred).PAM (Privileged Access Management).Privileged Access Management (PAM) group in Columbus, OH. This role requires a highly mo...Show moreLast updated: 3 days ago
    • Promoted
    Lead Data Engineer - Databricks

    Lead Data Engineer - Databricks

    JPMorgan Chase Bank, N.A.Westerville, OH, United States
    Full-time
    Join us as we embark on a journey of collaboration and innovation, where your unique skills and talents will be valued and celebrated. Together we will create a brighter future and make a meaningful...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    VirtualVocationsColumbus, Ohio, United States
    Full-time
    A company is looking for a Senior Data Engineer to lead the data engineering function and enhance data capabilities.Key Responsibilities Design, build, and optimize data architecture, pipelines, ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer II

    Senior Data Engineer II

    VirtualVocationsColumbus, Ohio, United States
    Full-time
    A company is looking for a Senior Data Engineer II to join their data engineering team.Key Responsibilities Design, develop, and maintain scalable data pipelines using Apache Spark on Databricks ...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer

    Data Engineer

    IQVentures Corporate, LLCDublin, OH, United States
    Full-time
    The Data Engineer is a technical leader and hands-on developer responsible for designing, building, and optimizing data pipelines and infrastructure to support analytics and reporting.This role wil...Show moreLast updated: 3 days ago
    • Promoted
    Data Engineer for AI

    Data Engineer for AI

    VirtualVocationsColumbus, Ohio, United States
    Full-time
    A company is looking for a Data Engineer (AI Platforms).Key Responsibilities Design, build, and optimize scalable data pipelines and migrate from legacy systems to an AI-centric platform Evolve ...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer

    Data Engineer

    Everest TechnologiesColumbus, OH, United States
    Full-time
    The Data Engineer will be a key builder on our AI journey, responsible for designing,.This role will focus on building robust and scalable data pipelines to extract data from a variety of sources, ...Show moreLast updated: 30+ days ago
    • Promoted
    Data engineer

    Data engineer

    Diverse LynxColumbus, OH, United States
    Full-time
    We are seeking a highly skilled and motivated Data Engineer to join our innovative team.As a Data Engineer, you will be responsible for designing, building, and maintaining scalable data pipelines ...Show moreLast updated: 3 days ago
    • Promoted
    Data Engineer

    Data Engineer

    AndHealthColumbus, OH, US
    Full-time
    AndHealth is on a mission to radically improve access and outcomes for the most challenging chronic health conditions with the goal of making world-class specialty care accessible and affordable to...Show moreLast updated: 2 days ago
    • Promoted
    Data Engineer

    Data Engineer

    VirtualVocationsColumbus, Ohio, United States
    Full-time
    A company is looking for a Data Engineer to build, maintain, and optimize data pipelines for genealogy data products.Key Responsibilities Build and maintain scalable data pipelines for large gene...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer, Analytics

    Data Engineer, Analytics

    METAColumbus, OH, United States
    Full-time
    Meta), formerly known as Facebook Inc.When Facebook launched in 2004, it changed the way people connect.Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around t...Show moreLast updated: 30+ days ago