Talent.com
Lead Data Engineer

WorkHQ, Los Angeles, California, US
30+ days ago
Job type: Full-time
Job description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a US-only remote role (mainland).

Role Overview

Serve as the lead data infrastructure architect, managing billions of data points across 250M+ professional profiles.

You will also hire data engineers to support you in this work.

Core Responsibilities

Design scalable data pipelines processing massive record volumes

Architect ETL processes using PySpark on Amazon EMR (we are open to shifting to other solutions such as Databricks or Snowflake)

Distribute enriched data through a medallion architecture across Postgres, Athena, and OpenSearch

Integrate new data sources into the main pipeline

Implement advanced data matching using Splink

Technical Requirements

5-8 years of professional data engineering experience

Good proficiency in:

PySpark and distributed computing

AWS data services (EMR, Glue, Athena)

Docker

Pandas and DataFrame manipulation

Complex data format handling (JSONL, Parquet)

Strong background in:

Big data processing architectures

Data warehouse design

Performance optimization

Advanced Python and SQL skills

Nice to Have

Probabilistic record linking expertise

OpenSearch / Elasticsearch technologies

Machine learning data pipeline design

Recruitment tech ecosystem knowledge

Technical Stack

Big Data: PySpark, EMR

Databases: Postgres, OpenSearch

Cloud: AWS

Containerization: Docker

Data Formats: JSONL, Parquet

Analytics: Metabase, Athena, Glue

Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements, if you lack a few technical skills but are motivated to learn and lead the platform, please apply for consideration.

If you are coming from a Director, Head of, or VP-level role that is relevant to this job, you are welcome to apply as well.

You will need to apply directly on our platform.

Thank you for your time.
