Talent.com
Data Engineer

Data Engineer

People Data LabsSan Francisco, CA, US
30+ days ago
Job type
  • Permanent
Job description

Job Description

Job Description

Note for all engineering roles : with the rise of fake applicants and AI-enabled candidate fraud, we have built in additional measures throughout the process to identify such candidates and remove them.

About Us

People Data Labs (PDL) is the provider of people and company data. We do the heavy lifting of data collection and standardization so our customers can focus on building and scaling innovative, compliant data solutions. Our sole focus is on building the best data available by integrating thousands of compliantly sourced datasets into a single, developer-friendly source of truth. Leading companies across the world use PDL's workforce data to enrich recruiting platforms, power AI models, create custom audiences, and more.

We are looking for individuals who can balance extreme ownership with a "one-team, one-dream" mindset. Our customers are trying to solve complex problems, and we only help them achieve their goals as a team. Our Data Engineering Team is the secret sauce behind all that we do and we are looking for the best of the best.

If you are looking to be part of a team discovering the next frontier of data-as-a-service (DaaS) with a high level of autonomy and opportunity for direct contributions, this might be the role for you. We like our engineers to be thoughtful, quirky, and willing to fearlessly try new things. Failure is embraced at PDL as long as we continue to learn and grow from it.

What You Get to Do

  • Build infrastructure for ingestion, transformation, and loading an exponentially increasing volume of data from a variety of sources using Spark, SQL, AWS, and Databricks
  • Building an organic entity resolution framework capable of correctly merging hundreds of billions of individual entities into a number of clean, consumable datasets.
  • Developing CI / CD pipelines and anomaly detection systems capable of continuously improving the quality of data we're pushing into production.
  • Dreaming up solutions to largely undefined data engineering and data science problems.

The Technical Chops You'll Need

  • 4-6+ years of industry experience with clear examples of strategic technical problem-solving and implementation
  • Strong software development fundamentals.
  • Experience with Python
  • Expertise with Apache Spark (Java, Scala, and / or Python-based)
  • Experience with SQL
  • Experience building scalable data processing systems (e.g., cleaning, transformation) from the ground up.
  • Experience using developer-oriented data pipeline and workflow orchestration (e.g., Airflow (preferred), dbt, dagster or similar)
  • Knowledge of modern data design and storage patterns (e.g., incremental updating, partitioning and segmentation, rebuilds and backfills)
  • Experience working in Databricks (including delta live tables, data lakehouse patterns, etc.)
  • Experience with cloud computing services (AWS (preferred), GCP, Azure or similar)
  • Experience with data warehousing (e.g., Databricks, Snowflake, Redshift, BigQuery, or similar)
  • Understanding of modern data storage formats and tools (e.g., parquet, ORC, Avro, Delta Lake)
  • People Thrive Here Who Can

  • Balance high ownership and autonomy with a strong ability to collaborate
  • Work effectively remotely (able to be proactive about managing blockers, proactive on reaching out and asking questions, and participating in team activities)
  • Demonstrate strong written communication skills on Slack / Chat and in documents
  • Exhibt experience in writing data design docs (pipeline design, dataflow, schema design)
  • Scope and breakdown projects, communicate and collaborate progress and blockers effectively with your manager, team, and stakeholders
  • Some Nice To Haves

  • Degree in a quantitative discipline such as computer science, mathematics, statistics, or engineering
  • Experience working with entity data (entity resolution / record linkage)
  • Experience working with data acquisition / data integration
  • Expertise with Python and the Python data stack (e.g., numpy, pandas)
  • Experience with streaming platforms (e.g., Kafka)
  • Experience evaluating data quality and maintaining consistently high data standards across new feature releases (e.g., consistency, accuracy, validity, completeness)
  • Our Benefits

  • Stock
  • Competitive Salaries
  • Unlimited paid time off
  • Medical, dental, & vision insurance
  • Health, fitness, and office stipends
  • The permanent ability to work wherever and however you want
  • Comp : $160-180K

    People Data Labs does not discriminate on the basis of race, sex, color, religion, age, national origin, marital status, disability, veteran status, genetic information, sexual orientation, gender identity or any other reason prohibited by law in provision of employment opportunities and benefits.

    Qualified Applicants with arrest or conviction records will be considered for Employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.

    Personal Privacy Policy for California Residents

    https : / / www.peopledatalabs.com / pdf / privacy -policy-and-notice.pdf

    Create a job alert for this search

    Data Engineer • San Francisco, CA, US

    Related jobs
    • Promoted
    Data Platform Engineer

    Data Platform Engineer

    VirtualVocationsHayward, California, United States
    Full-time
    A company is looking for a Data Platform Engineer, Data Capture.Key Responsibilities Expand developer tools for capturing business events and operational data into the Lakehouse Enhance self-ser...Show moreLast updated: 30+ days ago
    • Promoted
    Lead AI Data Engineer

    Lead AI Data Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Lead AI and Data Solution Engineer (LLMs, MCP).Key Responsibilities Lead the design, development, and deployment of enterprise-scale data and AI solutions Architect an...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer II

    Data Engineer II

    VirtualVocationsOakland, California, United States
    Full-time
    A company is looking for a Data Engineer II.Key Responsibilities Produce high-quality data models and maintain data integrity for analytics products Develop scalable ELT pipelines and business i...Show moreLast updated: 30+ days ago
    • Promoted
    Data Pipeline Engineer

    Data Pipeline Engineer

    VirtualVocationsConcord, California, United States
    Full-time
    A company is looking for an EA-Data Pipeline.Key Responsibilities Process datasets from various data vendors and clients Identify and implement automation improvements for data processing Maint...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Sinclair Broadcast GroupSan Francisco, CA, US
    Full-time
    Digital Remedy is looking for a skilled Data Engineer to join our growing team of analytics experts.As a Data Engineer, you will help design, build, maintain, and troubleshoot data processing syste...Show moreLast updated: 17 days ago
    • Promoted
    Data Engineer

    Data Engineer

    EVERYDaly City, CA, US
    Full-time
    EVERY™ is a leading VC-backed food tech ingredient company and market leader using precision fermentation to create animal proteins without the animal for the global food and beverage industr...Show moreLast updated: 1 day ago
    • Promoted
    Data Engineer

    Data Engineer

    ForhyreSunnyvale, CA, US
    Full-time
    We are looking for a passionate certified Data Engineer.The successful candidate will turn data into information, information into insight and insight into business decisions.Data analyst responsib...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer 4

    Data Engineer 4

    Cypress HCMSan Jose, CA, US
    Full-time
    Location : San Jose CA 95110 (Hybrid).Duration : 10 / 01 / 2025 to 2 / 27 / 2026.Design, develop, and maintain scalable and reliable data pipelines to support large-scale data processing.Build and optimize d...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    The Rockridge GroupEmeryville, CA, US
    Full-time
    Google Search console experience required.Google Tag Manager, merchant account or data studio experience preferred.Facebook knowledge will be big plus. Very proficient with installation, data interr...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Data Engineer

    Lead Data Engineer

    VirtualVocationsSan Jose, California, United States
    Full-time
    A company is looking for a Lead Data Engineer to design, build, and manage enterprise-grade data pipelines.Key Responsibilities Design, develop, and optimize metadata-driven data pipelines in Fab...Show moreLast updated: 30+ days ago
    • Promoted
    Data Analytics Engineer

    Data Analytics Engineer

    ParafinSan Francisco, CA, US
    Full-time
    At Parafin, we’re on a mission to grow small businesses.Small businesses are the backbone of our economy, but traditional banks often don’t have their backs. We build tech that makes it ...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Data Engineer

    Staff Data Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Staff Data Engineer.Key Responsibilities Design and implement data pipelines and architectures Collaborate with cross-functional teams to optimize data usage Ensure d...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    VirtualVocationsConcord, California, United States
    Full-time
    A company is looking for a Data Engineer to build, maintain, and optimize data pipelines for genealogy data products.Key Responsibilities Build and maintain scalable data pipelines for large gene...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    VirtualVocationsHayward, California, United States
    Full-time
    A company is looking for a Senior Data Engineer to lead the data engineering function and enhance data capabilities.Key Responsibilities Design, build, and optimize data architecture, pipelines, ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Data Engineer II

    Senior Data Engineer II

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Senior Data Engineer II to join their data engineering team.Key Responsibilities Design, develop, and maintain scalable data pipelines using Apache Spark on Databricks ...Show moreLast updated: 12 hours ago
    • Promoted
    Data Engineer 4

    Data Engineer 4

    NextDeavor Inc.San Jose, CA, US
    Temporary
    Here's how you'll become a key player with this opportunity : .We are looking for a Splunk expert to join on a short-term contract and help stabilize, optimize, and improve our Splunk environ...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer - Product

    Data Engineer - Product

    xAIPalo Alto, CA, US
    Full-time
    AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering exc...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Data Engineer for AI

    Data Engineer for AI

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Data Engineer (AI Platforms).Key Responsibilities Design, build, and optimize scalable data pipelines and migrate from legacy systems to an AI-centric platform Evolve ...Show moreLast updated: 16 hours ago