Talent.com
Sr. Data Engineer

Sr. Data Engineer

MachinifyPalo Alto, CA, US
30+ days ago
Job type
  • Full-time
Job description

Job Description

Job Description

Machinify is a leading healthcare intelligence company with expertise across the payment continuum, delivering unmatched value, transparency, and efficiency to health plan clients across the country. Deployed by over 60 health plans, including many of the top 20, and representing more than 160 million lives, Machinify brings together a fully configurable and content-rich, AI-powered platform along with best-in-class expertise. We’re constantly reimagining what’s possible in our industry, creating disruptively simple, powerfully clear ways to maximize financial outcomes and drive down healthcare costs.

Why This Role Matters

As a Data Engineer, you’ll be at the heart of transforming raw external data into powerful, trusted datasets that drive payment, product, and operational decisions. You’ll work closely with product managers, data scientists, subject matter experts, engineers, and customer teams to build, scale, and refine production pipelines — ensuring data is accurate, observable, and actionable.

You’ll also play a critical role in onboarding new customers , integrating their raw data into our internal models . Your pipelines will directly power the company’s ML models, dashboards, and core product experiences . If you enjoy owning end-to-end workflows, shaping data standards, and driving impact in a fast-moving environment, this is your opportunity.

What You’ll Do

Design and implement robust, production-grade pipelines using Python , Spark SQL , and Airflow to process high-volume file-based datasets (CSV, Parquet, JSON).

Lead efforts to canonicalize raw healthcare data (837 claims, EHR, partner data, flat files) into internal models .

Own the full lifecycle of core pipelines — from file ingestion to validated, queryable datasets — ensuring high reliability and performance.

Onboard new customers by integrating their raw data into internal pipelines and canonical models; collaborate with SMEs, Account Managers, and Product to ensure successful implementation and troubleshooting.

Build resilient, idempotent transformation logic with data quality checks , validation layers , and observability .

Refactor and scale existing pipelines to meet growing data and business needs.

Tune Spark jobs and optimize distributed processing performance.

Implement schema enforcement and versioning aligned with internal data standards.

Collaborate deeply with Data Analysts, Data Scientists, Product Managers, Engineering, Platform, SMEs, and AMs to ensure pipelines meet evolving business needs.

Monitor pipeline health, participate in on-call rotations , and proactively debug and resolve production data flow issues.

Contribute to the evolution of our data platform — driving toward mature patterns in observability, testing, and automation.

Build and enhance streaming pipelines (Kafka, SQS, or similar) where needed to support near-real-time data needs.

Help develop and champion internal best practices around pipeline development and data modeling.

What You Bring

4+ years of experience as a Data Engineer (or equivalent), building production-grade pipelines .

Strong expertise in Python , Spark SQL , and Airflow .

Experience processing large-scale file-based datasets ( CSV, Parquet, JSON, etc ) in production environments.

Experience mapping and standardizing raw external data into canonical models .

Familiarity with AWS (or any cloud), including file storage and distributed compute concepts.

Experience onboarding new customers and integrating external customer data with non-standard formats.

Ability to work across teams, manage priorities, and own complex data workflows with minimal supervision.

Strong written and verbal communication skills — able to explain technical concepts to non-engineering partners.

Comfortable designing pipelines from scratch and improving existing pipelines.

Experience working with large-scale or messy datasets (healthcare, financial, logs, etc.).

Experience building or willingness to learn streaming pipelines using tools such as Kafka or SQS .

Bonus : Familiarity with healthcare data (837, 835, EHR, UB04, claims normalization).

🌱 Why Join Us

Real impact — your pipelines will directly support decision-making and claims payment outcomes from day one.

High visibility — partner with ML, Product, Analytics, Platform, Operations, and Customer teams on critical data initiatives.

Total ownership — you’ll drive the lifecycle of core datasets powering our platform.

Customer-facing impact — you will directly contribute to successful customer onboarding and data integration.

We're hiring across multiple levels for this role. Final level and title will be determined based on experience and performance during the interview process.

Equal Employment Opportunity at Machinify

Machinify is committed to hiring talented and qualified individuals with diverse backgrounds for all of its positions. Machinify believes that the gathering and celebration of unique backgrounds, qualities, and cultures enriches the workplace.

See our Candidate Privacy Notice at : https : / / www.machinify.com / candidate -privacy-notice /

Create a job alert for this search

Sr Data Engineer • Palo Alto, CA, US

Related jobs
Data Engineer

Data Engineer

CGSSan Francisco, California, United States, 94102
Full-time
Employment Type : Full-Time, Mid-level.Department : Business Intelligence.CGS is seeking a passionate and driven Data Engineer to support a rapidly growing Data Analytics and Business Intelligence pl...Show moreLast updated: 30+ days ago
  • Promoted
Principal Data Engineer

Principal Data Engineer

StartXPalo Alto, CA, United States
Full-time
Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity.Pioneered by season...Show moreLast updated: 29 days ago
AWS Data Engineer

AWS Data Engineer

GlobalchannelmanagementSan Francisco, California, United States
Full-time
Quick Apply
AWS Data Engineer needs 7+ years of experience in Amazon Web Services (AWS) Cloud Computing.Strong background in designing, developing, and maintaining scalable data pipelines for ingestion and pro...Show moreLast updated: 30+ days ago
  • Promoted
Data Warehouse Architect

Data Warehouse Architect

VirtualVocationsHayward, California, United States
Full-time
A company is looking for a Data Warehouse Architect & AI Analytics Lead.Key Responsibilities Lead the design and implementation of extensions to a Data Vault 2. Architect and pilot predictive anal...Show moreLast updated: 30+ days ago
  • Promoted
Data Pipeline Engineer

Data Pipeline Engineer

FocusKPI Inc.Mountain View, CA, US
Temporary
FocusKPI is seeking a .The team is seeking world-class server software engineers with experience in Big Data Infrastructure and Data warehousing to join their technology innovation group,...Show moreLast updated: 1 day ago
  • Promoted
Azure Data Engineer

Azure Data Engineer

VirtualVocationsHayward, California, United States
Full-time
A company is looking for an Azure Data Engineer to implement data solutions in Azure and support Infrastructure as Code initiatives. Key Responsibilities : Design, develop, and implement integratio...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Data Center Solution Engineer

Sr. Data Center Solution Engineer

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
Senior Data Software Engineer

Senior Data Software Engineer

PsiQuantumPalo Alto, CA, United States
Full-time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show moreLast updated: 30+ days ago
  • Promoted
Principal Data Engineer

Principal Data Engineer

SanasPalo Alto, CA, United States
Full-time
Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity.Our GDP-shifting te...Show moreLast updated: 30+ days ago
  • Promoted
Data Engineer

Data Engineer

VirtualVocationsHayward, California, United States
Full-time
A company is looking for a Data Engineer - Databricks.Key Responsibilities Lead technical direction for data engineering initiatives Architect scalable data pipelines and systems using PySpark, ...Show moreLast updated: 30+ days ago
  • Promoted
Lead Data Engineer

Lead Data Engineer

VirtualVocationsOakland, California, United States
Full-time
A company is looking for a Lead Data Engineer to join their data engineering team.Key Responsibilities Lead the technical design and execution of complex data projects while mentoring mid-level e...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Software Engineer - Data Engineer

Sr. Software Engineer - Data Engineer

FyrFly Venture PartnersPleasanton, CA, United States
Full-time
Software Engineer - Data Engineer.Senior Software Engineer – Data Engineer.This role will focus on building and optimizing data pipelines, integrating multiple data sources, and enabling advanced a...Show moreLast updated: 10 days ago
  • Promoted
Staff Data Engineer, Merchandising Catalog & Taxonomy (IC) (San Francisco)

Staff Data Engineer, Merchandising Catalog & Taxonomy (IC) (San Francisco)

Attachments KingSan Francisco, CA, United States
Full-time
Attachments King is an eCommerce startup in the Heavy Equipment Industry developing proprietary software that flexibly discovers compatibility between equipment and host machine components.Were hir...Show moreLast updated: 5 days ago
  • Promoted
AI Systems & Data Engineer

AI Systems & Data Engineer

HyperFiSan Francisco, CA, United States
Full-time
We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connec...Show moreLast updated: 1 day ago
Senior Data Engineer : 24-01769 (No C2C)

Senior Data Engineer : 24-01769 (No C2C)

Akraya IncSunnyvale, California, United States
Full-time
Quick Apply
Primary Skills : SQL (Expert), Data Modeling (Advanced), Python (Advanced), AWS (Intermediate), Data Visualization (Intermediate). Duration : 11+ Months (Possible Extension).Location : Sunnyvale, CA (H...Show moreLast updated: 30+ days ago
  • Promoted
AI Engineer

AI Engineer

VirtualVocationsFremont, California, United States
Full-time
A company is looking for an AI Engineer focused on FinTech workflow automation.Key Responsibilities Design, build, and deploy AI models into production Develop and maintain backend Python servic...Show moreLast updated: 30+ days ago
  • Promoted
Senior Data Engineer

Senior Data Engineer

VirtualVocationsHayward, California, United States
Full-time
A company is looking for a Senior Data Engineer.Key Responsibilities Design, build, and maintain reliable ETL pipelines Work with large datasets using Python or Java and raw SQL Collaborate wit...Show moreLast updated: 30+ days ago
Sr. Data Engineer - W2

Sr. Data Engineer - W2

Tricon SolutionsPleasanton, CA, California, USA
Full-time
Position : Senior Data Engineer Location : Pleasanton, CA Type : Contract-W2 (Ongoing)<...Show moreLast updated: 30+ days ago
  • Promoted
Senior Solutions Engineer

Senior Solutions Engineer

Storm3Santa Clara, CA, US
Full-time
F4BC; Series A HealthTech | Clinical Data Intelligence Platform.F30D; San Francisco (Hybrid | US Only).F4B0; $160,000+ (base + benefits + equity). This Series A HealthTech company is raising the qua...Show moreLast updated: 9 days ago
  • Promoted
Senior Data Platform Engineer

Senior Data Platform Engineer

Ellipsis HealthSan Francisco, CA, United States
Full-time
Ellipsis Health is creating cutting-edge AI / ML products that solve healthcare staffing issues and administrative burdens using conversational AI and our patented voice biomarker technology in the d...Show moreLast updated: 8 days ago