Job Description
Our client is scaling production ML systems and needs a hands-on engineer to help build, maintain, and run essential ML data pipelines. You’ll own high-throughput data ingestion and transformation workflows (including image- and array-type modalities), enforce rigorous data quality standards, and partner with research and platform teams to keep models fed with reliable, versioned datasets.
- Design, build, and operate reliable ML data pipelines for batch and/or streaming use cases across cloud environments.
- Develop robust ETL/ELT processes (ingest, validate, cleanse, transform, and publish) with clear SLAs and monitoring.
- Implement data quality gates (schema checks, null/outlier handling, drift and bias signals) and data versioning for reproducibility (see the sketch after this list).
- Optimize pipelines for distributed computing and large modalities (e.g., images, multi-dimensional arrays).
- Automate repetitive workflows with CI/CD and infrastructure-as-code; document, test, and harden for production.
- Collaborate with ML, Data Science, and Platform teams to align datasets, features, and model training needs.
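For illustration only, a minimal sketch of the kind of data quality gate referenced above, assuming a pandas-based batch step; the column names, dtypes, and the 3×IQR outlier threshold (user_id, price) are hypothetical placeholders, not part of the client’s actual stack.

```python
import pandas as pd

# Hypothetical quality gate run before a batch is published downstream.
# Column names, dtypes, and thresholds here are illustrative assumptions only.
EXPECTED_SCHEMA = {"user_id": "int64", "price": "float64"}


def quality_gate(df: pd.DataFrame) -> list:
    """Return a list of validation errors; an empty list means the batch passes."""
    errors = []

    # Schema check: required columns must exist with the expected dtype.
    for col, dtype in EXPECTED_SCHEMA.items():
        if col not in df.columns:
            errors.append(f"missing column: {col}")
        elif str(df[col].dtype) != dtype:
            errors.append(f"{col}: expected {dtype}, got {df[col].dtype}")

    # Null check: required columns that are present must not contain nulls.
    for col in EXPECTED_SCHEMA:
        if col in df.columns and df[col].isna().any():
            errors.append(f"{col}: {int(df[col].isna().sum())} null values")

    # Simple outlier signal: flag values far outside the interquartile range.
    if "price" in df.columns and pd.api.types.is_numeric_dtype(df["price"]):
        q1, q3 = df["price"].quantile([0.25, 0.75])
        iqr = q3 - q1
        mask = (df["price"] < q1 - 3 * iqr) | (df["price"] > q3 + 3 * iqr)
        if mask.any():
            errors.append(f"price: {int(mask.sum())} potential outliers")

    return errors


if __name__ == "__main__":
    batch = pd.DataFrame({"user_id": [1, 2, 3], "price": [9.99, None, 12.50]})
    problems = quality_gate(batch)
    print("FAIL" if problems else "PASS", problems)
```

In a real pipeline, checks like these would typically be wired into the orchestration layer so that failing batches are quarantined and alerted on rather than published.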
Minimum Qualifications:
- 5+ years building and operating data pipelines in production.
- Cloud: Hands-on with AWS, Azure, or GCP services for storage, compute, orchestration, and security.
- Programming: Strong proficiency in Python and common data/ML libraries (pandas, NumPy, etc.).
- Distributed compute: Experience with at least one of Spark, Dask, or Ray.
- Modalities: Experience handling image-type and array-type data at scale.
- Automation: Proven ability to automate repetitive tasks (shell/Python scripting, CI/CD).
- Data Quality: Implemented validation, cleansing, and transformation frameworks in production.
- Data Versioning: Familiar with tools/practices such as DVC, LakeFS, or similar.
- Languages: Fluent in English or Farsi.
Strongly Preferred:
- SQL expertise (writing performant queries; optimizing on large datasets).
- Data warehousing/lakehouse concepts and tools (e.g., Snowflake/BigQuery/Redshift; Delta/Lakehouse patterns).
- Data virtualization/federation exposure (e.g., Presto/Trino) and semantic/metadata layers.
- Orchestration (Airflow, Dagster, Prefect) and observability/monitoring for data pipelines.
- MLOps practices (feature stores, experiment tracking, lineage, artifacts).
- Containers & IaC (Docker; Terraform/CloudFormation) and CI/CD for data/ML workflows.
- Testing for data/ETL (unit/integration tests, great_expectations or similar).
Soft Skills:
- Executes independently and creatively; comfortable owning outcomes in ambiguous environments.
- Proactive communicator who collaborates cross-functionally with DS/ML/Platform stakeholders.
Location: Seattle, WA
Duration: 1+ year
Pay: $56/hr