Senior Data Engineer - GC / USC only
Washington, DC & New York (6-8 month contract, possible extensions)
About the Role
We are seeking an experienced Senior Data Engineer to design, build, and optimize the data systems that power our machine learning models, segmentation analytics, and measurement pipelines.
This hands-on role will focus on building and maintaining our Feature Store, creating robust data pipelines to support training and inference workflows, and ensuring that our ML and analytics code runs efficiently and at scale across Databricks and AWS environments.
You'll collaborate closely with data scientists, ML engineers, and product teams to turn analytical concepts into production-ready data and model pipelines that drive personalization, audience targeting, and performance insights across the business.
What You'll Do
Design, build, and optimize high-performance data pipelines and feature store components using Databricks (PySpark, Delta Lake, SQL) and AWS (S3, Lambda, Glue, Kinesis, EMR).
Develop and maintain a centralized Feature Store to manage and serve machine learning features consistently across training and inference environments.
Build data pipelines for audience segmentation, measurement, and model performance tracking, ensuring accuracy and scalability.
Optimize existing code and pipelines for performance, cost efficiency, and maintainability - reducing compute time, improving Spark job performance, and minimizing data latency.
Collaborate with data scientists to productionize ML models, including model ingestion, transformation, and deployment pipelines.
Implement CI/CD workflows for data and ML deployments using Databricks Workflows, GitHub Actions, or similar automation tools.
Develop and enforce data quality, lineage, and observability frameworks (e.g., Great Expectations, Monte Carlo, Soda).
Work with cloud infrastructure teams to ensure reliable production environments and efficient resource utilization.
Contribute to code reviews, documentation, and data architecture design, promoting best practices for performance and scalability.
Stay current with Databricks, Spark, and AWS ecosystem updates to continuously improve platform efficiency.
What You'll Need
Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
7-10+ years of experience in data engineering or distributed data systems development.
Deep expertise with Databricks (PySpark, Delta Lake, SQL) and strong experience with AWS (S3, Glue, EMR, Kinesis, Lambda).
Experience designing and building Feature Stores (Databricks Feature Store, Feast, or similar).
Proven ability to profile and optimize data processing code, including Spark tuning, partitioning strategies, and efficient data I/O.
Strong programming skills in Python (preferred) or Scala/Java, with an emphasis on writing performant, production-ready code.
Experience with batch and streaming pipelines, real-time data processing, and large-scale distributed computing.
Familiarity with ML model deployment and monitoring workflows (MLflow, SageMaker, custom frameworks).
Familiarity with ML model development using libraries such as scikit-learn, TensorFlow, or PyTorch.
Working knowledge of data quality frameworks, CI/CD, and infrastructure-as-code.
Excellent problem-solving and communication skills; able to collaborate across technical and product domains.
Preferred Qualifications
Experience with Databricks Unity Catalog, Delta Live Tables, and MLflow.
Understanding of segmentation, targeting, and personalization pipelines.
Experience with data observability and monitoring tools (Monte Carlo, Databand, etc.).
Familiarity with NoSQL or real-time stores (DynamoDB, Druid, Redis, etc.) for feature serving.
Exposure to containerization and orchestration (Docker, Kubernetes, Airflow, Dagster).
Strong understanding of data performance optimization principles - caching, partitioning, vectorization, and adaptive query execution.