Data Engineer

INSPYR SolutionsCupertino, CA, United States

4 days ago

Job type

Full-time

Job description

ABOUT THIS FEATURED OPPORTUNITY

Join our Data Operations Team as a Python Engineer , supporting machine learning and AI teams that depend on high-quality datasets to train their models. You'll work at the intersection of data engineering, automation, and operational excellence , delivering datasets across approximately 200 projects per year . These include use cases such as image generation, animation, and other generative AI applications . Many projects are highly confidential- engineers must be able to assess data quality and relevance even without full visibility into the end use case .

We're looking for someone who can design and manage data pipelines, debug issues efficiently , and operate independently across multiple fast-paced projects. Strong communication and attention to detail are essential -you'll need to respond quickly, handle issues proactively, and deliver accurate work the first time. Mistakes or rework can pose serious risks to project timelines , so precision and accountability are critical. The ideal candidate will be highly responsive, reliable, and thorough in communication , and must be available to work 9am-4pm PST , even if located in a different state.

THE OPPORTUNITY FOR YOU

Work on 3-4 projects to start , scaling up to 6-10 during peak season
Contribute to data collection, annotation, and generation pipelines using Python and distributed systems (Spark)
Collaborate with a tight-knit and highly responsive team , engaging in biweekly check-ins with team leads
Gain experience with confidential, multimodal, and LLM-related datasets across a high volume of AI / ML projects
Influence how large-scale datasets are prepared for training models across an enterprise AI org

KEY SUCCESS FACTORS

2+ years of experience in data engineering or Python development, with a strong foundation in Computer Science or Data Science

Proficiency in distributed systems (e.g., Spark), and solid understanding of multithreading vs. multiprocessing

Demonstrated ability to design scalable pipelines , handle diverse data structures, and manage large-scale workflows

Comfortable operating under pressure, context-switching across multiple projects, and working with ambiguity

NICE TO HAVES

Familiarity with Airflow , Spark , or Flask for scalable API / UI development

Experience with Docker , containerization, and CI / CD tools (e.g., Jenkins)

Exposure to LLMs , multi-modal data , or generative AI workflows

Prior involvement in designing tools to automate or scale ML data pipelines

Ability to collaborate in a high-volume, high-trust environment -your work will power some of the most impactful ML use cases in the organization

25-14750

Create a job alert for this search

Data Engineer • Cupertino, CA, United States

Related jobs

Promoted

Data Engineer

Stefanini GroupSan Francisco, CA, United States

Full-time

Stefanini is looking for a Data Engineer for various location across USA (Hybrid).For quick apply, please connect with Prakhar Goel : 248 263 5255 / Prakhar. As a Data Engineer, this CW will be respon...Show moreLast updated: 30+ days ago

Promoted

Data Engineer

Krea.ai, IncSan Francisco, CA, United States

Full-time

At Krea, we are building next-generation AI creative tools.We are dedicated to making AI intuitive and controllable for creatives. Our mission is to build tools that empower human creativity, not re...Show moreLast updated: 4 days ago

Promoted

Data Engineer

FigmaSan Francisco, CA, United States

Full-time

Figma is growing our team of passionate creatives and builders on a mission to make design accessible to all.Figma's platform helps teams bring ideas to life-whether you're brainstorming, creating ...Show moreLast updated: 4 days ago

Promoted

Data Engineer

Mercor IncSan Francisco, CA, United States

Full-time

Mercor is training models that predict how well someone will perform on a job better than a human can.We use our platform to source, vet, and onboard expert contractors who help train AI models in ...Show moreLast updated: 4 days ago

Promoted

Data Engineer

Crystal EquationFremont, CA, United States

Full-time

Senior Data Engineer Job Description.We are looking for a versatile data professional, capable of leading end-to-end projects. The responsibilities include requirements gathering and cross-functiona...Show moreLast updated: 4 days ago

Promoted

Data Engineer

KodiakMountain View, CA, United States

Full-time

The company has developed an artificial intelligence (AI) powered technology stack purpose-built for commercial trucking and the public sector. The company delivers freight daily for its customers a...Show moreLast updated: 4 days ago

Promoted

Data Engineer

AdobeSan Jose, CA, United States

Full-time

Changing the world through digital experiences is what Adobe's all about.We give everyone-from emerging artists to global brands-everything they need to design and deliver exceptional digital exper...Show moreLast updated: 4 days ago

Promoted

Data Engineer 4

NextDeavorSan Jose, CA, United States

Temporary

Here's how you'll become a key player with this opportunity : .We are looking for a Splunk expert to join on a short-term contract and help stabilize, optimize, and improve our Splunk environment.Thi...Show moreLast updated: 4 days ago

Promoted

Data Engineer

Cloudflare IncSan Francisco, CA, United States

Full-time

At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for cust...Show moreLast updated: 4 days ago

Promoted

Data Engineer

Sinclair Broadcast GroupSan Francisco, CA, US

Full-time

Digital Remedy is looking for a skilled Data Engineer to join our growing team of analytics experts.As a Data Engineer, you will help design, build, maintain, and troubleshoot data processing syste...Show moreLast updated: 18 days ago

Promoted

Data Engineer

Air AppsSan Francisco, CA, United States

Full-time

At Air Apps, we believe in thinking bigger-and moving faster.We're a family-founded company on a mission to create the world's first AI-powered Personal & Entrepreneurial Resource Planner (PRP), an...Show moreLast updated: 4 days ago

Promoted

Data Engineer

TranzealSunnyvale, CA, United States

Full-time

Design, build, and maintain robust data pipelines using Apache Spark and Python.Develop and manage workflow orchestration using Apache Airflow. Implement scalable data solutions on Google Cloud Plat...Show moreLast updated: 4 days ago

Promoted

Data Engineer

Ekman AssociatesMenlo Park, CA, United States

Full-time

Data Engineer (Data Pipelines & Visualization).Ekman Associates is a management consulting firm that specializes in developing business, digital, and technology strategy, delivering solutions, and ...Show moreLast updated: 4 days ago

Promoted

Data Engineer

EVERYDaly City, CA, US

Full-time

EVERY™ is a leading VC-backed food tech ingredient company and market leader using precision fermentation to create animal proteins without the animal for the global food and beverage industr...Show moreLast updated: 2 days ago

Promoted

Data Engineer - AG

BayoneSunnyvale, CA, United States

Full-time

Demonstrates up-to-date expertise and applies this to the development,.Supporting and aligning efforts to meet customer, business. Create software design and architecture for next software solution....Show moreLast updated: 30+ days ago

Promoted

Data Engineer

EVERY •San Francisco, CA, United States

Full-time

Promoted

Databricks Data Engineer

Tekfortune IncPleasanton, CA, United States

Permanent

Tekfortune is a fast-growing consulting firm specialized in permanent, contract & project-based staffing services for world's leading organizations in a broad range of industries.In this quickly ch...Show moreLast updated: 4 days ago

Promoted

Data Engineer

Brevian.aiSunnyvale, CA, United States

Full-time

BREV / AN is at the forefront of revolutionizing how businesses leverage artificial intelligence.Our no-code platform empowers every business team to harness the power of production-grade AI agents, ...Show moreLast updated: 4 days ago

Promoted
New!

Data Engineer

Stellar Consulting Solutions, LLCPleasanton, CA, US

Full-time

Job Description : HM prefers candidate to be on site at Pleasanton.Proficiency in Spark, Python, and SQL is essential for this role. Experience with relational databases such as Oracle, NoSQL databas...Show moreLast updated: less than 1 hour ago

Promoted

Lead Data Engineer

AdobeSan Jose, CA, United States

Full-time