Member of Technical Staff- Machine Learning Operations EngineerMicrosoft • Mountain View, CA, United States

Member of Technical Staff- Machine Learning Operations Engineer

Microsoft • Mountain View, CA, United States

11 hours ago

Job type

Full-time

Job description

Overview

At Microsoft Copilot, we focus on building the best AI powered products in the world. We’re building applied AI products designed to improve over time and we need someone to architect and build the infrastructure that makes that possible.

As an Machine Learning Operations (MLOps) Engineer , you’ll build the connective tissue between our models and the real world. You’re not just deploying models – you’re building the systems that accelerate model improvement and drive continuous learning from production.

This is a high‑impact, high‑autonomy role where your infrastructure decisions directly shape product quality and our ability to iterate. If you’ve ever felt frustrated by the gap between ML’s potential and its messy reality in production, this is your chance to close it.

The next wave of AI products won’t win on model architecture alone – they’ll win with robust infrastructure for continuous improvement. You’ll build the infrastructure that makes our AI products genuinely intelligent, not just generative. Every system you create shortens the loop between user feedback and model improvement, directly impacting product quality and user experience.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non‑U.S., country‑specific) of that location. This expectation is subject to local law and may vary by jurisdiction.

Responsibilities / What you’ll build

Training pipelines that scale elegantly – design and implement robust training infrastructure that handles everything from data ingestion to model versioning, making it trivial for ML engineers to experiment and deploy with confidence
The data flywheel – build the infrastructure and product features that capture user interactions, ground truth labels, and edge cases, then automatically route them back into training loops. Turn every production interaction into a training example
Inference systems that deliver – dive deep into model serving architecture, optimize latency, manage costs, implement intelligent caching, and build the observability needed to maintain reliability at scale
Deployment pipelines with guardrails – create deployment systems that balance velocity with safety : automated testing, gradual rollouts, performance monitoring, and quick rollback mechanisms
Cross‑functional infrastructure – partner closely with ML engineers, platform engineers, and data scientists to build APIs and tools that enable tight, rapid feedback loops from production back to model development

Required Qualifications

Doctorate in Computer Science, Statistics, Software Engineering, or related field AND 3+ years applied ML engineering experience

OR Master’s Degree in Computer Science, Statistics, Software Engineering, or related field AND 4+ years applied ML engineering experience

OR Bachelor’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 6+ years applied ML experience,

OR equivalent experience.

6+ years experience building and operating ML systems in production, with real stories about what breaks at scale and how you fixed it

5+ years of experience in software engineering fundamentals with experience in distributed systems, containerization (Docker / Kubernetes), and cloud platforms (AWS / GCP / Azure)

5+ years of hands‑on experience with ML orchestration tools (Airflow, Kubeflow, Metaflow), experiment tracking, model registries, and feature stores

5+ years of experience optimizing model inference, wrestled with GPU utilization, and know the tradeoffs between latency, throughput, and cost

Preferred Qualifications

Doctorate in Computer Science, Data Engineering, Software Engineering, or related field AND 6+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)

OR Master’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 8+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)

OR Bachelor’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 10+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)

OR equivalent experience.

Familiarity with LLM deployment patterns, vector databases, prompt management, and the unique challenges of serving foundation models

Experience working with RAG, fine‑tuning pipelines, or evaluation frameworks

The ability to see beyond individual components to design holistic systems where data flows naturally from production through improvement cycles and back

Desire and preference to work at the intersection of teams, translating between ML researchers who want flexibility and engineers who need reliability

Data Science IC4 – The typical base pay range for this role across the U.S. is USD 119,800 – 234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD 158,400 – 258,000 per year.

Data Science IC5 – The typical base pay range for this role across the U.S. is USD 139,900 – 274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD 188,000 – 304,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here : https : / / careers.microsoft.com / us / en / us-corporate-pay

Microsoft will accept applications and processes offers for these roles on an ongoing basis.

#J-18808-Ljbffr

Create a job alert for this search

Member Of Technical Staff • Mountain View, CA, United States

Related jobs

Staff Machine Learning Engineer, Team Lead

Attentive • San Francisco, CA, United States

Full-time

We build advanced ML models that predict customer behaviors in real-time, enabling highly personalized shopping experiences. Joining our team offers a high-growth career opportunity to work with som...Show more

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

Coupang • Mountain View, CA, United States

Full-time

We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” Born out of an obsession to make shopping, eating, and living easier than ever, we’re col...Show more

Last updated: 11 hours ago • Promoted • New!

Staff Machine Learning Engineer

Moloco • Redwood City, CA, United States

Full-time

Moloco is a machine learning company empowering organizations of all sizes to grow and unlock the full value of their unique first-party data, elevating the traditional path to performance advertis...Show more

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

Aarki • San Francisco, CA, United States

Full-time

Aarki is an AI-driven company specializing in mobile advertising solutions designed to fuel revenue growth.We leverage AI to discover audiences in a privacy-first environment through trillions of c...Show more

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

VirtualVocations • Santa Clara, California, United States

Full-time

A company is looking for a Staff Machine Learning Engineer - Wildfire.Key Responsibilities Architect and build advanced ML models to predict vegetation and fuel conditions Design and maintain da...Show more

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

Hive • San Francisco, CA, United States

Full-time

Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...Show more

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

scribehow.com • San Francisco, CA, United States

Full-time

We're looking for a highly motivated and skilled Staff Machine Learning Engineer to build the future of AI productivity software. You will lead high-impact research and development, drive our core A...Show more

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

Adobe Inc. • San Jose, CA, United States

Full-time

Adobe Experience Intelligence Team is looking for a Staff Machine Learning Engineer who will apply AI and machine learning techniques to big-data problems to help Adobe better understand, manage an...Show more

Last updated: 30+ days ago • Promoted

Member of Technical Staff - Machine Learning Engineering, AGI Autonomy

Amazon • San Francisco, CA, United States

Full-time

Amazon has launched a new research lab in San Francisco to develop foundational capabilities for useful AI agents.We’re enabling practical AI to make our customers more productive, empowered, and f...Show more

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

Moloco, Inc. • Redwood City, CA, United States

Full-time

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

Suki • Redwood City, CA, United States

Full-time

Get AI-powered advice on this job and more exclusive features.The Future of Healthcare Needs You.At Suki, we’re building technology that listens, understands, and gets out of the way — so clinician...Show more

Last updated: 30+ days ago • Promoted

Staff Machine Learning Engineer

Ambience Healthcare, Inc. • San Francisco, CA, United States

Full-time

Ambience is developing the most capable AI systems for healthcare and medicine.As healthcare costs soar to 17.US GDP and a projected shortage of 100,000 physicians within the next decade, the need ...Show more

Last updated: 30+ days ago • Promoted

Staff machine learning engineer

Watershed • San Francisco, CA, United States

Full-time

At Watershed, we strive to design consistent, fair, and competitive compensation programs.The total cash compensation range may be inclusive of several levels at Watershed and final offer will be d...Show more

Last updated: 21 days ago • Promoted

Staff Machine Learning Engineer

Tubi Tv • San Francisco, CA, United States

Full-time

Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users.Tubi offers the world's largest collection of Hollywood movies and TV shows, th...Show more

Last updated: 30+ days ago • Promoted

Staff / Principal Machine Learning Engineer - USA

Inworld AI • Mountain View, CA, United States

Full-time

At Inworld, we believe that the benefits of AI should extend beyond business workflows to the applications and experiences that we enjoy every day. We began by pushing the frontier of lifelike, inte...Show more

Last updated: 1 day ago • Promoted

Member of Technical Staff, Machine Learning Engineer

Microsoft Corporation • Mountain View, CA, United States

Full-time

Member of Technical Staff, Machine Learning Engineer.Mountain View, California, United States.As a Member of Technical Staff - Machine Learning Engineer, you will work on the Technical Safety squad...Show more

Last updated: 30+ days ago • Promoted