Overview
At Microsoft Copilot, we focus on building the best AI powered products in the world. We’re building applied AI products designed to improve over time and we need someone to architect and build the infrastructure that makes that possible.
As an Machine Learning Operations (MLOps) Engineer , you’ll build the connective tissue between our models and the real world. You’re not just deploying models – you’re building the systems that accelerate model improvement and drive continuous learning from production.
This is a high‑impact, high‑autonomy role where your infrastructure decisions directly shape product quality and our ability to iterate. If you’ve ever felt frustrated by the gap between ML’s potential and its messy reality in production, this is your chance to close it.
The next wave of AI products won’t win on model architecture alone – they’ll win with robust infrastructure for continuous improvement. You’ll build the infrastructure that makes our AI products genuinely intelligent, not just generative. Every system you create shortens the loop between user feedback and model improvement, directly impacting product quality and user experience.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non‑U.S., country‑specific) of that location. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities / What you’ll build
- Training pipelines that scale elegantly – design and implement robust training infrastructure that handles everything from data ingestion to model versioning, making it trivial for ML engineers to experiment and deploy with confidence
- The data flywheel – build the infrastructure and product features that capture user interactions, ground truth labels, and edge cases, then automatically route them back into training loops. Turn every production interaction into a training example
- Inference systems that deliver – dive deep into model serving architecture, optimize latency, manage costs, implement intelligent caching, and build the observability needed to maintain reliability at scale
- Deployment pipelines with guardrails – create deployment systems that balance velocity with safety : automated testing, gradual rollouts, performance monitoring, and quick rollback mechanisms
- Cross‑functional infrastructure – partner closely with ML engineers, platform engineers, and data scientists to build APIs and tools that enable tight, rapid feedback loops from production back to model development
Required Qualifications
Doctorate in Computer Science, Statistics, Software Engineering, or related field AND 3+ years applied ML engineering experienceOR Master’s Degree in Computer Science, Statistics, Software Engineering, or related field AND 4+ years applied ML engineering experienceOR Bachelor’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 6+ years applied ML experience,OR equivalent experience.6+ years experience building and operating ML systems in production, with real stories about what breaks at scale and how you fixed it5+ years of experience in software engineering fundamentals with experience in distributed systems, containerization (Docker / Kubernetes), and cloud platforms (AWS / GCP / Azure)5+ years of hands‑on experience with ML orchestration tools (Airflow, Kubeflow, Metaflow), experiment tracking, model registries, and feature stores5+ years of experience optimizing model inference, wrestled with GPU utilization, and know the tradeoffs between latency, throughput, and costPreferred Qualifications
Doctorate in Computer Science, Data Engineering, Software Engineering, or related field AND 6+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)OR Master’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 8+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)OR Bachelor’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 10+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)OR equivalent experience.Familiarity with LLM deployment patterns, vector databases, prompt management, and the unique challenges of serving foundation modelsExperience working with RAG, fine‑tuning pipelines, or evaluation frameworksThe ability to see beyond individual components to design holistic systems where data flows naturally from production through improvement cycles and backDesire and preference to work at the intersection of teams, translating between ML researchers who want flexibility and engineers who need reliabilityData Science IC4 – The typical base pay range for this role across the U.S. is USD 119,800 – 234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD 158,400 – 258,000 per year.
Data Science IC5 – The typical base pay range for this role across the U.S. is USD 139,900 – 274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD 188,000 – 304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here : https : / / careers.microsoft.com / us / en / us-corporate-pay
Microsoft will accept applications and processes offers for these roles on an ongoing basis.
#J-18808-Ljbffr