Senior MLOps Engineer
Monad Foundation • Mountain View, CA, United States
1 day ago
Job type
  • Full-time
Job description

Fortytwo is a decentralized AI protocol on Monad that leverages idle consumer hardware for swarm inference. It enables Small Language Models to achieve advanced multi-step reasoning at lower costs, surpassing the performance and scalability of leading models.

Responsibilities:

Deploy scalable, production-ready ML services with optimized infrastructure and auto-scaling Kubernetes clusters.

Optimize GPU resources using MIG (Multi-Instance GPU) and NOS (Node Offloading System).

Manage cloud storage (e.g., S3) to ensure high availability and performance.

Integrate state-of-the-art ML techniques, such as LoRA and model merging, into workflows:

Work with SOTA ML codebases and adapt them to organizational needs.

Integrate LoRA (Low-Rank Adaptation) techniques and model merging workflows.

Deploy and manage large language models (LLMs), small language models (SLMs), and large multimodal models (LMMs).

Serve ML models using technologies like Triton Inference Server.

Leverage solutions such as vLLM, TGI (Text Generation Inference), and other state-of-the-art serving frameworks.

Optimize models with ONNX and TensorRT for efficient deployment.

Develop Retrieval-Augmented Generation (RAG) systems integrating spreadsheet, math, and compiler processors.

Set up monitoring and logging solutions using Grafana, Prometheus, Loki, Elasticsearch, and OpenSearch.

Write and maintain CI/CD pipelines using GitHub Actions for seamless deployment processes.

Create Helm templates for rapid Kubernetes node deployment.

Automate workflows using cron jobs and Airflow DAGs.

Requirements:

Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.

Proficiency in Kubernetes, Helm, and containerization technologies.

Experience with GPU optimization (MIG, NOS) and cloud platforms (AWS, GCP, Azure).

Strong knowledge of monitoring tools (Grafana, Prometheus) and scripting languages (Python, Bash).

Hands-on experience with CI/CD tools and workflow management systems.

Familiarity with Triton Inference Server, ONNX, and TensorRT for model serving and optimization.

Preferred:

5+ years of experience in MLOps or ML engineering roles.

Experience with advanced ML techniques, such as multi-sampling and dynamic temperatures.

Knowledge of distributed training and large model fine-tuning.

Proficiency in Go or Rust programming languages.

Experience designing and implementing highly secure MLOps pipelines, including secure model deployment and data encryption.

Why Work with Us:

At Fortytwo, we are building a research-driven, decentralized AI infrastructure that prioritizes scalability, efficiency, and sustainability. Our approach moves beyond centralized AI constraints, applying globally scalable swarm intelligence to enhance LLM reasoning and problem-solving capabilities.

Engage in meaningful AI research – Work on decentralized inference, multi-agent systems, and efficient model deployment with a team that values rigorous, first-principles thinking.

Build scalable and sustainable AI – Design AI systems that reduce reliance on massive compute clusters, making advanced models more efficient, accessible, and cost-effective.

Collaborate with a highly technical team – Join engineers and researchers who are deeply experienced, intellectually curious, and motivated by solving hard problems.

We’re looking for individuals who thrive in research-driven environments, value autonomy, and want to work on foundational AI challenges.
