Talent.com
Machine Learning / AI Operations Engineer
Machine Learning / AI Operations EngineerArcade • San Francisco, CA, United States
Machine Learning / AI Operations Engineer

Machine Learning / AI Operations Engineer

Arcade • San Francisco, CA, United States
13 days ago
Job type
  • Full-time
Job description

Everyone's talking about AI. But here's the truth : ChatGPT can't send your emails. It can't book your flights. It can't even order you lunch.

Why? Because AI is trapped in a chat box. It can't take real actions in the real world.

We are changing that forever. We're not just building another AI company - we're creating the infrastructure that will power every AI application you'll use in the future.

The Revolution Needs You

Every AI app needs agentic "tools" - special functions that let AI models take real actions. Without tools, AI can only chat. With tools, AI can actually do things. We're building the definitive tools catalog and tool-calling platform that will unlock AI's true potential. Think Zapier for AI Actions. Think Auth0 for AI. Think really big.

Why This Is The Opportunity of a Lifetime

  • Founder-Market Fit : Our CEO previously founded Stormpath (acquired by Okta), where he created the first Authentication API for developers. He's done this before - and this time the market is 10x bigger. Our CTO led the vector database team at Redis, shipped 100+ LLM applications, and is a contributor to LangChain and LlamaIndex. He knows this space better than anyone.
  • Dream Team : We've assembled authentication, integrations, distributed systems, and AI experts from Okta, Redis, Microsoft, Splunk, Ngrok, Google, Airbyte, Disney, Snowflake, and HPE who've built and founded multiple successful developer platforms.
  • Perfect Timing : We're at the inflection point of AI adoption. The biggest problem isn't better models - it's connecting AI to real-world actions. That's us.
  • Massive Market : We're building critical infrastructure for the biggest technological shift of our generation. Every AI app will need what we're building.
  • Backed By The Best : Our investors have backed Databricks, Clickhouse, MongoDB, Perplexity, Cohere, ScaleAI, Confluent, Elastic, and Firebase. They see what we see - this is going to be huge.

The Challenge

As our first Machine Learning / AI Operations Engineer, you will be responsible for building and maintaining the models and infrastructure that power Arcade's agentic features. We are building state‑of‑the‑art features to make everyone's agents more powerful, including tool selection, memory & context management, and related products. For certain tasks, building custom models is the way to deliver the performance and accuracy that customers are asking for, but hasn't been possible until now. We need your help ensuring that these features work reliably and quickly within our cloud and on-premise for our enterprise customers.

What You’ll Do

  • Build : Create bleeding‑edge models fine‑tuned for Arcade's agentic products.
  • Deploy : Test and deploy our models and related application software, both on‑prem and in our cloud.
  • Monitor : Prevent model drift. Make the models and APIs better. Collect the data you need to do it.
  • Build our stack. Use your experience to chose the right tools for the job, balancing speed, maintainability and cost.
  • Shape the roadmap for the team.
  • Build leverage (via AI) - projects that take a week today should take a day next time.
  • Share your work with our customers and community, building our (and your) brand.
  • Required Skills

  • An insatiable desire to ship.
  • Strong understanding of the state of the art in machine learning, especially LLMs and tool-calling (e.g. MCP).
  • Comfortable with tuning libraries (HuggingFace Trainer, DeepSpeed, FSDP, QLoRA, etc)
  • Familiarity with model lifecycle management tools (MLflow, Weights & Biases, DVC, etc)
  • Experience with model optimization, quantization and deployment formats (ONNX, OpenVino, TensortRT, etc)Experience with modern monitoring tools (Prometheus, Grafana, Datadog, ELK, Arize AI, etc.)
  • Production experience with at least one major agent framework (Langchain, LlamaIndex, OpenAI Agents SDK, Mastra, etc)
  • 5+ years of software engineering experience comprising of :

  • 3+ years experience working on a production level ML training or inference system
  • 2+ years of ML Expertise with frameworks for ML (TensorFlow, PyTorch, Scikit-learn, etc)
  • 2+ years of backend development experience with either Python or Go
  • 2+ years of experience with infrastructure and deployment (AWS / GCP, Terraform, Helm, etc)
  • Strong experience building and maintaining production APIs.
  • Track record of writing clean, well‑documented, well‑tested code.
  • User-centered approach to designing developer-centric products and tools.
  • Bonus Points

  • Experience building developer tools or SDKs
  • Open-source contributions
  • Experience with high-scale distributed systems
  • You’ve been a startup founder or early-state startup employee before and love it.
  • Join The Movement

    We're not just building a product - we're leading a movement to transform AI from just chatbots to agents that can take actions against real systems. This is your chance to be at the forefront of that revolution.

    If you want to look back in 5 years and say, "I helped build that", then we want to talk to you.

    Ready to make AI actually useful? Apply Now

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer Ai • San Francisco, CA, United States

    Related jobs
    Machine Learning Ops Engineer

    Machine Learning Ops Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Machine Learning Ops Engineer.Key Responsibilities Design and implement AI cost tracking and observability frameworks across Azure, Google Cloud, and Snowflake Collabo...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Machine Learning Engineer.Key Responsibilities Design, develop, and deploy intelligent systems and multi-agent architectures for autonomous reasoning and planning Buil...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Data Engineer

    Machine Learning Data Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Machine Learning Data Engineer, Research - 3D Data Focus.Key Responsibilities Ingest, clean, normalize, and preprocess data for machine learning model training pipeline...Show more
    Last updated: 2 days ago • Promoted
    Senior Machine Learning Manager

    Senior Machine Learning Manager

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Senior Machine Learning Manager, Gen AI & Knowledge AI.Key Responsibilities Lead the vision, design, and execution of LLM-powered AI products, including system architec...Show more
    Last updated: 30+ days ago • Promoted
    Staff Machine Learning Engineer - Responsible AI

    Staff Machine Learning Engineer - Responsible AI

    Pinterest • San Francisco, CA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show more
    Last updated: 30+ days ago • Promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Staff Machine Learning Engineer - Wildfire.Key Responsibilities Architect and build advanced ML models to predict vegetation and fuel conditions Design and maintain da...Show more
    Last updated: 30+ days ago • Promoted
    Lead AI Engineer

    Lead AI Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Lead AI Engineer (Remote).Key Responsibilities : Define the AI-first vision and technical roadmap, owning the design and architecture of AI features Leverage expertise ...Show more
    Last updated: 30+ days ago • Promoted
    Remote Machine Learning Engineer

    Remote Machine Learning Engineer

    VirtualVocations • Santa Clara, California, United States
    Remote
    Full-time
    A company is looking for a Remote Machine Learning Engineer.Key Responsibilities Design, develop, and architect high-performance AI / ML services and pipelines in a cloud-based environment Contrib...Show more
    Last updated: 4 days ago • Promoted
    Staff Machine Learning Engineer - Ads Modules

    Staff Machine Learning Engineer - Ads Modules

    Pinterest • Palo Alto, CA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show more
    Last updated: 30+ days ago • Promoted
    Senior Manager, AI / ML Platform

    Senior Manager, AI / ML Platform

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Senior Manager, Artificial Intelligence - Machine Learning Platform.Key Responsibilities Lead the strategic direction, development, and continuous improvement of the AI...Show more
    Last updated: 4 days ago • Promoted
    Junior AI / ML Engineer

    Junior AI / ML Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Junior AI / ML Engineer.Key Responsibilities Develop, test, and optimize AI / ML models focusing on hybrid AI architectures Implement innovative algorithms and conduct exp...Show more
    Last updated: 30+ days ago • Promoted
    Engineering Manager, Machine Learning

    Engineering Manager, Machine Learning

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for an Engineering Manager, Machine Learning.Key Responsibilities Lead and mentor a team of data scientists in developing and monitoring ML models at scale Oversee the desig...Show more
    Last updated: 1 day ago • Promoted
    AI Engineer Team Lead

    AI Engineer Team Lead

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for an AI Engineer and Team Lead to enhance its core product with innovative AI features.Key Responsibilities Lead and support the AI team, ensuring alignment across discipli...Show more
    Last updated: 2 days ago • Promoted
    Lead Machine Learning Engineer

    Lead Machine Learning Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Lead Machine Learning Engineer.Key Responsibilities Develop and manage ML infrastructure for data engineering, LLM training, and deployment Architect cloud-native solu...Show more
    Last updated: 30+ days ago • Promoted
    Applied Machine Learning Engineer

    Applied Machine Learning Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for an Applied Machine Learning Engineer to design, build, and deploy ML-driven solutions for cybersecurity. Key Responsibilities Develop, train, and deploy machine learning m...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI / ML Developer

    Senior AI / ML Developer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a (Senior) AI / ML Developer with over 3 years of experience, fully remote.Key Responsibilities Facilitate the development of applications utilizing Large Language Models a...Show more
    Last updated: 23 days ago • Promoted
    High Performance AI Engineer

    High Performance AI Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a High Performance AI Engineer to build groundbreaking multi-agent systems for the CUDA ecosystem. Key Responsibilities Design, build, and optimize agentic AI systems for ...Show more
    Last updated: 4 days ago • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Senior Machine Learning Engineer, Security.Key Responsibilities Design, build, and deploy machine learning models to detect and mitigate security threats Develop algor...Show more
    Last updated: 30+ days ago • Promoted