Talent.com
Machine Learning / AI Operations Engineer

Machine Learning / AI Operations Engineer

ArcadeSan Francisco, CA, United States
1 day ago
Job type
  • Full-time
Job description

Everyone's talking about AI. But here's the truth : ChatGPT can't send your emails. It can't book your flights. It can't even order you lunch.

Why? Because AI is trapped in a chat box. It can't take real actions in the real world.

We are changing that forever. We're not just building another AI company - we're creating the infrastructure that will power every AI application you'll use in the future.

The Revolution Needs You

Every AI app needs agentic "tools" - special functions that let AI models take real actions. Without tools, AI can only chat. With tools, AI can actually do things. We're building the definitive tools catalog and tool-calling platform that will unlock AI's true potential. Think Zapier for AI Actions. Think Auth0 for AI. Think really big.

Why This Is The Opportunity of a Lifetime

  • Founder-Market Fit : Our CEO previously founded Stormpath (acquired by Okta), where he created the first Authentication API for developers. He's done this before - and this time the market is 10x bigger. Our CTO led the vector database team at Redis, shipped 100+ LLM applications, and is a contributor to LangChain and LlamaIndex. He knows this space better than anyone.
  • Dream Team : We've assembled authentication, integrations, distributed systems, and AI experts from Okta, Redis, Microsoft, Splunk, Ngrok, Google, Airbyte, Disney, Snowflake, and HPE who've built and founded multiple successful developer platforms.
  • Perfect Timing : We're at the inflection point of AI adoption. The biggest problem isn't better models - it's connecting AI to real-world actions. That's us.
  • Massive Market : We're building critical infrastructure for the biggest technological shift of our generation. Every AI app will need what we're building.
  • Backed By The Best : Our investors have backed Databricks, Clickhouse, MongoDB, Perplexity, Cohere, ScaleAI, Confluent, Elastic, and Firebase. They see what we see - this is going to be huge.

The Challenge

As our first Machine Learning / AI Operations Engineer, you will be responsible for building and maintaining the models and infrastructure that power Arcade's agentic features. We are building state?of?the?art features to make everyone's agents more powerful, including tool selection, memory & context management, and related products. For certain tasks, building custom models is the way to deliver the performance and accuracy that customers are asking for, but hasn't been possible until now. We need your help ensuring that these features work reliably and quickly within our cloud and on-premise for our enterprise customers.

What Youll Do

  • Build : Create bleeding?edge models fine?tuned for Arcade's agentic products.
  • Deploy : Test and deploy our models and related application software, both on?prem and in our cloud.
  • Monitor : Prevent model drift. Make the models and APIs better. Collect the data you need to do it.
  • Build our stack. Use your experience to chose the right tools for the job, balancing speed, maintainability and cost.
  • Shape the roadmap for the team.
  • Build leverage (via AI) - projects that take a week today should take a day next time.
  • Share your work with our customers and community, building our (and your) brand.
  • Required Skills

  • An insatiable desire to ship.
  • Strong understanding of the state of the art in machine learning, especially LLMs and tool-calling (e.g. MCP).
  • Comfortable with tuning libraries (HuggingFace Trainer, DeepSpeed, FSDP, QLoRA, etc)
  • Familiarity with model lifecycle management tools (MLflow, Weights & Biases, DVC, etc)
  • Experience with model optimization, quantization and deployment formats (ONNX, OpenVino, TensortRT, etc)Experience with modern monitoring tools (Prometheus, Grafana, Datadog, ELK, Arize AI, etc.)
  • Production experience with at least one major agent framework (Langchain, LlamaIndex, OpenAI Agents SDK, Mastra, etc)
  • 5+ years of software engineering experience comprising of :
  • 3+ years experience working on a production level ML training or inference system

  • 2+ years of ML Expertise with frameworks for ML (TensorFlow, PyTorch, Scikit-learn, etc)
  • 2+ years of backend development experience with either Python or Go
  • 2+ years of experience with infrastructure and deployment (AWS / GCP, Terraform, Helm, etc)
  • Strong experience building and maintaining production APIs.
  • Track record of writing clean, well?documented, well?tested code.
  • User-centered approach to designing developer-centric products and tools.
  • Bonus Points

  • Experience building developer tools or SDKs
  • Open-source contributions
  • Experience with high-scale distributed systems
  • Youve been a startup founder or early-state startup employee before and love it.
  • Join The Movement

    We're not just building a product - we're leading a movement to transform AI from just chatbots to agents that can take actions against real systems. This is your chance to be at the forefront of that revolution.

    If you want to look back in 5 years and say, "I helped build that", then we want to talk to you.

    Ready to make AI actually useful? Apply Now

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer Ai • San Francisco, CA, United States

    Related jobs
    • Promoted
    AI Engineer - Machine Learning (US)

    AI Engineer - Machine Learning (US)

    Gauss LabsPalo Alto, CA, United States
    Full-time
    AI Engineer - Machine Learning (US).Continue with Google Continue with Google.AI Engineer - Machine Learning (US).Gauss Labs is looking for a passionate and talented AI Engineer for developing cutt...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Research Engineer, Agents - Enterprise GenAI

    Machine Learning Research Engineer, Agents - Enterprise GenAI

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, ...Show moreLast updated: 12 days ago
    • Promoted
    AI Engineer

    AI Engineer

    RelantoFremont, CA, US
    Full-time
    Engage with customers to understand problem statements and technical requirements.Design and architect AI / ML cloud solutions. Develop system architecture diagrams and establish best practices for ML...Show moreLast updated: 6 days ago
    • Promoted
    Machine Learning Engineer, AI Agent Platform

    Machine Learning Engineer, AI Agent Platform

    GrammarlySan Francisco, CA, United States
    Full-time
    Machine Learning Engineer, AI Agent Platform.Grammarly offers a dynamic hybrid working model for this role.This flexible approach gives team members the best of both worlds : plenty of focus time al...Show moreLast updated: 20 days ago
    • Promoted
    Staff Machine Learning Engineer - Responsible AI

    Staff Machine Learning Engineer - Responsible AI

    PinterestPalo Alto, CA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer - Ads Modules

    Staff Machine Learning Engineer - Ads Modules

    PinterestPalo Alto, CA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning / AI Operations Engineer

    Machine Learning / AI Operations Engineer

    ArcadeSan Francisco, CA, United States
    Full-time
    But here's the truth : ChatGPT can't send your emails.Why? Because AI is trapped in a chat box.It can't take real actions in the real world. We're not just building another AI company - we're creatin...Show moreLast updated: 24 days ago
    • Promoted
    AI & Machine Learning Engineer - Manager - Consulting - Open Location

    AI & Machine Learning Engineer - Manager - Consulting - Open Location

    EYSan Francisco, CA, United States
    Full-time
    At EY, we're all in to shape your future with confidence.We'll help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you want it to go.Join EY and help ...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer, AI Platform

    Staff Machine Learning Engineer, AI Platform

    General MotorsSunnyvale, CA, United States
    Full-time
    Remote : This role is based remotely but if you live within a 50-mile radius of Mountain View, you are expected to report to that location three times a week, at minimum. We are seeking an experience...Show moreLast updated: 30+ days ago
    • Promoted
    AI / Machine Learning Engineer

    AI / Machine Learning Engineer

    4 Staffing CorpFremont, CA, United States
    Full-time
    About the job AI / Machine Learning Engineer.Our client is seeking a highly skilled and motivated AI / Machine Learning Engineer to join their team. As an AI / Machine Learning Engineer, you will play a c...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer, Tidal Personalization AI

    Staff Machine Learning Engineer, Tidal Personalization AI

    Block USAAlbany, CA, United States
    Full-time
    TIDAL was founded for artists by artists as the next innovative streaming platform to bring value back to the music industry. We empower artists with the products, resources, services, and content r...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Operations Contractor

    Machine Learning Operations Contractor

    CoherentFremont, CA, United States
    Full-time
    Machine Learning Operations (MLOps) Contractor.The emphasis will be on yield improvement, screening accuracy, and design optimization. The MLOps Contractor will develop and validate models based on ...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer, AI

    Staff Machine Learning Engineer, AI

    SentrySan Francisco, CA, United States
    Full-time
    Bad software is everywhere, and were tired of it.Sentry is on a mission to help developers write better software faster so we can get back to enjoying technology. With more than $217 million in fund...Show moreLast updated: 30+ days ago
    • Promoted
    AI & Machine Learning Engineer - Senior - Consulting - Open Location

    AI & Machine Learning Engineer - Senior - Consulting - Open Location

    EYSan Francisco, CA, United States
    Full-time
    At EY, we're all in to shape your future with confidence.We'll help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you want it to go.Join EY and help ...Show moreLast updated: 30+ days ago
    • Promoted
    AI Engineer - Machine Learning (US)

    AI Engineer - Machine Learning (US)

    Slab Inc.Palo Alto, CA, United States
    Full-time
    Gauss Labs is looking for a passionate and talented AI Engineer for developing cutting-edge Industrial AI solutions that will normalize the standard of AI for manufacturing.We are working with the ...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Diverse LynxSan Francisco, CA, United States
    Full-time
    Experience presenting data to executives (must-have).Strong communication, written skills, and interpersonal skills (required to establish and maintain inter-departmental relationships.Experience e...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer, Applied AI Platform

    Staff Machine Learning Engineer, Applied AI Platform

    Block USAAlbany, CA, United States
    Full-time
    Block is one company built from many blocks, all united by the same purpose of economic empowerment.The blocks that form our foundational teams - People, Finance, Counsel, Hardware, Information Sec...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Operations (MLOps) Engineer

    Machine Learning Operations (MLOps) Engineer

    Together AISan Francisco, CA, United States
    Full-time
    Together AI is looking for an MLOps engineer who will develop systems and APIs that enable our customers to perform inference and fine tune LLMs. Relevant experience includes implementing runtime sy...Show moreLast updated: 30+ days ago