Talent.com
Machine Learning, Platform Engineer
Machine Learning, Platform EngineerTogether AI • San Francisco, CA, United States
Machine Learning, Platform Engineer

Machine Learning, Platform Engineer

Together AI • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

This role focuses on enabling custom models and dedicated inference on Together. We are responsible for optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling.

Required Qualifications

  • 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems and API microservices
  • Experience running serverless inference platforms, doing model bring-up on short notice, being on call, or general cloud provider is a very big plus
  • Good taste and ability to thoughtfully discuss how what you’ve built has failed over time
  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
  • Excellent understanding of low level operating systems concepts including concurrency, networking and storage, performance and scale
  • Expert-level programmer in one or more of Golang, Rust, Python, C++, or Haskell
  • Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform
  • Experience with Kubernetes or other container orchestration systems
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
  • Writing-heavy roles or companies are a plus

Key Responsibilities

  • New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control panes, model bring-up, light model optimization, APIs for managing deployments, inference worker SDKs, and CLI tools.
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
  • About Together AI

    Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure.

    Compensation

    We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is : $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

    Equal Opportunity

    Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

    Please see our privacy policy at https : / / www.together.ai / privacy

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer • San Francisco, CA, United States

    Related jobs
    Founding Machine Learning Engineer

    Founding Machine Learning Engineer

    ARCADE • San Francisco, California, US
    Full-time
    Check you match the skill requirements for this role, as well as associated experience, then apply with your CV below.Senior Software Engineer, Machine Learning. We’re looking for our first product-...Show more
    Last updated: 1 day ago • Promoted
    Senior ML Platform Engineer - Large-Scale AI Infra

    Senior ML Platform Engineer - Large-Scale AI Infra

    Apple Inc. • San Francisco, CA, United States
    Full-time
    A leading technology company in San Francisco seeks a Machine Learning Engineer to contribute to AI and machine learning projects. This role involves managing large data systems, designing algorithm...Show more
    Last updated: 4 days ago • Promoted
    Senior Machine Learning Engineer, Relevance

    Senior Machine Learning Engineer, Relevance

    Patreon • San Francisco, CA, United States
    Full-time
    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...Show more
    Last updated: 15 days ago • Promoted
    Machine Learning Engineer, Enterprise Brain

    Machine Learning Engineer, Enterprise Brain

    Glean Technologies, Inc. • San Francisco, CA, United States
    Full-time
    Glean is the Work AI platform that helps everyone work smarter with AI.What began as the industry’s most advanced enterprise search has evolved into a full‑scale Work AI ecosystem, powering intell...Show more
    Last updated: 4 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Visa • Foster City, CA, United States
    Permanent
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show more
    Last updated: 7 days ago • Promoted
    Founding Machine Learning Engineer

    Founding Machine Learning Engineer

    Arcade • San Francisco, CA, United States
    Full-time
    Senior Software Engineer, Machine Learning.We’re looking for our first product-minded machine learning engineer who’s excited to push the boundaries of what’s possible with AI.At Arcade, you’ll exp...Show more
    Last updated: 19 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Trov • San Francisco, CA, United States
    Full-time
    At Pave, we’re building the industry’s leading compensation platform, combining the world's largest real-time compensation dataset with deep expertise in AI and machine learning.Our platform is per...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Latent • San Francisco, CA, United States
    Full-time
    Latent is building the intelligence infrastructure for American healthcare.Our flagship multi-modal search and question-answering platform analyzes EHR data to surface the most relevant information...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Machine Learning

    Senior Software Engineer, Machine Learning

    Planet Labs PBC • San Francisco, CA, United States
    Full-time
    We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...Show more
    Last updated: 14 days ago • Promoted
    Senior Machine Learning Platform Engineer : EVENT

    Senior Machine Learning Platform Engineer : EVENT

    Coinbase • San Francisco, CA, United States
    Full-time
    Senior Machine Learning Platform Engineer : EVENT.Senior Machine Learning Platform Engineer : EVENT.At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opp...Show more
    Last updated: 11 days ago • Promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Tubi • San Francisco, CA, United States
    Full-time
    About Tubi : Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and ...Show more
    Last updated: 30+ days ago • Promoted
    Senior / Principal Machine Learning Engineer, Performance DSP

    Senior / Principal Machine Learning Engineer, Performance DSP

    PubMatic, Inc. • Redwood City, CA, United States
    Full-time
    Senior / Principal Machine Learning Engineer, Performance DSP.PubMatic is one of the world’s leading scaled digital advertising platforms, offering more transparent advertising solutions to publishe...Show more
    Last updated: 30+ days ago • Promoted
    AIML - Senior Software Engineer, Machine Learning Platform Technologies

    AIML - Senior Software Engineer, Machine Learning Platform Technologies

    San Francisco Staffing • San Francisco, CA, United States
    Full-time
    We are looking for a Senior Software Engineer to shape how we measure the success and reliability of Apple Intelligence software features. This role is at the intersection of feature delivery, telem...Show more
    Last updated: 10 days ago • Promoted
    Machine Learning Engineer, AI Product and Platform

    Machine Learning Engineer, AI Product and Platform

    Motive Software • San Francisco, CA, United States
    Full-time
    Machine Learning Engineer, Coach Platform.Machine Learning Engineer, Coach Platform.Machine Learning Engineer, Coach Platform. Machine Learning Engineer, Coach Platform.Get AI-powered advice on this...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning, Platform Engineer

    Machine Learning, Platform Engineer

    Together • San Francisco, CA, United States
    Full-time
    This role focuses on enabling custom models and dedicated inference on Together.We are responsible for optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performanc...Show more
    Last updated: 30+ days ago • Promoted
    ML Engineer : Predictive Maintenance & Asset Intelligence

    ML Engineer : Predictive Maintenance & Asset Intelligence

    MaintainX • San Francisco, CA, United States
    Full-time
    A leading mobile-first Asset and Work Intelligence platform is seeking a Senior Applied Machine Learning Engineer to guide the architecture of predictive maintenance and asset intelligence initiati...Show more
    Last updated: 4 days ago • Promoted
    Machine Learning Engineer, Identity

    Machine Learning Engineer, Identity

    Adamcad • San Francisco, CA, United States
    Full-time
    Before Stripe, every growing internet platform had a payments team.Today, every growing internet platform has an Identity team. Identity verification is a core piece of economic infrastructure for o...Show more
    Last updated: 1 day ago • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Mercury • San Francisco, CA, United States
    Full-time
    Locations : San Francisco, CA; New York, NY; Portland, OR; or Remote within Canada or United States.Mercury is building systems to efficiently review customers’ applications to prevent money launder...Show more
    Last updated: 30+ days ago • Promoted