Talent.com
Machine Learning, Platform Engineer
Machine Learning, Platform EngineerTogether AI • San Francisco, CA, United States
Machine Learning, Platform Engineer

Machine Learning, Platform Engineer

Together AI • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

This role focuses on enabling custom models and dedicated inference on Together. We are responsible for optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling.

Required Qualifications

  • 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems and API microservices
  • Experience running serverless inference platforms, doing model bring-up on short notice, being on call, or general cloud provider is a very big plus
  • Good taste and ability to thoughtfully discuss how what you’ve built has failed over time
  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
  • Excellent understanding of low level operating systems concepts including concurrency, networking and storage, performance and scale
  • Expert-level programmer in one or more of Golang, Rust, Python, C++, or Haskell
  • Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform
  • Experience with Kubernetes or other container orchestration systems
  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
  • Writing-heavy roles or companies are a plus

Key Responsibilities

  • New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control panes, model bring-up, light model optimization, APIs for managing deployments, inference worker SDKs, and CLI tools.
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
  • About Together AI

    Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure.

    Compensation

    We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is : $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

    Equal Opportunity

    Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

    Please see our privacy policy at https : / / www.together.ai / privacy

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer • San Francisco, CA, United States

    Related jobs
    Machine Learning Engineer, GenAI Platform

    Machine Learning Engineer, GenAI Platform

    Lightfield • San Francisco, California, United States
    Full-time
    Lightfield is a new kind of CRM.It's a collaborative system for founders to find, understand, and serve customers faster than anything before it. It captures every customer interaction, generates an...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Casap • San Francisco, California, United States
    Remote
    Full-time
    Series A startup that has raised over $30M from Emergence, Lightspeed, and Primary Ventures.Based in San Francisco, the company was founded by product leaders from Robinhood and Chime.We are on a m...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer, Relevance

    Machine Learning Engineer, Relevance

    Patreon • San Francisco, California, United States
    Full-time
    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Frontend - Machine Learning Platform

    Software Engineer, Frontend - Machine Learning Platform

    Doordash Usa • San Francisco, California, United States
    Full-time
    DoorDash is building the world’s most reliable on-demand logistics engine.Behind the scenes, our Machine Learning Platform (MLP) powers critical real-time decision-making for millions of orders eac...Show more
    Last updated: 30+ days ago • Promoted
    Founding Senior Machine Learning Engineer

    Founding Senior Machine Learning Engineer

    Retell Ai • San Francisco Bay, California, United States
    Full-time
    Retell AI is using the first principles to reimagine the call center with cutting edge voice AI.We believe voice is still the most natural way humans communicate, yet it has been trapped in outdate...Show more
    Last updated: 30+ days ago • Promoted
    MTS, Machine Learning Engineer

    MTS, Machine Learning Engineer

    Delphina • San Francisco, California, United States
    Full-time
    Today’s Data Scientists are in pain - spending their time manually wrangling data, building models through slow trial and error, taking on painstaking rewrites for deployment, and dealing with coun...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Machine Learning

    Senior Software Engineer, Machine Learning

    Planet Labs PBC • San Francisco, CA, United States
    Full-time
    We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...Show more
    Last updated: 16 days ago • Promoted
    Machine Learning, Platform Engineer

    Machine Learning, Platform Engineer

    Together Ai • San Francisco, California, United States
    Full-time
    This role focuses on enabling custom models and dedicated inference on Together.We are responsible for optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performanc...Show more
    Last updated: 30+ days ago • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Curai • San Francisco, California, United States
    Full-time
    Curai Health is an AI-powered virtual clinic on a mission to improve access to care at scale.As the pioneer in deploying machine learning into clinical workflows, Curai Health enables its dedicated...Show more
    Last updated: 30+ days ago • Promoted
    Founding Machine Learning Engineer

    Founding Machine Learning Engineer

    Fermàt • San Francisco, California, United States
    Full-time
    Commerce brands to transform clicks into conversions with highly-personalized , 1 : 1 dynamic shopping experiences.We've raised $30M+ to date and are backed by Bain Capital Ventures, Greylock, QED, a...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Jobot • San Francisco, CA, US
    Full-time
    Entry Level ML Engineer Needed for Growing AI Startup!.This Jobot Job is hosted by : Reed Kellick.Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume.Salary : ...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Bland • San Francisco, California, United States
    Full-time
    Based out of San Francisco, we're a quickly growing team striving to change the way customers interact with businesses.We've raised $65 million from Silicon Valley's finest; Including Emergence Cap...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Platform Engineer, API

    Machine Learning Platform Engineer, API

    Anthropic • San Francisco, California, United States
    Full-time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show more
    Last updated: 30+ days ago • Promoted
    Principal Machine Learning Engineer, Account Identity

    Principal Machine Learning Engineer, Account Identity

    Roblox • San Mateo, California, United States
    Full-time
    At Roblox, we strive to connect a billion people with optimism and civility, and the Safety organization’s mission is to become the leader in civil immersive online communities.We systematically de...Show more
    Last updated: 30+ days ago • Promoted
    Principal Machine Learning Engineer, Ads Measurement

    Principal Machine Learning Engineer, Ads Measurement

    Reddit • San Francisco, California, United States
    Full-time
    Reddit is a community of communities.It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote...Show more
    Last updated: 30+ days ago • Promoted
    Senior Machine Learning Engineer - Data Foundation and AI

    Senior Machine Learning Engineer - Data Foundation and AI

    Plaid • San Francisco, California, United States
    Full-time
    We believe that the way people interact with their finances will drastically improve in the next few years.We’re dedicated to empowering this transformation by building the tools and experiences th...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - Machine Learning Platform

    Software Engineer - Machine Learning Platform

    Snowflake • Menlo Park, California, United States
    Full-time
    The Snowflake Machine Learning Platform team’s mission is to enable customers to bring their ML / AI workload to Snowflake. Our customers want to leverage ML / AI to extract business values from ever in...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer South San Francisco CA

    Machine Learning Engineer South San Francisco CA

    Esrhealthcare • San Bruno, California, United States
    Full-time
    Machine Learning Engineer (Operations).South San Francisco CA (Hybrid, 3 days / week) (Not remote).Strong understanding of machine learning concepts, algorithms, and best practices.Proven experience ...Show more
    Last updated: 30+ days ago • Promoted