Talent.com
Engineering Manager - Model Performance

Baseten, San Francisco, CA, US
30+ days ago
Job type
  • Full-time
Job description

Join Our Dynamic Team at Baseten

At Baseten, we're revolutionizing AI deployment with cutting-edge inference infrastructure. Backed by premier investors such as IVP, Spark Capital, Greylock, and Conviction, we're trusted by leading enterprises and AI-driven innovators, including Descript, Bland.ai, Patreon, Writer, and Robust Intelligence, to deliver top-tier performance, security, and reliability for their production workloads. With our recent $75 million Series C funding, we're poised to accelerate our mission to make AI accessible across all products. If you're passionate about tackling impactful challenges and building transformative solutions from the ground up, we invite you to join us on this exciting journey!

The Role

Are you passionate about advancing the frontiers of artificial intelligence while leading a team of exceptional engineers? We are looking for a Tech Lead Manager focused on ML performance and inference. This role is ideal for someone with a strong engineering background who is eager to lead and mentor a team while remaining hands-on with technology. If you thrive in a fast-paced startup environment and are excited about both leadership and technical challenges, we want to hear from you.

Responsibilities

  • Lead, mentor, and manage a team of engineers focused on developing and optimizing ML model inference and performance.
  • Oversee technical strategy and architecture decisions, driving improvements across our engineering organization.
  • Collaborate with cross-functional teams to ensure seamless integration and scalability of ML models in production environments.
  • Dive into the codebase of frameworks like TensorRT, PyTorch, CUDA, and others to identify and solve complex performance bottlenecks.
  • Drive the development and deployment of large-scale optimization techniques for various ML models, especially large language models (LLMs).
  • Own the full lifecycle of projects from inception through delivery, including planning, execution, and resource management.
  • Foster a collaborative, inclusive team environment that encourages continuous learning and growth.

Requirements

  • Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, or a related field.
  • 5+ years of professional experience in software engineering, with at least 2 years in a technical leadership role.
  • Proven experience managing and mentoring teams of engineers.
  • Expertise in one or more programming languages, such as Python, C++, or Go.
  • In-depth understanding of ML model performance optimization, especially using libraries such as PyTorch, TensorRT, and CUDA.
  • Strong knowledge of containerization (Docker) and orchestration systems (Kubernetes).
  • Experience with production-level AI / ML solutions, including scaling and deploying large models.
  • Ability to balance hands-on technical work with team leadership and project management.

Bonus Points

  • Experience enhancing the performance of large language models (LLMs) or similar AI systems.
  • Familiarity with LLM optimization techniques such as quantization, speculative decoding, or continuous batching.
  • Deep knowledge of GPU architecture and performance tuning.
  • Previous experience in a high-growth startup environment.

Benefits

  • Competitive compensation package (unlimited PTO, 401(k), covered healthcare premiums).
  • An opportunity to lead a talented engineering team at a rapidly growing startup in the machine learning space.
  • Inclusive and supportive work culture with ample opportunities for professional development.
  • Exposure to a wide range of ML use cases, offering unmatched learning and networking potential.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
