Talent.com
Software Engineer - ML Performance
Software Engineer - ML PerformanceBaseten • San Ramon, California, United States
Software Engineer - ML Performance

Software Engineer - ML Performance

Baseten • San Ramon, California, United States
30+ days ago
Job type
  • Full-time
Job description

ABOUT BASETEN

We’re a growing team of builders backed by top-tier investors, including IVP , Spark Capital , Greylock , and Sarah Guo at Conviction . ML teams at enterprises and category-defining AI-native companies like Descript , Bland.ai , Patreon , Writer , and Robust Intelligence use Baseten to power their core production workloads with best-in-class performance, security, and reliability. While we’ve unlocked PMF and secured Series B funding , the ML infrastructure market is massive, and we’re just getting started. If you’re excited to work on engaging and relevant problems while building something new from the ground up, come join us!

THE ROLE

Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make significant contributions to the exciting field of LLM Inference. If you are a backend engineer who thrives on making things faster and is excited about open-source ML models, we look forward to your application.

RESPONSIBILITIES :

Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, kv cache reuse, chunked prefill and LoRA) for ML model inference and infrastructure.

Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to debug ML performance issues.

Apply and scale optimization techniques across a wide range of ML models, particularly large language models.

Collaborate with a diverse team to design and implement innovative solutions.

Own projects from idea to production.

REQUIREMENTS :

Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.

Experience with one or more general-purpose programming languages, such as Python or C++.

Familiarity with LLM optimization techniques (e.g., quantization, speculative decoding, continuous batching).

Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.

Demonstrated interest and experience in LLM’s.

Deep understanding of GPU architecture.

BONUS POINTS :

Proficiency in enhancing the performance of software systems, particularly in the context of large language models (LLMs).

Experience with CUDA or similar technologies.

Deep understanding of software engineering principles and a proven track record of developing and deploying AI / ML inference solutions.

Experience with Docker and Kubernetes.

BENEFITS :

Competitive compensation package (Unlimited PTO, 401k, covered healthcare premiums).

This is a unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era.

An inclusive and supportive work culture that fosters learning and growth.

Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply Now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

Create a job alert for this search

Software Engineer • San Ramon, California, United States

Related jobs
Software Engineer, Preprint

Software Engineer, Preprint

Velo3d • Fremont, California, United States
Full-time
We are starting a new team to bring machine learning into our build setup pipeline.You will be working in close collaboration with the people who develop our print processes and help customers with...Show more
Last updated: 30+ days ago • Promoted
ML Software Engineer, Online Maps

ML Software Engineer, Online Maps

Aurora Innovation • Mountain View, California, United States
Full-time
Aurora hires talented people with diverse backgrounds who are ready to help build a transportation ecosystem that will make our roads safer, get crucial goods where they need to go, and make mobili...Show more
Last updated: 30+ days ago • Promoted
Software Engineer - Observability

Software Engineer - Observability

Snowflake • Menlo Park, California, United States
Full-time
The Observability team at Snowflake is in charge of building an extensible, self-service Observability platform that reliably collects and serves telemetry data such as metrics, logs, traces to bot...Show more
Last updated: 30+ days ago • Promoted
AI / ML System Software Engineer, Staff

AI / ML System Software Engineer, Staff

D-matrix • Santa Clara, California, United States
Full-time
AI to power the transformation of technology.We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. We value humility and believe in direct communic...Show more
Last updated: 30+ days ago • Promoted
Applied ML Engineer

Applied ML Engineer

Kognitos • San Jose, California, United States
Full-time
Kognitos is at the forefront of revolutionizing the trillion-dollar hyper-automation market.Our mission is to redefine how software is built and maintained by leveraging cutting-edge multi-agent au...Show more
Last updated: 30+ days ago • Promoted
Software Engineer, Fullstack

Software Engineer, Fullstack

Ambient.ai • Redwood City, California, United States
Full-time
Build a safer world with us, one incident at a time.AI-powered physical security platform helping the world’s leading enterprises reduce risk, improve operational efficiency, and gain critical insi...Show more
Last updated: 1 day ago • Promoted
Software Engineer, ML

Software Engineer, ML

Heartflow • San Francisco, California, United States
Full-time
Heartflow is a medical technology company advancing the diagnosis and management of coronary artery disease, the #1 cause of death worldwide, using cutting-edge technology.The flagship product—an A...Show more
Last updated: 30+ days ago • Promoted
Experienced Software Engineer - Machine Learning Infra

Experienced Software Engineer - Machine Learning Infra

Plaid • San Francisco, California, United States
Full-time
The Plaid Machine Learning Infrastructure (ML Infra) team is responsible for creating and maintaining the foundational systems, tools, and processes that enable efficient, scalable, reliable, respo...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer - Machine Learning Platform

Senior Software Engineer - Machine Learning Platform

Snowflake • Menlo Park, California, United States
Full-time
The Snowflake Machine Learning Platform team’s mission is to enable customers to bring their machine learning and deep learning workloads to Snowflake. Our customers want to build powerful models wi...Show more
Last updated: 30+ days ago • Promoted
Software Engineer - Machine Learning

Software Engineer - Machine Learning

Celonis • Redwood City, California, United States
Full-time
We're Celonis, the global leader in Process Mining technology and one of the world's fastest-growing SaaS firms.We believe there is a massive opportunity to unlock productivity by placing data and ...Show more
Last updated: 30+ days ago • Promoted
Software Engineer - ML Infrastructure

Software Engineer - ML Infrastructure

Specter • San Francisco, California, United States
Full-time
Specter is creating a software-defined "control plane" for the physical world.We are starting with protecting American businesses by granting them ubiquitous perception over their physical assets.T...Show more
Last updated: 3 days ago • Promoted
Software Engineer, Machine Learning Infrastructure

Software Engineer, Machine Learning Infrastructure

Datologyai • Redwood City, California, United States
Full-time
Companies want to train their own large models on their own data.The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to mode...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer - Machine Learning

Senior Software Engineer - Machine Learning

Celonis • Redwood City, California, United States
Full-time
We're Celonis, the global leader in Process Intelligence technology and one of the world's fastest-growing SaaS firms.We believe there is a massive opportunity to unlock productivity by placing AI,...Show more
Last updated: 2 days ago • Promoted
Staff Software Platform EngineerSoftware Engineering • Berkeley, CA; Somerville, MA; Weirton, WV • Full time • On-site

Staff Software Platform EngineerSoftware Engineering • Berkeley, CA; Somerville, MA; Weirton, WV • Full time • On-site

Form Energy • Berkeley, CA, United States
Full-time
Are you ready to build America's energy future? Form Energy is an American manufacturing and energy technology company.We're revolutionizing energy storage with cost-effective, multi-day technology...Show more
Last updated: 30+ days ago • Promoted
AI / ML System Software Engineer, Senior Staff

AI / ML System Software Engineer, Senior Staff

D-matrix • Santa Clara, California, United States
Full-time
AI to power the transformation of technology.We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. We value humility and believe in direct communic...Show more
Last updated: 30+ days ago • Promoted
Software Engineer L4 / L5, Model Serving Systems, Machine Learning Platform

Software Engineer L4 / L5, Model Serving Systems, Machine Learning Platform

Netflix • Los Gatos, California, United States
Full-time
Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and lan...Show more
Last updated: 30+ days ago • Promoted
Software Engineer L4, Machine Learning Platform (Metaflow)

Software Engineer L4, Machine Learning Platform (Metaflow)

Netflix • Los Gatos, California, United States
Full-time
Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and lan...Show more
Last updated: 30+ days ago • Promoted
Software Engineer - Machine Learning Platform

Software Engineer - Machine Learning Platform

Snowflake • Menlo Park, California, United States
Full-time
The Snowflake Machine Learning Platform team’s mission is to enable customers to bring their ML / AI workload to Snowflake. Our customers want to leverage ML / AI to extract business values from ever in...Show more
Last updated: 30+ days ago • Promoted