Talent.com
No longer accepting applications
Software Engineer - ML / LLM Inference

Alldus • San Francisco, CA, US
1 day ago
Job type
  • Full-time
Job description

My client is searching for a talented engineer to work on ML / LLM inference and serving. They specialize in developing next-gen LLM fine-tuning and inference engines.

We are seeking a talented and motivated Software Engineer specializing in Machine Learning (ML) and Large Language Model (LLM) inference to join our dynamic ML Inference team. In this role, you will bridge the gap between AI / ML research and systems programming to build and enhance our next-generation LLM Inference Engine. You will play a crucial role in optimizing the performance, scalability, and efficiency of our LLM serving systems.

Key Responsibilities :

Develop and Enhance Inference Engine :

  • Design, implement, and optimize the next-generation LLM Inference Engine.
  • Integrate the latest LLM inference techniques from research to enhance latency and throughput.

Performance Optimization :

  • Conduct deep performance optimizations across multiple layers of the technology stack, including PyTorch, C++, and CUDA.
  • Analyze and improve system performance to meet the demands of various use cases.
  • Work closely with customers to understand specific performance requirements and optimize solutions accordingly.
  • Provide technical expertise and support to ensure successful deployment and operation of inference systems.

Technical Leadership :

  • Define the roadmap and technical vision for the inference stack.
  • Lead initiatives to drive innovation and maintain the competitive edge of our inference technologies.

Infrastructure Development :

  • Collaborate with partner teams to build and maintain scalable, multi-replica serving infrastructure.
  • Ensure the reliability and scalability of LLM serving systems to handle increasing workloads.

Qualifications :

Technical Skills :

  • Proficiency in systems programming languages such as C++.
  • Strong experience with machine learning frameworks, particularly PyTorch.
  • Expertise in GPU programming and CUDA for performance optimization.
  • Solid understanding of AI / ML concepts, especially related to large language models.

Experience :

  • Proven experience in developing and optimizing ML / LLM inference systems.
  • Demonstrated ability to integrate research advancements into production systems.
  • Experience with performance tuning and profiling across various technology stacks.
  • Experience with vLLM.

Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Industries
  • Staffing and Recruiting, Software Development