Talent.com
Machine Learning Engineer, vLLM Inference

Machine Learning Engineer, vLLM Inference

Red HatBoston, MA, United States
13 hours ago
Job type
  • Full-time
Job description

Overview

Machine Learning Engineer, vLLM Inference role at Red Hat. Red Hat’s Inference team accelerates AI for the enterprise and provides a stable platform for open-source LLM deployments. This role focuses on vLLM, working to improve model performance and efficiency within our open-source software stack.

What You Will Do

  • Write robust Python and C++, working on vLLM systems, high performance machine learning primitives, performance analysis and modeling, and numerical methods
  • Contribute to the design, development, and testing of various inference optimization algorithms
  • Participate in technical design discussions and provide innovative solutions to complex problems
  • Give thoughtful and prompt code reviews. Proactively utilize AI-assisted development tools for code generation, auto-completion, and intelligent suggestions to accelerate development cycles and enhance code quality
  • Mentor and guide other engineers and foster a culture of continuous learning and innovation

What You Will Bring

  • Extensive experience in writing high performance code for GPUs and deep knowledge of GPU hardware
  • Strong understanding of computer architecture, parallel processing, and distributed computing concepts
  • Experience with tensor math libraries such as PyTorch
  • Modern C++, CUDA, Triton, and CUTLASS experience
  • Mathematical software, especially linear algebra or signal processing
  • Experience optimizing kernels for deep neural networks
  • Experience with NVIDIA Nsight is a plus
  • Strong communications skills with both technical and non-technical team members
  • BS, or MS in computer science or computer engineering or a related field. A PhD in a ML related domain is considered a plus
  • Pay Transparency

    The salary range for this position is $133,650.00 - $220,680.00. Actual offer will be based on your qualifications. This position may also be eligible for bonus, commission, and / or equity. For Remote-US locations, the actual salary range may differ by location but will be commensurate with job duties and relevant experience.

    About Red Hat

    Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact. We are committed to an open, inclusive environment where ideas come from people with diverse backgrounds and perspectives.

    Benefits

  • Comprehensive medical, dental, and vision coverage
  • Flexible Spending Account - healthcare and dependent care
  • Health Savings Account - high deductible medical plan
  • Retirement 401(k) with employer match
  • Paid time off and holidays
  • Paid parental leave plans for all new parents
  • Leave benefits including disability, paid family medical leave, and paid military leave
  • Employee stock purchase plan, tuition reimbursement, and other benefits
  • Equal Opportunity Policy (EEO)

    Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, disability, medical condition, marital status, or any other basis prohibited by law. Red Hat provides reasonable accommodations to applicants and will respond to inquiries regarding application status via the designated channels.

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer • Boston, MA, United States

    Related jobs
    • Promoted
    • New!
    Azure Machine Learning Engineer

    Azure Machine Learning Engineer

    Diverse LynxBoston, MA, United States
    Full-time
    Position : Azure Machine Learning Engineer.Type of Hire : Long Term Contract.Azure Machine Learning (ML) - 6+ years’ experience required. Lead, mentor, and grow a team of identity and collaboration en...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Principal Machine Learning Engineer, Distributed vLLM Inference

    Principal Machine Learning Engineer, Distributed vLLM Inference

    Red HatBoston, MA, United States
    Full-time +1
    Principal Machine Learning Engineer, Distributed vLLM Inference page is loaded## Principal Machine Learning Engineer, Distributed vLLM Inferenceremote type : Hybridlocations : Bostonposted on : ...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer, vLLM Inference

    Machine Learning Engineer, vLLM Inference

    Red Hat, Inc.Boston, MA, United States
    Full-time +1
    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise a...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer, llama.cpp

    Machine Learning Engineer, llama.cpp

    Red HatBoston, MA, United States
    Full-time +1
    Machine Learning Engineer, llama.Machine Learning Engineer, llama.Hybridlocations : Bostonposted on : Posted Todayjob requisition id : R-046441 • •Job Summary • •At Red Hat we believe the future of ...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer

    Machine Learning Engineer

    C the SignsBoston, MA, United States
    Full-time
    The Machine Learning Engineer will be responsible for the end-to-end development and deployment of Large language and machine learning models, with a primary focus on data preprocessing, model trai...Show moreLast updated: 13 hours ago
    • Promoted
    Lead Machine Learning Engineer

    Lead Machine Learning Engineer

    Capital OneHarvard Square, MA, US
    Full-time +1
    Lead Machine Learning Engineer As a Capital One Machine Learning Engineer (MLE), you'll be part of an Agile team dedicated to productionizing machine learning applications and systems at scale.You’...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Staff Machine Learning Engineer - AI / ML Risk Platform

    Staff Machine Learning Engineer - AI / ML Risk Platform

    CoinbaseBoston, MA, United States
    Full-time
    Ready to be pushed beyond what you think you’re capable of?.At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, ...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Principal Machine Learning Engineer, Distributed vLLM Inference

    Principal Machine Learning Engineer, Distributed vLLM Inference

    Red Hat, Inc.Boston, MA, United States
    Full-time +1
    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise a...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer

    Machine Learning Engineer

    Analog DevicesBoston, MA, United States
    Full-time
    Machine Learning Engineer at Analog Devices.You will play a key role in developing and implementing machine learning models that enhance semiconductor solutions. This position offers the opportunity...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer - Modeling

    Machine Learning Engineer - Modeling

    PryonBoston, MA, United States
    Full-time
    Machine Learning Engineer - Modeling.This range is provided by Pryon.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. We’re a team of AI, technol...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer

    Machine Learning Engineer

    PeKe LabsBoston, MA, United States
    Full-time
    We're hiring our first ML Software Engineer to work alongside our cofounder, Thane Hunt, in developing our software stack to help interpret raw data s. We're building the next generation of technolo...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer – LLM Engineering

    Machine Learning Engineer – LLM Engineering

    LiberateBoston, MA, United States
    Full-time
    Liberate is reimagining how the $2.By starting with voice, the most valuable and complex channel in insurance, the company proved that even the hardest problems can be automated.Now expanding into ...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer (LLM)

    Machine Learning Engineer (LLM)

    DeepRec.aiBoston, MA, United States
    Full-time
    US Recruitment Consultant : Guiding GenAI professionals towards their dream careers.Machine Learning Engineer (LLM).Boston, 2 days per week in-office. We’re working with a fast-growing AI company on ...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer

    Machine Learning Engineer

    Lawrence HarveyBoston, MA, United States
    Full-time
    Design & deploy on-prem NLP / ML solutions.Build intelligent agents for real-time comms, integrations, and workflows.Develop dashboards, APIs & infrastructure exposing ML insights to non-technical te...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer

    Machine Learning Engineer

    Red HatBoston, MA, United States
    Full-time +1
    Machine Learning Engineer page is loaded## Machine Learning Engineerremote type : Hybridlocations : Bostonposted on : Posted Todayjob requisition id : R-050384At Red Hat we believe the future o...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Platform Engineer

    Machine Learning Platform Engineer

    Brigham and Women's HospitalBoston, MA, United States
    Full-time
    Mass General Brigham relies on a wide range of professionals, including doctors, nurses, business people, tech experts, researchers, and systems analysts to advance our mission.As a not-for-profit,...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer, Distributed vLLM Inference

    Machine Learning Engineer, Distributed vLLM Inference

    Red Hat, Inc.Boston, MA, United States
    Full-time +1
    At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise a...Show moreLast updated: 13 hours ago
    • Promoted
    • New!
    Machine Learning Engineer – LLM Engineering Boston

    Machine Learning Engineer – LLM Engineering Boston

    Liberate Inc.Boston, MA, United States
    Full-time
    Location : Boston, MA — hybrid role, 2 days per week in-office.Build LLM-driven pipelines for information extraction and workflow automation in insurance. Develop multi-turn, long-context agents that...Show moreLast updated: 13 hours ago