Talent.com
AI / ML Model Runtime Engineer

AI / ML Model Runtime Engineer

Broadcom CorporationSan Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Please Note :

1. If you are a first time user, please create your candidate login account before you apply for a job. (Click Sign In >

Create Account)

2. If you already have a Candidate Account, please Sign-In before you apply.

Job Description :

Broadcom is looking for a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key to building a best in class private cloud AI platform. You will have a high impact by playing a critical role designing and implementing scalable solutions along with a team of talented and enthusiastic engineers.

This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control plane that operates ML inferencing services. The successful candidate must have experience contributing to Open Source projects, and experience participating in upstream Kubernetes and related CNCF projects as a contributor is a major plus.

The AI & Advanced Services team is responsible for building AI platform capabilities into the VMware Cloud Foundation product to enable our enterprise customers to have all of the AI platform features they need to build, deploy, test, manage, and scale their AI infrastructure and workloads.

Responsibilities

  • Collaborate with cross-functional teams to design and deliver expanded capabilities of Kubernetes-based platform services for AI
  • Troubleshoot and resolve complex issues related to Private AI inference services and their performance
  • Augment the AI services platform to support diverse inferencing and embedding engines (e.g,. vLLM or HuggingFace Transformers library)
  • Develop enhancements to the AI services platform to support a large set of ML models
  • Participate in code reviews and ensure that the code is aligned with VMware's coding standards and best practices
  • Develop and maintain automated tests to ensure the quality and reliability of the Private AI feature set

Requirements

  • Ability to succeed on an take-home homework assignment and in-person technical interview including coding and debugging
  • 5+ years experience in production quality Python code
  • 5+ years of hands on experience with Container technologies (Docker and Kubernetes)
  • Experience with CUDA or other numeric computing libraries and their Python bindings is a plus
  • Strong analytical and diagnostic skills with ability to independently solve complex problems
  • Excellent communication and collaboration skills, with the ability to work with cross-functional teams
  • Experience with agile development methodologies and version control systems, such as Git
  • BS in Computer Science or related technical fields and 12+ years of related experience in the software industry or MS in Computer Science or related technical fields and 10+ years of related experience in the software industry
  • Candidate should not require work visa / sponsorship
  • What We Offer :

  • Competitive salary and benefits package
  • Opportunities for career growth and professional development
  • Collaborative and dynamic work environment
  • Access to cutting-edge technologies and tools
  • Additional Job Description :

    Compensation and Benefits

    The annual base salary range for this position is $127,100 - $226,000.

    This position is also eligible for a discretionary annual bonus in accordance with relevant plan documents, and equity in accordance with equity plan documents and equity award agreements.

    Broadcom offers a competitive and comprehensive benefits package : Medical, dental and vision plans, 401(K) participation including company matching, Employee Stock Purchase Program (ESPP), Employee Assistance Program (EAP), company paid holidays, paid sick leave and vacation time. The company follows all applicable laws for Paid Family Leave and other leaves of absence.

    Broadcom is proud to be an equal opportunity employer. We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law. We will also consider qualified applicants with arrest and conviction records consistent with local law.

    If you are located outside USA, please be sure to fill out a home address as this will be used for future correspondence.

    Create a job alert for this search

    Aiml Engineer • San Francisco, CA, United States

    Related jobs
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Reflective IncSan Francisco, CA, United States
    Full-time
    Sunlight reflection may be the only available option, alongside dramatic emissions reductions, adaptation, and rapid scaling of carbon removal, to rapidly limit many climate impacts over the coming...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Air AppsSan Francisco, CA, United States
    Full-time
    At Air Apps, we believe in thinking bigger-and moving faster.We're a family-founded company on a mission to create the world's first AI-powered Personal & Entrepreneurial Resource Planner (PRP), an...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, ML Runtime & Optimization

    Machine Learning Engineer, ML Runtime & Optimization

    Pony.aiFremont, CA, United States
    Full-time
    Founded in 2016 in Silicon Valley, Pony.Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony. CNBC Disruptor list of the 50 most innovative and disruptive tech comp...Show moreLast updated: 30+ days ago
    • Promoted
    AIML - Machine Learning Engineer, Foundation Model Services

    AIML - Machine Learning Engineer, Foundation Model Services

    Apple Inc.Santa Clara, CA, United States
    Full-time
    Work closely with product teams to build production grade solutions to launch models serving millions of customers in real time. Work along side Foundation Model Research team to prototype and devel...Show moreLast updated: 30+ days ago
    • Promoted
    AIML - Machine Learning Researcher, MLR

    AIML - Machine Learning Researcher, MLR

    AppleCupertino, CA, United States
    Full-time
    Play a part in building the next revolution in machine learning technology.We’re looking for passionate researchers to work on ambitious, curiosity-driven, long-term research projects.In this role,...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Architect

    AI / ML Architect

    Cooley LLPSan Francisco, CA, United States
    Full-time
    Cooley is seeking an AI / ML Architect to join the Practice Engineering team within the Innovation department.As a leading technology law firm, Cooley is determined to become a leader in the digital ...Show moreLast updated: 28 days ago
    • Promoted
    Staff Machine Learning Engineer - Responsible AI

    Staff Machine Learning Engineer - Responsible AI

    PinterestPalo Alto, CA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Seven Seven SoftwareSan Francisco, CA, United States
    Full-time
    AI / ML (Artificial Intelligence , Machine Learning) Engineer.Experience in engineering and deploying Generative AI models, specifically focusing on Retrieval-Augmented Generation (RAG) systems and...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    General MotorsSan Francisco, CA, United States
    Full-time
    As an AI / ML Engineer on the Metrics Frameworks team, part of the Simulation, Evaluation, and Data organization, you will be an individual contributor focused on developing and optimizing infrastruc...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Cogent SecuritySan Francisco, CA, United States
    Full-time
    Cogent Security is on a mission to stop breaches and prevent cybercrime by innovating at the frontier of generative AI systems. We are building the world's first AI cyber taskforce, composed of AI a...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    PragmatikeSan Francisco, CA, United States
    Full-time
    On-site Cambridge, MA (Eastern Time / UTC -4).This role is on-site in in the.Please inquire for more details during the interview process. We are hiring at Pragmatike to expand our team and drive th...Show moreLast updated: 30+ days ago
    • Promoted
    Founding Applied ML Engineer

    Founding Applied ML Engineer

    Recruiting from ScratchSan Francisco, CA, United States
    Full-time
    Who is Recruiting from Scratch : .Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients. Applied Machine Learning Engineer - Audio AI.San Francisco, CA or...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Diverse LynxSan Francisco, CA, United States
    Full-time
    Experience presenting data to executives (must-have).Strong communication, written skills, and interpersonal skills (required to establish and maintain inter-departmental relationships.Experience e...Show moreLast updated: 30+ days ago
    • Promoted
    Founding AI / ML Engineer - Known

    Founding AI / ML Engineer - Known

    Matter IntelligenceSan Francisco, CA, United States
    Full-time
    Youll develop and deploy the ML systems that make Known work - from compatibility scoring and recommendations to natural language / voice interaction. Youll work closely with platform engineers to sha...Show moreLast updated: 1 day ago
    • Promoted
    Engineer - AI & ML

    Engineer - AI & ML

    EchoStarSan Mateo, CA, United States
    Full-time
    EchoStar is reimagining the future of connectivity.Our business reach spans satellite television service, live-streaming and on-demand programming, smart home installation services, mobile plans an...Show moreLast updated: 1 day ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    Trek HealthFremont, CA, United States
    Full-time
    Trek Health high energy startup that empowers provider organizations with AI-driven insights, tools, and strategies to improve payer contract reimbursement, optimize service line performance, and f...Show moreLast updated: 1 day ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    InterSourcesFremont, CA, United States
    Temporary
    Experience in AI / ML development, with focus on OpenAI services, NLPs and LLMs.Ability to fine-tune pre-trained models for custom tailored solutions. Drive AI-powered automation for testing and test-...Show moreLast updated: 1 day ago
    • Promoted
    AI ML Engineer

    AI ML Engineer

    A5 Talent FindersSan Francisco, CA, United States
    Full-time
    As a Founding AI / ML Engineer you will design and build core systems from the ground up working across the stack to ship fast reliable features. You'll lead technical architecture decisions impleme...Show moreLast updated: 1 day ago