Talent.com
AIML - Machine Learning Engineer, Foundation Model Services
Apple Inc. • Santa Clara, CA, United States
30+ days ago
Job type
  • Full-time
Job description

AIML - Machine Learning Engineer, Foundation Models

San Francisco Bay Area, California, United States • Machine Learning and AI

Do you think differently? Are you eager to break the status quo, bold and ambitious, unafraid to take risks, and passionate about building best-in-class technology? If so, what better place to do it than Apple? At Apple, we think different and we push the boundaries of computing and intelligence. We build products that bring a smile to people’s faces. The Foundation Model Infrastructure team, within the Machine Learning Platform Technologies organization, is the backbone of Apple Intelligence. It builds the frameworks, services, and tools that power Apple’s largest foundation models on servers. Our infrastructure powers a wide range of services at Apple, including Apple Search, Apple Music, Apple TV, App Store, iMessage, Photos & Camera, Spotlight, Safari, Siri, and upcoming Apple products, serving millions of queries every day at very low latency while drawing every ounce of compute from our hardware. As part of this group, you will have the chance to bring intelligence to billions of users around the world and to make a difference in people’s lives. You will optimize language, vision, and speech models with billions of parameters using state-of-the-art technologies, and make them run at Apple scale.

Description

Work alongside the Foundation Model Research team to optimize inference for cutting-edge model architectures. Work closely with product teams to build production-grade solutions that launch models serving millions of customers in real time. Build tools to understand inference bottlenecks across different hardware and use cases. Mentor and guide engineers in the organization.

Minimum Qualifications

  • 5+ years of experience leading and driving complex, ambiguous projects.
  • Experience with high-throughput services, particularly at supercomputing scale.
  • Proficient in running applications in the cloud (AWS, Azure, or equivalent) using Kubernetes, Docker, etc.
  • Familiar with GPU programming concepts using CUDA.
  • Familiar with at least one popular ML framework, such as PyTorch or TensorFlow.

Preferred Qualifications

  • Proficient in building and maintaining systems written in modern languages (e.g., Golang, Python).
  • Familiar with fundamental deep learning architectures such as Transformers and encoder-decoder models.
  • Familiar with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, NVIDIA Triton Inference Server, etc.
  • Experience writing custom GPU kernels using CUDA or OpenAI Triton.

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and, for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.

Note: Apple benefit, compensation, and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

Apple accepts applications to this posting on an ongoing basis.

