Talent.com
No se aceptan más aplicaciones
Research Engineer Model Optimization

Research Engineer Model Optimization

World LabsSan Francisco, CA, United States
Hace 6 días
Tipo de contrato
  • A tiempo completo
Descripción del trabajo

Overview

At World Labs, our mission is to revolutionize artificial intelligence by developing Large World Models, taking AI beyond language and 2D visuals into the realm of complex 3D environments, both virtual and real. We're the team that's envisioning a future where AI doesn't just process information but truly understands and interacts with the world around us.

We're looking for the overachievers, the visionaries, and the relentless innovators who aren't satisfied with the status quo. You know that person who's always dreaming up the next big breakthrough? That's us. And we want you to be part of it.

As a Model Optimization Engineer at World Labs, you will play a pivotal role in enhancing the performance and efficiency of large-scale foundation models. You will collaborate with Research Scientists to design, implement, and optimize systems that leverage thousands of GPUs to train state-of-the-art AI models. This position offers a unique opportunity to work at the forefront of AI technology, tackling complex challenges in PyTorch, CUDA, and distributed systems. Ensure efficient implementation of models and systems for data processing, training, inference, and deployment. Identify and implement optimization techniques for massively parallel and distributed systems. Profile and resolve efficiency bottlenecks (memory, speed, utilization) by developing high-performance CUDA, Triton, C++, and PyTorch code. Collaborate closely with the research team to integrate efficiency into system designs from the outset. Build tools to visualize, evaluate, and filter datasets to ensure quality and readiness for training. Implement cutting-edge prototypes and product features based on multimodal generative AI.

Key Qualifications :

  • Programming Expertise :

Proficiency in Python and PyTorch with practical experience across the development pipeline : data processing, preparation, training, and inference.

  • Experience optimizing and deploying inference workloads with a focus on throughput and latency.
  • Skilled in profiling CPU and GPU code using tools such as Nvidia Nsight.
  • Parallel and Distributed Systems :
  • Experience writing and improving parallel and distributed PyTorch code using techniques like DDP, FSDP, or Tensor Parallel.

  • Familiarity with high-performance parallel C++ in machine learning contexts (e.g., for data loading and processing).
  • Proficiency in Triton, CUDA, and writing custom PyTorch kernels, including tensor core utilization and memory optimization.
  • Model Knowledge :
  • Understanding of deep learning architectures such as Transformers, Diffusion Models, and GANs.

  • Experience building prototype applications using tools like Gradio and Docker.
  • Preferred Qualifications

  • Strong foundation in distributed computing and experience with large-scale AI training systems.
  • Familiarity with multimodal generative models and emerging AI paradigms.
  • Hands-on experience with dataset curation and visualization tools.
  • passion for collaborating with research teams to translate cutting-edge concepts into real-world solutions.
  • Who You Are :

    Fearless Innovator : We need people who thrive on challenges and aren't afraid to tackle the impossible.

    Resilient Builder : Impacting Large World Models isn't a sprint; it's a marathon with hurdles. We're looking for builders who can weather the storms of groundbreaking research and come out stronger.

    Mission-Driven Mindset : Everything we do is in service of creating the best spatially intelligent AI systems, and using them to empower people.

    Collaborative Spirit : We're building something bigger than any one person. We need team players who can harness the power of collective intelligence.

    We're hiring the brightest minds from around the globe to bring diverse perspectives to our cutting-edge work. If you're ready to work on technology that will reshape how machines perceive and interact with the world - then World Labs is your launchpad.

    Join us, and let's make history together.

    #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Research Engineer • San Francisco, CA, United States

    Ofertas relacionadas
    • Oferta promocionada
    Machine Learning Research Scientist / Engineer, Reasoning

    Machine Learning Research Scientist / Engineer, Reasoning

    Scale AI, Inc.San Francisco, CA, United States
    A tiempo completo
    At Scale AI, our mission is to accelerate the development of AI applications.For 8 years, Scale has been the leading AI data foundry, fueling the most exciting advancements in AI, including generat...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Research Engineer

    Research Engineer

    TranzealSanta Clara, CA, United States
    A tiempo completo
    Our Systems Technology group, a part of Roche Sequencing Solutions, is focused on creating and advancing technologies to significantly enhance DNA sequencing workflows, push system performance, and...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Machine Learning Research Engineer - Robotics

    Machine Learning Research Engineer - Robotics

    Scale AI, Inc.San Francisco, CA, United States
    A tiempo completo
    Scale's Robotics business unit is dedicated to solving the data bottleneck in Physical AI.This position will be a key contributor in conducting applied research in Robotics and developing ML pipeli...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Machine Learning Research Engineer, Agents - Enterprise GenAI

    Machine Learning Research Engineer, Agents - Enterprise GenAI

    Scale AI, Inc.San Francisco, CA, United States
    A tiempo completo
    AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, ...Mostrar másÚltima actualización: hace 9 días
    • Oferta promocionada
    Machine Learning Research Engineer, Enterprise ML Systems

    Machine Learning Research Engineer, Enterprise ML Systems

    Scale AI, Inc.San Francisco, CA, United States
    A tiempo completo
    AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, ...Mostrar másÚltima actualización: hace 12 días
    • Oferta promocionada
    Research Engineer

    Research Engineer

    eGainSunnyvale, CA, United States
    A tiempo completo
    Fortune 500 clients and government agencies trust eGain AI knowledge solution to improve customer experience and reduce cost of service. Top rated by Gartner, eGain AI Knowledge Hub orchestrates AI ...Mostrar másÚltima actualización: hace 4 días
    • Oferta promocionada
    Research Engineer

    Research Engineer

    Orbifold AILos Altos, CA, United States
    A tiempo completo
    Get AI-powered advice on this job and more exclusive features.This range is provided by Orbifold AI.Your actual pay will be based on your skills and experience talk with your recruiter to learn mor...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Sr. Research and Development Technician, Advanced Development

    Sr. Research and Development Technician, Advanced Development

    Calyxo, Inc.Pleasanton, CA, United States
    A tiempo completo
    The company was founded in 2016 to address the profound need for improved kidney stone treatment.Kidney stone disease is a common, painful condition that consumes vast amounts of healthcare resourc...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Research Engineer, World Models

    Research Engineer, World Models

    1X Technologies ASPalo Alto, CA, United States
    A tiempo completo
    Target start date : immediately.Since its founding in 2015, 1X has been at the forefront of developing advanced humanoid robots designed for household use. Our mission is to create an abundant supply...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Research Engineer, Foundation Model Training Infrastructure

    Senior Research Engineer, Foundation Model Training Infrastructure

    NVIDIASanta Clara, CA, United States
    A tiempo completo
    NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the Generalist Embodied Agent Research (G...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Research Engineer - Foundation Models, Ads Integrity

    Research Engineer - Foundation Models, Ads Integrity

    Tik TokSan Jose, CA, United States
    A tiempo completo
    Be among the first 25 applicants.Our Business Integrity team has a strong user focus and a dedication to technical excellence. We aim to meet our users needs with reliable and high-performing platfo...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Research Engineer

    Research Engineer

    MaticMountain View, CA, United States
    A tiempo completo
    At Matic, we're on a mission to recapture that lost time, and we're doing it by revolutionizing home robotics.Our first product, also called Matic, is a Wall-E-esque floor cleaning robot.We've buil...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Research Engineer (Molecule Design Platform)

    Research Engineer (Molecule Design Platform)

    GenBio AIPalo Alto, CA, United States
    A tiempo completo
    Headquartered in Silicon Valley, we are a newly established start-up, where a collective of visionary scientists, engineers, and entrepreneurs are dedicated to transforming the landscape of biology...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Research Engineer, Scaling

    Research Engineer, Scaling

    1X Technologies ASPalo Alto, CA, United States
    A tiempo completo
    We're an AI and robotics company based in Palo Alto, California, on a mission to build a truly abundant society through general-purpose robots capable of performing any kind of work autonomously.We...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI, Inc.San Francisco, CA, United States
    A tiempo completo
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Machine Learning Research Scientist / Research Engineer, Post-Training

    Machine Learning Research Scientist / Research Engineer, Post-Training

    Scale AI, Inc.San Francisco, CA, United States
    A tiempo completo
    Scale works with the industry's leading AI labs to provide high quality data and accelerate progress in GenAI research.We are looking for Research Scientists and Research Engineers with expertise i...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Research Engineer - Evaluations

    Research Engineer - Evaluations

    Luma AIPalo Alto, CA, United States
    A tiempo completo
    Luma is pushing the boundaries of generative AI, building tools that redefine how visual content is created.We're seeking a Research Engineer to design and scale the infrastructure that powers our ...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Research Engineer, Autonomy

    Research Engineer, Autonomy

    1x.techPalo Alto, CA, United States
    A tiempo completo
    Were an AI and robotics company based in Palo Alto, California, on a mission to build a truly abundant society through general?purpose robots capable of performing any kind of work autonomously.We ...Mostrar másÚltima actualización: hace 2 días