Snapshot
Help us build generative models of the 3D world. World models power numerous domains, such as media generation, visual reasoning, simulation, planning for embodied agents, and real-time interactive experiences. Work with us to build better versions of Gemini, Genie, and Veo, while also exploring new, spatial modalities beyond images and videos.
The Role
Key responsibilities : Conduct research to build generative multimodal models of the 3D world. Solve essential problems to train world models at massive scale : build and train large-scale systems for data annotation, curate and annotate training datasets, build and maintain large model training infrastructure, develop scaling ladders and training recipes, develop metrics for spatial intelligence, enable real-time interactive experiences, study the integration of spatial modalities with multimodal language models, and of course : actually train massive-scale models.
Areas of focus :
About you
We seek individuals who are passionate about large-scale generative models and believe spatial understanding and generation are on the path to intelligence. We strive for simple methods that scale and look for candidates excited to improve models through infrastructure, data, evals, and compute.
In order to set you up for success as a Research Scientist / Engineer at Google DeepMind, we look for the following skills and experience :
In addition, the following would be an advantage :
#J-18808-Ljbffr
Research Scientist • New York, NY, United States