Talent.com
Research Engineer, Audio Dialogue Job at Google DeepMind in New York

Mediabistro, New York, NY, United States
10 hours ago
Job type
  • Full-time
Job description

Mountain View, California, US; New York City, New York, US

Snapshot

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

Who are we?

Real-Time Dialog team in GDM Audio

Our mission

We are building the next generation of conversational capabilities, powered by the Gemini LLM. Our mission is to empower multimodal conversational agents with groundbreaking speech and audio capabilities. By starting with LLMs that natively understand rich audio input, we aim to create agents that can orchestrate all aspects of a dialog. This includes knowing when to listen and wait, when to interrupt, reading the emotive style, and coordinating complex multimodal interactions that span audio, video, and text. As part of the Gemini Audio team, we create, scale, and productionize novel capabilities in the core Gemini model, impacting many product areas across Google.

Team Objectives:

  • Create real-time dialog capabilities for Gemini agents that seamlessly span audio, video, and text modalities.
  • Pioneer end-to-end ML / Gemini architectures that streamline the dialog process, minimizing the need for complex model cascades.
  • Directly impact Gemini core model development by setting the direction for real-time dialog capabilities across pre-training and post-training.
  • Collaborate on deployments in Gemini Live (GL), Cloud, XR (Glasses), Astra, and other product areas pushing the boundaries of multimodal interaction research.
  • Scale core modeling advancements to meet product requirements / model families, including latency, safety, and factuality.

Job responsibilities

As a Senior Research Engineer, you will:

  • Lead efforts to produce more natural and capable real-time dialog agents
  • Collaborate closely with other teams on areas like reasoning, function calling, and multi-agent frameworks
  • Partner with product teams to design, develop, and deploy novel multimodal conversational agents.
  • Create new data pipelines covering both real and synthetic sources, influencing pre-training and post-training (SFT & RL)
About You

In order to set you up for success as a Research Engineer at Google DeepMind, we look for the following skills and experience:

  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
  • Significant industry experience building and deploying Speech / ML models.
  • Demonstrated experience in data preparation, training, and evaluation of ML models.
  • 5 years of experience with software development in Python, especially with ML frameworks such as TensorFlow, JAX, or PyTorch.

In addition, the following would be an advantage:

  • Ph.D. in Computer Science or a related field.
  • Experience with multimodal foundation models.
  • Experience with real-time multimodal dialog systems.
  • Hands-on experience with the Gemini models (data processing, training, SFT, RL, serving).
  • Research background in NLP / Generative AI
  • Experience with C++

The US base salary range for this full-time position is between $197,000 and $291,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

U.S. Standard Demographic Questions

Google DeepMind is subject to certain governmental recordkeeping and reporting requirements for the administration of civil rights laws and regulations. In order to comply with these laws and achieve our goal of a diverse and inclusive workforce, Google DeepMind invites employees to voluntarily self-identify their race or ethnicity. Submission of this information is voluntary and refusal to provide it will not subject you to any adverse treatment. The information obtained will be kept confidential and may only be used in accordance with the provisions of applicable laws, executive orders, and regulations, including those that require the information to be summarized and reported to the federal government for civil rights enforcement. When reported, data will not identify any specific individual. If you'd like more information about your EEO rights as an applicant under the law, please see https://www.eeoc.gov/employers/eeo-law-poster.

