Talent.com
Research Intern - Audio-Visual VoiceAI (Open Source)

Research Intern - Audio-Visual VoiceAI (Open Source)

WhissleAISan Mateo, CA, US
16 hours ago
Job type
  • Full-time
Job description

Research Intern – Audio-Visual VoiceAI (Open Source)

We're looking for a Research Intern to join WhissleAI and help advance our open-source work at the intersection of speech, vision, and structured understanding — inspired by projects like

  • https : / / music.whissle.ai /
  • advanced speech recognition asr.whissle.ai
  • and recent multi-modal alignment research (example : https : / / aclanthology.org / 2025.emnlp-main.845.pdf)

You'll work on developing audio-visual foundation models that connect voice, context, and environment — enabling systems that can listen, see, and act coherently in real time. Most of this work is open-source and contributes directly to the broader research community.

Ideal candidate

  • Undergrad, Master's, or PhD student in CS, AI, or related field
  • Prior research experience (conference / workshop publications a plus)
  • Strong background in one or more of : multimodal learning, audio-visual representation learning, speech modeling, or self-supervised methods
  • Experience with PyTorch, Hugging Face, or similar frameworks
  • What you'll do

  • Prototype and evaluate audio-visual alignment models
  • Extend our open-source ASR and meta-speech pipelines
  • Collaborate on papers, demos, and real-time VoiceAI applications
  • Location : Remote

  • Type : Paid internship / research collaboration
  • Create a job alert for this search

    Research Intern • San Mateo, CA, US

    Related jobs
    • Promoted
    • New!
    Research Intern - Audio-Visual VoiceAI (Open Source)

    Research Intern - Audio-Visual VoiceAI (Open Source)

    WhissleAISunnyvale, CA, US
    Full-time
    Research Intern – Audio-Visual VoiceAI (Open Source).We're looking for a Research Intern to join WhissleAI and help advance our open-source work at the intersection of speech, vision, and structure...Show moreLast updated: 16 hours ago
    Senior Research Engineer Multimodal & Video Foundation Model (100% Remote)

    Senior Research Engineer Multimodal & Video Foundation Model (100% Remote)

    Tether Operations LimitedSan Francisco, CA, US
    Remote
    Full-time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Show moreLast updated: 30+ days ago
    • Promoted
    Associate Director, Global Medical Information (Remote)

    Associate Director, Global Medical Information (Remote)

    Jazz PharmaceuticalsRedwood City, California, USA
    Remote
    Full-time
    If you are a current Jazz employee please apply via the Internal Career site.Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and ...Show moreLast updated: 3 days ago
    • Promoted
    Writer / Journalist Internship Part-Time in Worldwide - Remote Worldwide

    Writer / Journalist Internship Part-Time in Worldwide - Remote Worldwide

    The Borgen ProjectVallejo, CA, United States
    Remote
    Part-time +1
    Are you passionate about making a difference in the world? Look no further! The Borgen Project is an international organization that works at the political level to improve living conditions for pe...Show moreLast updated: 11 days ago
    • Promoted
    Earn Money Playing Games, Answering Surveys & Testing Apps

    Earn Money Playing Games, Answering Surveys & Testing Apps

    AttaPollMill Valley, CA, US
    Full-time
    Get paid to answer surveys, play games, test apps, and do tasks.Make extra money on your phone and instantly cash out to PayPal, Revolut, or with gift cards from $3.Show moreLast updated: 4 days ago
    • Promoted
    Senior Research Engineer - Multimodal & Video Foundation Model (Remote)

    Senior Research Engineer - Multimodal & Video Foundation Model (Remote)

    Tether Operations LimitedSan Francisco, CA, United States
    Remote
    Full-time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re pioneering a global financial revolution with solutions that empower businesses—from exchanges and wallets to payment processors...Show moreLast updated: 30+ days ago
    • Promoted
    Chemical, Biological, Radiological, and Nuclear Specialist

    Chemical, Biological, Radiological, and Nuclear Specialist

    United States ArmyMoss Beach, CA, United States
    Permanent
    As a Chemical, Biological, Radiological, and Nuclear Specialist, you’ll protect the country against the threat of CBRN weapons of mass destruction, and you’ll decontaminate hazardous material spill...Show moreLast updated: 2 days ago
    Research Intern (Deep Learning), 2026 Spring (Master / PhD)

    Research Intern (Deep Learning), 2026 Spring (Master / PhD)

    pony.aiFremont, CA, US
    Full-time
    Quick Apply
    Founded in 2016 in Silicon Valley, Pony.Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony. CNBC Disruptor list of the 50 most innovative and disruptive tech comp...Show moreLast updated: 11 days ago
    • Promoted
    Signals Intelligence Voice Interceptor - Up To $22.5k Signing Bonus

    Signals Intelligence Voice Interceptor - Up To $22.5k Signing Bonus

    United States ArmyMenlo Park, CA, United States
    Full-time
    As a Signals Intelligence Voice Interceptor, you’ll use your skills as a linguist to identify, categorize, translate, and summarize foreign language communications from a specific location.You’ll a...Show moreLast updated: 2 days ago
    • Promoted
    Community Intern

    Community Intern

    MeshySunnyvale, CA, US
    Full-time
    Headquartered in the Silicon Valley, Meshy is the leading 3D generative AI company on a mission to.Meshy makes it effortless for both professional artists and hobbyists to create unique 3D assetstu...Show moreLast updated: 28 days ago
    • Promoted
    Technical Advisor Intern - GenAI (Winter / Spring 2026)

    Technical Advisor Intern - GenAI (Winter / Spring 2026)

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    The Scale AI Technical Advisor Internship is a summer semester opportunity designed for university students, ideally with experience in competitive coding, mathematics, and STEM disciplines.Interns...Show moreLast updated: 30+ days ago
    • Promoted
    Cryptologic Linguist - Up To $22.5k Signing Bonus

    Cryptologic Linguist - Up To $22.5k Signing Bonus

    United States ArmyMontara, CA, United States
    Full-time
    As a Signals Intelligence Voice Interceptor, you’ll use your skills as a linguist to identify, categorize, translate, and summarize foreign language communications from a specific location.You’ll a...Show moreLast updated: 2 days ago
    Applied Research Internship - Computer Vision

    Applied Research Internship - Computer Vision

    Geomagical LabsPalo Alto, CA, US
    Remote
    Temporary +1
    Quick Apply
    Geomagical Labs is a 3D R&D lab, in partnership with IKEA.We create magical mixed-reality experiences for hundreds of millions of users, using computer vision, neural networks, graphics, and co...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Research Intern (Summer 2026)

    Machine Learning Research Intern (Summer 2026)

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Machine Learning Research team is focused on building the foundation for the next generation of AI systems-pushing the boundaries of what's possible with frontier models while ensuring safety, reli...Show moreLast updated: 30+ days ago
    • Promoted
    Intern, Quantum Solutions (PDEs)

    Intern, Quantum Solutions (PDEs)

    PsiQuantumPalo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Research Intern – Audio-Visual VoiceAI (Open Source)

    Research Intern – Audio-Visual VoiceAI (Open Source)

    WhissleAISan Jose, CA, United States
    Full-time
    Research Intern – Audio-Visual VoiceAI (Open Source).We’re looking for a Research Intern to join WhissleAI and help advance our open-source work at the intersection of speech, vision, and structure...Show moreLast updated: 21 hours ago
    • Promoted
    UX Quantitative Research Intern

    UX Quantitative Research Intern

    PinterestSan Francisco, California, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show moreLast updated: 4 days ago
    Senior Research Engineer Multimodal & Video Foundation Model (Remote)

    Senior Research Engineer Multimodal & Video Foundation Model (Remote)

    Tether Operations LimitedSan Francisco, CA, US
    Remote
    Full-time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Show moreLast updated: 30+ days ago