Overview
We’re hiring voice talents to record 60 minutes of scripted audio demonstrating a range of emotions (e.g., excitement, contempt, confusion) and speaking styles (e.g., narrative, whispered, reading to children). Recordings will be used to train text-to-speech (TTS) models. There is no public broadcast or advertising usage, and nothing will be created in your likeness.
Project Details
- Length : 1 hour of finished speech (solo; no dialogues). Fully scripted; scripts provided.
- Usage : Internal model training / evaluation only. No public-facing usage and no voice cloning.
- Script quality : Scripts are machine-translated; please read as written to the best of your ability without letting translation quality compromise a natural, believable delivery.
Artistic Direction
Aim for genuine, natural performances that do not feel performed or stiff.Maintain clear, confident delivery and correct pronunciation.Technical / Audio Requirements
Format : WAV, stereo, 16-bitSample rate : 48 kHzQuality : Very clean audio (noise floor ≤ −60 dB), no reverb, no clipping, minimal plosives.Processing : Do not apply denoisers, EQ, compression, normalization, or any plugins—raw audio only.Revisions
Include up to 2 rounds of revisions due to talent error per script.To Apply
Submit your audition with your proposal using the attached script in the language you are applying for.
#J-18808-Ljbffr