June 17, 2024


With the rise of digital assistants and conversational interfaces, people have grown accustomed to hearing and talking with synthetic voices. But what do these voices sound like? Often, fairly repetitive. We're all familiar with the Google Assistant voice, for instance.

That's why we're excited to announce the general availability of Custom Voice in our Cloud Text-to-Speech (TTS) API, a new feature that lets you train custom voice models with your own audio recordings to create unique experiences.

For businesses looking to build a strong brand identity, establishing a unique voice can help turn mobile app interactions or customer service based on interactive voice response (IVR) into differentiated customer experiences. Our TTS API has included a speech synthesis service with a static list of voices for some time, but now, with Custom Voice, moving beyond those predefined options is easier than ever.

Custom Voice lets you simply submit your audio recordings to get access to the new voice directly in the TTS API. Custom Voice TTS includes guidance on the audio requirements to help make sure you generate a high-quality custom TTS voice model. Once this new model is trained, all you have to do to start using the newly trained voice is reference the model ID in your calls to the Cloud TTS API.
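As a rough illustration of referencing a model ID in a synthesis call, the sketch below builds a JSON request body for the Cloud TTS REST `text:synthesize` method. The `customVoice.model` field layout, the `LINEAR16` encoding choice, the helper name `build_synthesize_request`, and the example model ID are assumptions made for illustration and are not taken from this announcement; check the current API reference before relying on them.

```python
import json

# Public REST endpoint for Cloud Text-to-Speech synthesis.
SYNTHESIZE_URL = "https://texttospeech.googleapis.com/v1/text:synthesize"


def build_synthesize_request(text, language_code, model_id):
    """Return a JSON-serializable request body that selects a custom voice.

    The field names follow the assumed REST layout described in the lead-in.
    """
    if not model_id:
        raise ValueError("model_id is required to select a custom voice")
    return {
        "input": {"text": text},
        "voice": {
            "languageCode": language_code,
            # Assumption: the trained model is referenced by its ID here.
            "customVoice": {"model": model_id},
        },
        "audioConfig": {"audioEncoding": "LINEAR16"},
    }


# Hypothetical placeholder; use the model ID you receive once training completes.
EXAMPLE_MODEL_ID = "projects/my-project/locations/us-central1/models/my-model"

body = build_synthesize_request("Welcome back!", "en-US", EXAMPLE_MODEL_ID)
print(json.dumps(body, indent=2))
```

The body would then be sent as an authenticated POST to `SYNTHESIZE_URL`. Keeping the request construction in one small function means the model ID is the only value that changes when you switch between trained voices.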

At Google, we're committed to building safe and accountable AI products, not only because it's the right thing to do, but because it's a critical step in ensuring successful use in production. As part of Google Cloud's Responsible AI governance process, we conducted a deep ethical evaluation of Custom Voice TTS, and its relation to synthetic media, in order to surface and mitigate potential harms that it could create. If you are interested in Custom Voice TTS, there's a review process to help ensure each use case is aligned with our AI Principles and adequate voice actor consent is given.

Additionally, to verify that voice actors are actually the ones producing the audio, you will need to submit an audio file in which the voice actor speaks a sentence that Google Cloud chooses (for example: "I agree that my voice will be used to create a synthetic custom Text-to-Speech voice").

We're looking forward to seeing this API help businesses solve problems in a simple, fast, and scalable way. TTS Custom Voice is now GA in these languages:

  • English (US)

  • English (AU)

  • English (UK)

  • Spanish (US)

  • Spanish (Spain)

  • French (France)

  • French (Canada)

  • Italian (Italy)

  • German (Germany)

  • Portuguese (Brazil)

  • Japanese (Japan)
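If you want to check a language before starting a request, the list above can be captured in a small lookup. This is only a sketch: the BCP-47 codes are the conventional codes for these locales and are an assumption here, since the announcement names the languages but not their codes, and `is_custom_voice_language` is a hypothetical helper.

```python
# GA languages from the list above, keyed by assumed BCP-47 codes.
GA_LANGUAGES = {
    "en-US": "English (US)",
    "en-AU": "English (AU)",
    "en-GB": "English (UK)",
    "es-US": "Spanish (US)",
    "es-ES": "Spanish (Spain)",
    "fr-FR": "French (France)",
    "fr-CA": "French (Canada)",
    "it-IT": "Italian (Italy)",
    "de-DE": "German (Germany)",
    "pt-BR": "Portuguese (Brazil)",
    "ja-JP": "Japanese (Japan)",
}


def is_custom_voice_language(language_code):
    """Return True if the code matches a listed language, ignoring case."""
    normalized = language_code.strip().lower()
    return any(code.lower() == normalized for code in GA_LANGUAGES)


print(is_custom_voice_language("pt-br"))
print(is_custom_voice_language("ko-KR"))
```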

We plan to continue expanding this lineup in order to meet your needs. Ready to try it for yourself? Contact your sales representative to get started on your use case evaluation today!

