MyShell Text To Speech

MyShell's Voice Cloning and Text to Speech - Transform your audio content with realistic, personalized voices. Experience high-quality, efficient, and cost-effective audio synthesis.


API

If you're looking for an API, you can choose from your desired programming language.

POST
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 import requests import base64 # Use this function to convert an image file from the filesystem to base64 def image_file_to_base64(image_path): with open(image_path, 'rb') as f: image_data = f.read() return base64.b64encode(image_data).decode('utf-8') # Use this function to fetch an image from a URL and convert it to base64 def image_url_to_base64(image_url): response = requests.get(image_url) image_data = response.content return base64.b64encode(image_data).decode('utf-8') api_key = "YOUR_API_KEY" url = "https://api.segmind.com/v1/myshell-tts" # Request payload data = { "voice": "michael", "language": "EN_NEWEST", "text": "Did you ever hear a folk tale about a giant turtle?", "speed": 1 } headers = {'x-api-key': api_key} response = requests.post(url, json=data, headers=headers) print(response.content) # The response is the generated image
RESPONSE
image/jpeg
HTTP Response Codes
200 - OKImage Generated
401 - UnauthorizedUser authentication failed
404 - Not FoundThe requested URL does not exist
405 - Method Not AllowedThe requested HTTP method is not allowed
406 - Not AcceptableNot enough credits
500 - Server ErrorServer had some issue with processing

Attributes


voiceenum:str *

Select the name of the voice to generate output audio.

Allowed values:


languageenum:str ( default: EN_NEWEST )

Language of the text or audio

Allowed values:


textstr *

Text to be spoken or processed


speedfloat *

Speed at which the text or audio is processed

min : 0.5,

max : 2

To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.

MyShell Voice Cloning and Text-to-Speech Technology

MyShell Voice Cloning and Text-to-Speech (TTS) technology represents a significant advancement in audio synthesis. By leveraging state-of-the-art deep learning techniques, it offers exceptional realism, flexibility, and cost-effectiveness.

Key Features

  • Advanced TTS: TTS engine converts written text into natural-sounding speech, mimicking human vocal characteristics with high fidelity.

  • State-of-the-Art Voice Cloning: With just a brief voice sample, the model can accurately replicate a speaker's unique vocal identity, enabling the creation of highly personalized and realistic audio content.

  • Efficiency and Cost-Effectiveness: MyShell's technology offers substantial cost reductions compared to traditional TTS methods, making advanced audio synthesis accessible to a wider range of users and applications.

Use Cases

  • Content Creation: Generate realistic voiceovers for videos, podcasts, and audiobooks.

  • Gaming and Virtual Assistants: Develop engaging and personalized virtual characters.

  • Accessibility: Provide audio alternatives for text-based content, making it accessible to individuals with visual impairments.

  • Business and Marketing: Create branded voice experiences for advertising, customer service, and interactive campaigns.

Cookie settings

We use cookies to enhance your browsing experience, analyze site traffic, and personalize content. By clicking "Accept all", you consent to our use of cookies.