1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
import requests
import base64
# Use this function to convert an image file from the filesystem to base64
def image_file_to_base64(image_path):
with open(image_path, 'rb') as f:
image_data = f.read()
return base64.b64encode(image_data).decode('utf-8')
# Use this function to fetch an image from a URL and convert it to base64
def image_url_to_base64(image_url):
response = requests.get(image_url)
image_data = response.content
return base64.b64encode(image_data).decode('utf-8')
api_key = "YOUR_API_KEY"
url = "https://api.segmind.com/v1/myshell-tts"
# Request payload
data = {
"voice": "michael",
"language": "EN_NEWEST",
"text": "Did you ever hear a folk tale about a giant turtle?",
"speed": 1
}
headers = {'x-api-key': api_key}
response = requests.post(url, json=data, headers=headers)
print(response.content) # The response is the generated image
Select the name of the voice to generate output audio.
Allowed values:
Language of the text or audio
Allowed values:
Text to be spoken or processed
Speed at which the text or audio is processed
min : 0.5,
max : 2
To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.
MyShell Voice Cloning and Text-to-Speech (TTS) technology represents a significant advancement in audio synthesis. By leveraging state-of-the-art deep learning techniques, it offers exceptional realism, flexibility, and cost-effectiveness.
Advanced TTS: TTS engine converts written text into natural-sounding speech, mimicking human vocal characteristics with high fidelity.
State-of-the-Art Voice Cloning: With just a brief voice sample, the model can accurately replicate a speaker's unique vocal identity, enabling the creation of highly personalized and realistic audio content.
Efficiency and Cost-Effectiveness: MyShell's technology offers substantial cost reductions compared to traditional TTS methods, making advanced audio synthesis accessible to a wider range of users and applications.
Content Creation: Generate realistic voiceovers for videos, podcasts, and audiobooks.
Gaming and Virtual Assistants: Develop engaging and personalized virtual characters.
Accessibility: Provide audio alternatives for text-based content, making it accessible to individuals with visual impairments.
Business and Marketing: Create branded voice experiences for advertising, customer service, and interactive campaigns.