POST
import requests
import base64

# Use this function to convert a file from the filesystem to base64
def image_file_to_base64(image_path):
    with open(image_path, 'rb') as f:
        image_data = f.read()
    return base64.b64encode(image_data).decode('utf-8')

# Use this function to fetch a file from a URL and convert it to base64
def image_url_to_base64(image_url):
    response = requests.get(image_url)
    image_data = response.content
    return base64.b64encode(image_data).decode('utf-8')

api_key = "YOUR_API_KEY"
url = "https://api.segmind.com/v1/openvoice"

# Request payload
data = {
    "input_audio": "https://segmind-sd-models.s3.amazonaws.com/display_images/openvoice-ip.mp3",
    "language": "EN_NEWEST",
    "speed": 1,
    "text": "Did you ever hear a folk tale about a giant turtle?"
}

headers = {'x-api-key': api_key}

response = requests.post(url, json=data, headers=headers)
print(response.content)  # The response is the generated audio
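Because the endpoint returns binary audio rather than text, in practice you will usually write the response body to a file instead of printing it. A minimal sketch using the response object from the snippet above (the output filename and .wav extension are assumptions; check the Content-Type response header for the actual format):

if response.ok:
    # Persist the raw audio bytes returned by the API
    with open("output.wav", "wb") as f:
        f.write(response.content)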
RESPONSE
audio/wav
HTTP Response Codes
200 - OK : Audio generated
401 - Unauthorized : User authentication failed
404 - Not Found : The requested URL does not exist
405 - Method Not Allowed : The requested HTTP method is not allowed
406 - Not Acceptable : Not enough credits
500 - Server Error : Server had some issue with processing
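
These status codes can be checked directly on the response object from the snippet above. A minimal sketch of branching on the documented codes (the printed messages are illustrative):

if response.status_code == 200:
    print("Audio generated successfully")
elif response.status_code == 401:
    print("Authentication failed: check your x-api-key header")
elif response.status_code == 406:
    print("Not enough credits in your account")
else:
    print(f"Request failed with status {response.status_code}")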

Attributes


input_audio  str *

Input reference audio (5-120 seconds) of a person speaking, used for training an audio model to capture voice characteristics


language  enum:str ( default: EN_NEWEST )

The language of the audio to be generated. British English, American English, Indian English, French, Chinese, Japanese, and Korean are supported.


speed  float *

Speed at which the output audio is generated (see the payload sketch after this attribute list)

min: 0.5, max: 2


text  str *

Text to be spoken or processed
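
To show how these attributes fit together, here is a minimal sketch that builds and validates a request payload before sending it. The build_payload helper and its error messages are illustrative, not part of the API:

def build_payload(input_audio, text, language="EN_NEWEST", speed=1.0):
    # Enforce the documented speed range of 0.5 to 2
    if not 0.5 <= speed <= 2:
        raise ValueError("speed must be between 0.5 and 2")
    if not text:
        raise ValueError("text is required")
    return {
        "input_audio": input_audio,  # reference audio, 5-120 seconds
        "language": language,        # e.g. the default EN_NEWEST
        "speed": speed,
        "text": text,
    }

data = build_payload(
    input_audio="https://segmind-sd-models.s3.amazonaws.com/display_images/openvoice-ip.mp3",
    text="Did you ever hear a folk tale about a giant turtle?",
)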

To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.
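
For example, the header can be read from the same response object used in the snippet above; requests exposes headers through a case-insensitive mapping, so the lookup below works regardless of capitalization:

remaining = response.headers.get("x-remaining-credits")
if remaining is not None:
    print(f"Remaining credits: {remaining}")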

OpenVoice Model: Instant Voice Cloning with Multi-Lingual Support

The OpenVoice model is a state-of-the-art voice cloning technology developed by MyShell and MIT. This versatile model excels in replicating the tone and style of a reference speaker’s voice using just a short audio clip. OpenVoice supports multiple languages, including English, Spanish, French, Chinese, Japanese, and Korean, making it a powerful tool for global applications.

Key Features of OpenVoice

  • Accurate Tone Color Cloning: OpenVoice can precisely replicate the reference speaker’s tone, ensuring high fidelity in voice cloning.

  • Flexible Voice Style Control: Users can adjust various voice style parameters such as emotion, accent, rhythm, pauses, and intonation.

  • Zero-Shot Cross-Lingual Voice Cloning: OpenVoice can generate speech in languages not present in the training dataset, offering unparalleled flexibility.

  • High-Quality Audio Output: The model adopts advanced training strategies to deliver superior audio quality.

  • Free for Commercial Use: Both OpenVoice V1 and V2 are released under the MIT License, allowing free commercial use.

Use Cases

  • Media Content Creation: Enhance videos, podcasts, and other media with high-quality voiceovers.

  • Interactive AI Interfaces: Improve the user experience in chatbots and virtual assistants with natural-sounding voices.

  • Voice Preservation: Preserve the voice of loved ones or historical figures for future generations.