If you want to call the API directly, you can do so from your preferred programming language; the example below uses Python.
import requests
import base64

# Use this function to convert an image file from the filesystem to base64
def image_file_to_base64(image_path):
    with open(image_path, 'rb') as f:
        image_data = f.read()
    return base64.b64encode(image_data).decode('utf-8')

# Use this function to fetch an image from a URL and convert it to base64
def image_url_to_base64(image_url):
    response = requests.get(image_url)
    image_data = response.content
    return base64.b64encode(image_data).decode('utf-8')

api_key = "YOUR_API_KEY"
url = "https://api.segmind.com/v1/video-captioner"

# Request payload
data = {
    "MaxChars": 10,
    "bg_blur": False,
    "bg_color": "None",
    "color": "white",
    "font": "Poppins/Poppins-ExtraBold.ttf",
    "fontsize": 7,
    "highlight_color": "yellow",
    "input_video": "https://segmind-sd-models.s3.amazonaws.com/display_images/hallo_output.mp4",
    "kerning": -2,
    "opacity": 0,
    "right_to_left": False,
    "stroke_color": "black",
    "stroke_width": 2,
    "subs_position": "bottom75"
}

headers = {'x-api-key': api_key}
response = requests.post(url, json=data, headers=headers)
print(response.content)  # The response is the captioned video
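Rather than printing the raw bytes, you will usually want to write the response to a file. A minimal sketch for doing so, assuming the API returns the captioned video as raw MP4 bytes on success (verify the status code and content type in your own code):

# Save the captioned video returned by the API (assumes raw MP4 bytes on success)
if response.status_code == 200:
    with open("captioned_video.mp4", "wb") as f:
        f.write(response.content)
else:
    print("Request failed:", response.status_code, response.text)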
The request payload accepts the following parameters:

MaxChars: Maximum character space for subtitles. 20 is good for videos, 10 is good for reels.
bg_blur: Blur the background color of the subtitles.
bg_color: Background color of the subtitles.
color: Color of the subtitles.
font: Font for the subtitles.
fontsize: Font size. 7.0 is good for videos, 4.0 is good for reels.
highlight_color: Highlight color for the subtitles.
input_video: URL of the input video to be captioned.
kerning: Kerning (spacing between individual letters or characters) for the subtitles.
opacity: Opacity for the subtitles background.
right_to_left: Right-to-left subtitles, for right-to-left languages. Only Arial fonts are supported.
stroke_color: Stroke color for the subtitles.
stroke_width: Stroke width for the subtitles.
subs_position: Position of the subtitles.
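For instance, following the guidance above, a payload tuned for vertical reels might look like the sketch below, reusing url and headers from the Python example (the input_video URL here is a placeholder, not a real asset):

# Hypothetical payload tuned for reels: smaller font and shorter lines
reels_data = {
    "MaxChars": 10,        # 10 suits reels; 20 suits standard videos
    "fontsize": 4.0,       # 4.0 suits reels; 7.0 suits standard videos
    "color": "white",
    "highlight_color": "yellow",
    "stroke_color": "black",
    "stroke_width": 2,
    "subs_position": "bottom75",
    "input_video": "https://example.com/my_reel.mp4"  # placeholder URL
}
response = requests.post(url, json=reels_data, headers=headers)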
To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.
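A minimal sketch for reading that header after a call, using the response object from the example above:

# Check remaining credits from the response headers
remaining = response.headers.get("x-remaining-credits")
if remaining is not None:
    print("Remaining credits:", remaining)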
The Video Captioner model is engineered to revolutionize the way you handle video subtitle integration, enhancing both accessibility and viewer engagement. Leveraging state-of-the-art algorithms, this tool provides a seamless process for generating precise video captions with customized stylistic options.
Dynamic Subtitle Positioning: Configure subtitles to display at your desired position for optimal readability, with placement preferences such as bottom, top, left, right, etc.
Customizable Aesthetics: Tailor subtitle appearance with comprehensive settings including color adjustments (e.g., white subtitles, yellow highlight, black stroke), font selection (such as Poppins ExtraBold), and precise font sizing.
Background and Opacity Control: Adjust subtitle transparency and background color to ensure clarity and visibility against varied video backgrounds (see the sketch after this list).
Text Handling and Kerning: Fine-tune your text with a maximum character setting and kerning adjustments to achieve precise alignment and spacing for multilingual subtitles that adhere to right-to-left language requirements.
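These styling controls map directly onto the bg_color, bg_blur, and opacity fields of the request payload. A hedged sketch of a semi-opaque background configuration, reusing data, url, and headers from the Python example above; the specific color names and the opacity scale accepted by the API should be verified against the parameter reference:

# Start from the example payload and override the background styling
styled = dict(data)
styled.update({
    "bg_color": "black",   # assumed to be an accepted color name
    "bg_blur": True,       # blur the subtitle background
    "opacity": 50,         # assumed 0-100 scale; verify against the API docs
})
response = requests.post(url, json=styled, headers=headers)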
Educational Content: Enhance online courses, lectures, and tutorials with clear and accurate subtitles, improving comprehension and accessibility for diverse learners.
Corporate Training: Facilitate employee training programs by providing captioned videos that cater to multilingual staff and those with hearing impairments.
Social Media Marketing: Boost engagement on platforms like YouTube, Instagram, and Facebook by adding eye-catching captions to videos, ensuring content is accessible even when muted.
Film and TV Production: Streamline the post-production process by efficiently generating subtitles, enabling faster distribution across different languages and regions.
E-Learning Platforms: Offer inclusive learning experiences by integrating subtitles in courses, allowing institutions to cater to global audiences.