1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
import requests
import base64
# Use this function to convert an image file from the filesystem to base64
def image_file_to_base64(image_path):
with open(image_path, 'rb') as f:
image_data = f.read()
return base64.b64encode(image_data).decode('utf-8')
# Use this function to fetch an image from a URL and convert it to base64
def image_url_to_base64(image_url):
response = requests.get(image_url)
image_data = response.content
return base64.b64encode(image_data).decode('utf-8')
api_key = "YOUR_API_KEY"
url = "https://api.segmind.com/v1/cog-video-5b-i2v"
# Request payload
data = {
"image": image_url_to_base64("https://segmind-sd-models.s3.amazonaws.com/display_images/cog_vid_i2v_ip.png"), # Or use image_file_to_base64("IMAGE_PATH")
"negative_prompt": "The video is not of a high quality, it has a low resolution. Watermark present in each frame. Strange motion trajectory.",
"prompt": "Animate this serene lakeside night scene with a flickering campfire, gentle water ripples, and subtle movements from the three people sitting around the campfire under the full moon.",
"seed": 42
}
headers = {'x-api-key': api_key}
response = requests.post(url, json=data, headers=headers)
print(response.content) # The response is the generated image
Input Image
Text to describe what should be avoided in the generated video
Text prompt for generating the video
Seed for random number generation to ensure reproducibility
To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.
CogVideoX image-to-video is a cutting-edge image-to-video generation model transforms static images into dynamic, high-quality videos, making it an essential tool for content creators, marketers, and developers.
High-Resolution Output: CogVideoX generates videos with resolutions up to 720 x 480 pixels, ensuring that the output is clear and detailed. This high resolution is ideal for professional use, including marketing materials, social media content, and educational videos.
Versatile Precision: CogVideoX supports multiple inference precisions, such as FP16, BF16, and INT8. This flexibility allows users to balance between performance and quality based on their specific requirements.
Enhanced Video Quality: The integration of VEnhancer technology ensures that the generated videos have higher resolution and better visual effects. This feature is particularly useful for creating visually appealing content that stands out.
Content Creation: Ideal for generating engaging video content from images for social media, marketing campaigns, and more.
Animation and Film: Streamline the animation process by converting storyboard images into animated sequences.
Education and Training: Create instructional videos from static diagrams and illustrations, enhancing learning experiences.