If you're looking for an API, you can choose from your desired programming language.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
import requests
import base64
# Use this function to convert an image file from the filesystem to base64
def image_file_to_base64(image_path):
with open(image_path, 'rb') as f:
image_data = f.read()
return base64.b64encode(image_data).decode('utf-8')
# Use this function to fetch an image from a URL and convert it to base64
def image_url_to_base64(image_url):
response = requests.get(image_url)
image_data = response.content
return base64.b64encode(image_data).decode('utf-8')
api_key = "YOUR_API_KEY"
url = "https://api.segmind.com/v1/easy-animate"
# Request payload
data = {
"input_image": image_url_to_base64("https://segmind-sd-models.s3.amazonaws.com/display_images/easy-animate/easy-animate-ip.png"), # Or use image_file_to_base64("IMAGE_PATH")
"prompt": "photo of a beautiful woman in a night party. The video is of high quality, and the view is very clear. High quality, masterpiece, best quality, highres, ultra-detailed, fantastic",
"negative_prompt": "The video is not of a high quality, it has a low resolution, and the audio quality is not clear. Strange motion trajectory, a poor composition and deformed video, low resolution, duplicate and ugly, strange body structure, long and strange neck, bad teeth, bad eyes, bad limbs, bad hands, rotating camera, blurry camera, shaking camera. Deformation, low-resolution, blurry, ugly, distortion.",
"video_length": 72,
"seed": 43,
"steps": 25,
"cfg": 7,
"scheduler": "Euler",
"frame_rate": 24
}
headers = {'x-api-key': api_key}
response = requests.post(url, json=data, headers=headers)
print(response.content) # The response is the generated image
URL of the input image
Text prompt describing the desired video
Negative prompt to exclude specific details
Length of the generated video in seconds
min : 40,
max : 136
Seed for random number generation
Number of inference steps for generating the video
min : 20,
max : 50
Classifier-Free Guidance scale
min : 4,
max : 15
Scheduler for the video generation process
Allowed values:
Frame rate of the generated video
min : 20,
max : 27
To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.
Easy Animate is a state-of-the-art animation toolkit developed by the team at Alibaba PAI. The model leverages advanced deep learning frameworks to convert static images into dynamic animations with remarkable accuracy and fluidity. Easy Animate utilizes a convolutional neural network (CNN) based architecture intertwined with temporal coherence algorithms to ensure realistic and natural movement between frames.
Architecture: Easy Animate is built upon the transformer architecture, which has proven effective for various natural language processing and computer vision tasks.
Purpose: It serves as an end-to-end solution for video generation, making it suitable for applications such as video synthesis, style transfer, and creative content production.
Components: Easy Animate integrates motion modules, u-vit (a variant of the Vision Transformer), and slice-vae (for processing long videos).
Prompt: Enter a detailed description of the video you want to generate in the "Prompt" box. This should include specifics about the scene, characters, actions, and desired quality.
Input Image: Upload an image that the AI can use as a reference for generating the video. You can do this by clicking on the "Click or Drag n Drop" area and selecting the appropriate image from your files.
Seed:
If you want consistent results for repeated attempts, enter a specific seed value.
If you want to explore different possibilities, check the "Randomise Seed" box to get varying results each time.
Negative Prompt: In the "Negative Prompt" box, specify what you don’t want in the video.
Video Length: Enter the desired number of frames for the video. Remember the maximum allowed is 136 frames.
CFG Scale:
Set the CFG (Classifier-Free Guidance) scale to direct how closely the generation should follow the prompt and image reference.
Example: Enter “7” for a balanced influence.
Scheduler: Choose a scheduler type like “Euler” which might affect how the inference process is handled. Different schedulers can produce different visual styles in your video.
Frame Rate:
Enter the frames per second (fps) for your video. Remember the maximum allowed is 27 fps.
Example: Enter “24” for a smoothly animated result.
Easy Animate finds applications in various domains:
Content Creation: Content creators can use Easy Animate to produce engaging videos for social media, marketing campaigns, or educational purposes.
Artistic Expression: Artists and designers can explore style transfer and create visually appealing animations.
Research and Development: Researchers can leverage Easy Animate for experiments related to video synthesis and generative models.
SDXL Img2Img is used for text-guided image-to-image translation. This model uses the weights from Stable Diffusion to generate new images from an input image using StableDiffusionImg2ImgPipeline from diffusers
SDXL ControlNet gives unprecedented control over text-to-image generation. SDXL ControlNet models Introduces the concept of conditioning inputs, which provide additional information to guide the image generation process
InstantID aims to generate customized images with various poses or styles from only a single reference ID image while ensuring high fidelity
CodeFormer is a robust face restoration algorithm for old photos or AI-generated faces.