Files up to 2MB
Fashion AI model can effectively replace a piece of clothing in an image with a new one, creating a seamless and realistic result. The fashion AI model working in two steps:
The model first identifies and separates the piece of clothing from the rest of the image. This precise segmentation allows the model to focus solely on the clothing item.
Once the clothing item is segmented, the model then inpaints the segmented image. This process is guided by a text prompt, which describes the new piece of clothing that should replace the original one.
Under the hood, the Fashion AI model is a combination of object detection, segmentation and inpainting.
Object Detection: The Grounding DINO model is used for object detection. It identifies the object of interest in the image, such as clothing. This is achieved by inputting the image and the object category into the Grounding DINO model, which uses both language and vision modalities to detect objects.
Segmentation: The Segment Anything Model (SAM) is employed for segmentation. It precisely segments elements in images based on semantic text prompts.
Inpainting: Following segmentation, a mask is created for the segmented clothing image. White pixels represent the clothing that will be inpainted, while black pixels are preserved. Based on the text prompt, a new clothing item replaces the old one. This process is known as inpainting.
Input Image: Provide an image of a person wearing clothing. This will serve as the base for the clothing replacement.
Text Prompt: Specify your desired changes to the clothing. This could include color, design, etc.
Clothing: Adjust the ‘clothing’ parameter to select the type of clothing you want to modify. This could be topwear, bottomwear, or full body wear, depending on the input image.
*Please note, if there is a mismatch between the clothing in the input image and the selected clothing type, it may result in unsatisfactory results. Always ensure that the clothing type matches the clothing present in the input image for the best results.