Svd Keyframe Interpolation
🐨
66
Generate a smooth video between two keyframe images
Replace objects in images using prompts or reference images
Generate depth map from a single image
Segment objects in images using text prompts or scribbles
Predict depth map from a single image
Generate audio from text using VITS model
Generate anime-style speech in Japanese, Chinese, and English
Restore and enhance faces in photos
Transcribe audio to text with speaker diarization