F5-TTS
π£
2.87k
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Ask questions about images and get detailed answers
Generate images based on prompts and input images
Create AI images with FLUX LoRA styles
More advanced and challenging multi-task evaluation
Explore and compare model scores on RewardBench benchmarks
Generate Python code solutions for coding problems
Annotate and describe images with text prompts
Analyze images to detect objects, generate captions, or perform OCR
Generate captions, detections, and segmentations for any image
Chat with an AI that understands images and text
a tiny vision language model
Transcribe audio with emotions and sound events