Voxtral TTS Demo
Generate realistic speech from text with custom or preset voices
Generate realistic speech from text with custom or preset voices
Transcribe audio clips to text in many languages
Run Cohere Transcribe locally in your browser on WebGPU.
Chat with a Victorianโera language model chatbot
Generate audio for a video using a text prompt
World-first embodied AI world model
VFig converts any diagram image into editable SVG code.
Portrait animation & lipsync with LTX 2.3
generate a video from an image with a text prompt
FireRed-Image-Edit ร Qwen-Image-Edit-Rapid (Transformers)
Adjust the camera angle of your photo
Turn any image into a DLSS 5 meme (using FLUX.2-klein-9b-kv)
Generate images from text prompts in seconds
Chat with a multimodal AI using text, images, audio, or video
Generate realistic speech from text with custom or preset voices
text to video, image to video, video extend
Image edit, text to image, image upscale, remove watermark
Demo of the Collection of Qwen Image Edit LoRAs
generate a video from an image with a text prompt
Run Cohere Transcribe locally in your browser on WebGPU.
Transcribe audio clips to text in many languages
High-quality voice cloning TTS for 600+ languages
Chat with a multimodal AI using text, image, audio, or video
Generate speech from text with custom voice, cloning, or presets
Chat with a Victorianโera language model chatbot
Generate high-quality motions from text prompts
High-fidelity 3D Generation from images
Run Gemma 4 locally in-browser on WebGPU w/ Transformers.js
Create cinematic videos with audio from text prompts
Portrait animation & lipsync with LTX 2.3
Generate short videos from an image and text prompt
Embedding Leaderboard