Loading…
Video · Tavus
Real-time conversational video AI and digital human replicas.
A developer platform for building face-to-face AI agents that see, listen, and respond in live video through its Conversational Video Interface (CVI). It also generates personalized videos at scale from digital replicas of a real person. Built on Tavus's own models — Phoenix for rendering, Raven for perception, and Sparrow for conversational timing — with the ability to plug in custom LLMs and text-to-speech.
Model support
Ships its own Phoenix/Raven/Sparrow models; CVI lets you bring a custom LLM and TTS.
Where it runs
Tags
Related in Video
Kuaishou
State-of-the-art AI video + image, with strong motion and multishot.
Kuaishou's creative studio — text- and image-to-video with convincing motion, lip-sync, and multishot sequences up to ~15s, plus image generation. A leading Runway/Sora rival.
AI insight: From short-video giant Kuaishou — its motion realism and multishot sequences make it the leading non-Western Sora rival.
Runway
Professional video AI. Gen-series models + a full editor.
End-to-end video AI platform — text-to-video, image-to-video, in-painting, motion brush, and a timeline editor. Longest-running studio in the space; default choice when the output ships to clients.
AI insight: The elder of the space — it pairs its Gen models with an actual timeline editor, which is why client work tends to land here.
Mirage
AI video editor and avatar creator for short-form, talking-head content.
An AI video app for creators that auto-edits talking-head footage — generating captions, inserting B-roll, correcting eye contact, and dubbing into other languages. Its AI Creator mode renders a talking video from a script using AI personas. Built by Mirage on its own generative-video foundation model.
AI insight: Captions is built on Mirage, its parent company's in-house UGC video foundation model, rather than wrapping third-party video generators.
MiniMax
Text- and image-to-video generation from MiniMax.
MiniMax's consumer video generator, turning text prompts and reference images into short cinematic clips with subject-reference for consistent characters. Available on the web and as iOS and Android apps. A free tier offers limited credits; subscriptions add HD output, faster generation, and commercial use.
AI insight: From MiniMax, the lab behind the MiniMax LLMs; its Hailuo video models are widely resold via APIs like fal and Replicate.
Hedra
Turn a photo and voice into talking, expressive characters.
Hedra generates lip-synced, expressive talking-character video from a single image plus audio or a script. Its Character-3 model handles facial performance and emotion, and a Live Avatars tier streams those characters in real time for conversational AI agents.
AI insight: Its Live Avatars tier streams lip-synced talking-head video in real time at $0.05/min, aimed at giving voice AI agents a face.
OpusClip
Turns long videos into viral short clips with AI captions and auto-reframing.
Repurposes long-form videos and podcasts into short, vertical clips ready for TikTok, Reels, and Shorts. The AI finds the most engaging moments, adds animated captions, reframes to keep speakers centered, and scores each clip's virality. Credits are billed per minute of source video, not per clip produced.
AI insight: Billing counts source-video minutes, not output clips — a 45-minute podcast costs 45 credits whether the AI yields 3 shorts or 20.
Submagic
Edit short-form videos 10x faster with AI.
AI video editor for short-form content that auto-generates captions in dozens of languages, removes silences, inserts B-roll, and extracts the highest-engagement clips from long videos. Upload footage or a YouTube link and get TikTok/Reels/Shorts-ready edits. Pricing is per finished video rather than per credit.
AI insight: Billed per finished video rather than per credit or source-minute, and it auto-inserts context-matched B-roll, not just captions.
Google's AI filmmaking studio — Veo video + Imagen, in one canvas.
Google Labs' creative studio for filmmakers — generate and stitch shots with Veo, craft keyframes with Imagen, and direct camera + scene with a Gemini-powered agent. Now folds in Whisk and ImageFX.
AI insight: Google's filmmaking surface over Veo and Imagen — it directs continuity across shots, and has absorbed Whisk and ImageFX.
Higgsfield AI
Cinematic AI video with camera controls — many models, one subscription.
An AI video + image studio built around cinematic camera motion and presets. Aggregates 15+ third-party models (Sora, Veo, Kling, and more) so you switch engines without switching tools.
AI insight: An aggregator, not a model-maker — one subscription fronts 15+ engines (Sora, Veo, Kling) behind cinematic camera presets.
Pika Labs
Playful AI video with Pikaffects, ingredients, and quick edits.
Pika Labs' video generator — text- and image-to-video with signature effects (Pikaffects), character ingredients, and fast iteration. Popular for social-native, fun clips.
AI insight: Leans into playful effects (Pikaffects) over photoreal output — it competes on shareable fun, not cinematic fidelity.