Omnia
Prompt Guide
What is Omnia?
Omnia is Hedra's image-to-video model designed for creating realistic human motion and dynamic scenes. Unlike text-to-video models, Omnia transforms a single image into fluid, natural video with precise control over movement, camera, and background elements.
Natural Human Motion
Industry-leading lip sync and body movement that maintains identity consistency throughout the video
Camera Control
Precise camera movements including orbits, push-ins, tracking shots, and dynamic angles
Background Dynamics
Add dramatic elements like explosions, weather effects, or fantastical creatures to any scene
Video Duration
Omnia generates videos up to 8 seconds long. That length suits short-form social content and keeps quality consistent across the whole clip.
Prompt Structure
Omnia prompts work best when they're concise and focused. A good prompt combines camera motion, subject action, and background elements to create dynamic, engaging video.
Camera Motion
Describes how the virtual camera moves through the scene. This creates cinematic depth and visual interest.
Subject Action
What the person or subject physically does. Omnia excels at natural human movements and expressions.
Background Elements
Dynamic elements that happen around the subject. Great for adding drama, atmosphere, or surreal effects.
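As a rough illustration of the three-part structure above, the sketch below assembles a prompt from separate camera, action, and background fields. The `OmniaPrompt` class and its field names are hypothetical helpers for organizing text, not part of any Hedra API; Omnia itself only receives the final string.

```python
from dataclasses import dataclass


@dataclass
class OmniaPrompt:
    """Hypothetical container for the three prompt parts described above."""
    camera_motion: str
    subject_action: str
    background: str = ""  # optional; leave empty for a plain scene

    def render(self) -> str:
        # Join the non-empty parts with commas and end with a period.
        parts = [self.camera_motion, self.subject_action, self.background]
        text = ", ".join(p.strip() for p in parts if p.strip())
        return text + "."


prompt = OmniaPrompt(
    camera_motion="Cinematic slow push-in",
    subject_action="woman standing in rain",
    background="neon lights reflecting",
)
print(prompt.render())
```

Keeping the parts separate makes it easy to swap one element, for example the camera motion, while leaving the rest of the prompt unchanged.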
Use Cases & Examples
Podcast & Interview
Perfect for talking head content where the subject speaks directly to camera.
Stationary tripod shot with subtle push-in on podcast host gesturing while speaking, studio lighting.
Medium shot of host laughing on leather sofa, soft room ambience, natural hand movements.
Social Media & UGC
Selfie-style content that feels authentic and relatable. Handheld camera motion adds energy.
Handheld selfie shot, woman talking excitedly to camera, walking on city street, natural movement.
POV selfie angle, subject winking and smiling, slight camera shake, urban background.
Cinematic & Film
Dramatic scenes with bold camera movements and atmospheric effects.
Cinematic slow push-in on woman in rain, neon lights reflecting, dramatic atmosphere.
Orbit around warrior, burning ruins in background, smoke filling the scene dramatically.
Product & Commercial
Showcase products with authentic presenter style and natural movements.
Close-up of hands holding product, gentle handheld movement, soft lighting, product in focus.
Handheld selfie shot of influencer holding skincare bottle, talking enthusiastically, bathroom setting.
Creative & Experimental
Push creative boundaries with dramatic motion and surreal elements.
Woman walking confidently across zebra crossing, massive explosion erupts behind her.
Dramatic zoom into subject's face, dragon flies overhead in the background.
Audio & Voice
Audio is essential for bringing Omnia videos to life. The model synchronizes lip movements with your audio, creating natural-looking speech.
Voice Options
AI Voices: Access a library of high-quality AI voices with different accents, ages, and styles. Choose voices that match your subject's appearance for the most natural results.
Upload Your Own: For maximum authenticity, upload your own voice recordings. This is ideal for branded content, personal projects, or when you need a specific voice that matches the subject exactly.
Duration
Maximum length: Audio must be 8 seconds or less, matching the maximum supported duration for Omnia videos.
Pacing: Natural speech patterns work best. Avoid rushing through dialogue; let the character breathe between sentences.
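One way to catch over-long clips before uploading is to measure the file locally. The sketch below assumes the recording is an uncompressed WAV file, since it relies on Python's standard `wave` module; other formats would need a different reader. The function names and the `MAX_AUDIO_SECONDS` constant are illustrative, not part of Omnia.

```python
import wave

MAX_AUDIO_SECONDS = 8.0  # limit stated in this guide


def wav_duration_seconds(path: str) -> float:
    """Return the length of an uncompressed WAV file in seconds."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def fits_omnia_limit(duration_seconds: float) -> bool:
    """True when the clip is non-empty and within the 8-second limit."""
    return 0 < duration_seconds <= MAX_AUDIO_SECONDS
```

A typical check would be `fits_omnia_limit(wav_duration_seconds("voiceover.wav"))`, where `voiceover.wav` is a placeholder file name.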
Voice Matching
Match the character: Select a voice that aligns with the subject's apparent age, gender, and personality.
Consider the context: Podcast hosts tend to sound energetic, narrators calm and authoritative, and product reviewers enthusiastic. Pick a voice that fits the role.
Regional accents: If the image suggests a specific region or culture, consider matching the voice accent for authenticity.
Best Practices
Do
- Keep prompts concise—under 25 words works best
- Use descriptive verbs: "walking," "gesturing," "adjusting"
- Match camera style to content type (handheld for UGC, tripod for podcasts)
- Include environmental context that supports the scene
- Consider the subject's starting position in the source image
Don't
- Use vague descriptions like "cool video" or "awesome scene"
- Request actions that contradict the source image's pose
- Overcomplicate background elements—one dramatic element is enough
- Exceed 8 seconds of audio
- Rush dialogue—natural pacing works better
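Two of the points above, the 25-word guideline and the vague-phrase warning, are simple enough to check automatically. The following is a minimal sketch of such a check; `lint_prompt` is a hypothetical helper, and the phrase list only contains the two examples named in this guide.

```python
MAX_WORDS = 25  # the guide suggests staying under 25 words
VAGUE_PHRASES = ("cool video", "awesome scene")  # examples from the list above


def lint_prompt(prompt: str) -> list[str]:
    """Return a list of warnings; an empty list means no issue was found."""
    warnings = []
    word_count = len(prompt.split())
    if word_count >= MAX_WORDS:
        warnings.append(f"{word_count} words; aim for under {MAX_WORDS}")
    lowered = prompt.lower()
    for phrase in VAGUE_PHRASES:
        if phrase in lowered:
            warnings.append(f"vague phrase: {phrase!r}")
    return warnings
```

The other points, such as whether an action contradicts the pose in the source image, still need a human look.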
Quick Reference Templates
Stationary [shot type] with subtle [camera motion] on [subject description], [setting details].
Example: Stationary medium shot with subtle push-in on podcast host gesturing, studio lighting with plants in background.
Handheld selfie shot, [subject] [action] [emotion], [location], natural movement.
Example: Handheld selfie shot, woman talking excitedly holding coffee, rainy city street, natural movement.
[Camera motion] on [subject], [atmospheric element], [mood descriptor].
Example: Slow orbit around detective, rain falling heavily, noir atmosphere with neon reflections.
[Subject] [simple pose/action], [dramatic background element] behind.
Example: Stylish woman crosses street confidently, massive explosion erupts behind her.
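The bracketed templates above can be filled in programmatically when generating many prompt variations. This sketch assumes each placeholder is written as `[name]`, as in the templates; `fill_template` and the dictionary keys are illustrative choices, not an Omnia feature.

```python
import re

TEMPLATE = (
    "Handheld selfie shot, [subject] [action] [emotion], "
    "[location], natural movement."
)


def fill_template(template: str, values: dict) -> str:
    """Replace each [placeholder] with its value; raise if one is missing."""
    def substitute(match):
        key = match.group(1)
        if key not in values:
            raise KeyError(f"no value for placeholder [{key}]")
        return values[key]

    return re.sub(r"\[([^\]]+)\]", substitute, template)


filled = fill_template(TEMPLATE, {
    "subject": "woman",
    "action": "talking",
    "emotion": "excitedly",
    "location": "rainy city street",
})
print(filled)
```

Raising on a missing placeholder avoids sending a prompt that still contains literal brackets.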