Gemini Omni Flash

Google's conversational video model — generate a clip, then refine it by chatting instead of re-prompting.

Last verified · 2026-07-02 · by Moe Ameen

What Gemini Omni Flash is

Gemini Omni Flash is a video generation and editing model from Google, launched in public preview via the Gemini API on June 30, 2026 (model ID gemini-omni-flash-preview). It is the fast, cost-efficient tier of the new Gemini Omni family and shipped the same day as Nano Banana 2 Lite, Google's fast image model. Where a plain text-to-video tool takes one prompt and hands you a finished clip, Omni Flash is built around a conversation: you generate a clip, then keep talking to it — "make it night," "move the camera left," "swap the jacket to red" — and each turn builds on the last result.

The model is multimodal on the way in. It accepts text, images, and video as references and produces video as output, drawing on Gemini's broader reasoning to keep the physics and continuity of a scene plausible rather than just stitching frames that look right. The headline capability is stateful editing: Google describes it as remembering the video context across turns and applying your change while preserving the elements you did not mention, so you refine a shot by describing edits instead of re-rolling the whole generation.

At launch there are real limits worth planning around. Clips are capped at 10 seconds, with longer durations described as coming soon. Output is 16:9 (the default) or 9:16 vertical. Every clip carries Google's invisible SynthID watermark for AI provenance. A handful of features are unsupported in the preview: audio references, multi-video referencing, and system-instruction and temperature controls. Google also notes that character consistency can drift when you change scenes. Video editing of uploaded content is also restricted in the EEA, Switzerland, and the UK. Treat any specific limit as a preview-era snapshot, because the preview is still changing quickly.
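If you are scripting around these limits, a small pre-flight check can catch an out-of-range request before you spend a generation on it. The sketch below only encodes the limits described above (10-second cap, 16:9 or 9:16 output). `ClipRequest` and `check_preview_limits` are hypothetical names made up for illustration and are not part of the Gemini API, and the numbers should be treated as a preview-era snapshot.

```python
from dataclasses import dataclass

# Launch-preview limits as described in this article.
# These are a snapshot for illustration, not values read from the API.
MAX_DURATION_SECONDS = 10
SUPPORTED_ASPECT_RATIOS = ("16:9", "9:16")
DEFAULT_ASPECT_RATIO = "16:9"


@dataclass(frozen=True)
class ClipRequest:
    """Hypothetical request shape for illustration; not a Gemini API type."""
    prompt: str
    duration_seconds: float
    aspect_ratio: str = DEFAULT_ASPECT_RATIO


def check_preview_limits(request: ClipRequest) -> list[str]:
    """Return a list of human-readable problems; an empty list means the request fits."""
    problems = []
    if not request.prompt.strip():
        problems.append("prompt is empty")
    if request.duration_seconds <= 0:
        problems.append("duration must be positive")
    elif request.duration_seconds > MAX_DURATION_SECONDS:
        problems.append(
            f"duration {request.duration_seconds}s exceeds the "
            f"{MAX_DURATION_SECONDS}s preview cap"
        )
    if request.aspect_ratio not in SUPPORTED_ASPECT_RATIOS:
        problems.append(
            f"aspect ratio {request.aspect_ratio} is not one of "
            f"{', '.join(SUPPORTED_ASPECT_RATIOS)}"
        )
    return problems


if __name__ == "__main__":
    request = ClipRequest("a red jacket on a rainy street", 12, "1:1")
    for problem in check_preview_limits(request):
        print(problem)
```

Keeping the limits in module-level constants means a single edit updates the check when Google raises the duration cap or adds aspect ratios.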

Pricing is usage-based at $0.10 per second of video output — the same rate Google charges for Veo 3.1 Fast — so a full 10-second clip runs about a dollar in raw generation cost. Omni Flash is reachable through the Gemini API, Google AI Studio, the Gemini app, and Google Flow.
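The per-second rate makes budgeting simple arithmetic: cost is seconds of output times $0.10. The sketch below works that out for a single clip and for a chat-to-edit session. It assumes that every conversational turn produces a new clip billed at its full output length, which the pricing above does not state explicitly, so treat the session figure as an upper-bound guess. `estimate_cost_usd` is a hypothetical helper, not an API call.

```python
from decimal import Decimal

# Rate quoted in this article for the launch preview.
PRICE_PER_SECOND_USD = Decimal("0.10")


def estimate_cost_usd(clip_seconds: float, generations: int = 1) -> Decimal:
    """Estimate raw generation cost in USD.

    ASSUMPTION: each generation or edit turn outputs a full clip of
    `clip_seconds` and is billed per second of that output.
    """
    if clip_seconds <= 0:
        raise ValueError("clip_seconds must be positive")
    if generations < 1:
        raise ValueError("generations must be at least 1")
    # Go through str() so float inputs such as 7.5 convert to Decimal exactly.
    return PRICE_PER_SECOND_USD * Decimal(str(clip_seconds)) * generations


if __name__ == "__main__":
    print(f"One 10-second clip: ${estimate_cost_usd(10):.2f}")
    print(f"Initial clip plus 4 edit turns: ${estimate_cost_usd(10, generations=5):.2f}")
```

`Decimal` is used instead of floats so that currency totals do not pick up binary rounding noise when many clips are summed.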

What you can make with it

Short 10-second video clips from a text prompt, a reference image, or a reference video
Iterative edits to a generated clip through conversation — relight, recolor, restyle, adjust camera — without re-rolling the whole thing
Image-to-video: bring a still (a product shot, a poster, an AI image) into motion
9:16 vertical clips sized for Reels, TikTok, and Shorts, or 16:9 for YouTube and landscape
Scene variations of the same concept by branching the conversation into different directions
Hook and B-roll snippets for ads, explainers, and social openers

How Kompozy turns Gemini Omni Flash output into content

Omni Flash's chat-to-edit loop is genuinely the fastest way to nail one 10-second shot. But a 10-second clip is not a post, and it is definitely not a content week. Kompozy is the layer that turns that clip into finished, scheduled content and then multiplies it. Drop an Omni Flash export into Kompozy and it burns in branded, on-style captions, reframes the clip cleanly for each destination's aspect ratio, and lets you stack hook text or lower-thirds through HyperFrames so the silent-autoplay first second actually reads. Then Kompozy schedules and fans that clip to TikTok, Reels, YouTube Shorts, X, LinkedIn, and the rest of its nine connected platforms from one queue — instead of you exporting and re-uploading into six apps by hand.

The bigger unlock is fan-out. A single Omni Flash clip can seed a whole content unit inside Kompozy: the video for short-form feeds, plus a quote graphic, a set of native text posts, and a thread written in your own voice through your Persona Brief — one 10-second render becomes a week of cross-platform posts. And where Omni Flash caps out (10 seconds, no talking head, no long-form), Kompozy generates the formats it can't: Persona Shorts and HeyGen avatar video, Clipped Shorts from long-form, carousels, blogs, and newsletters. Omni Flash owns the fast, conversational shot; Kompozy owns the captions, the format fan-out, the brand voice, the schedule, and the publish.

Generate and refine a clip in Gemini Omni Flash — prompt it, then chat your edits until the 10-second shot is right.
Export the MP4 and bring it into Kompozy.
Let Kompozy add branded captions, reframe it per platform, and layer hook text or overlays via HyperFrames.
Fan the clip out into supporting posts — a quote card, native text posts, and a thread in your voice via your Persona Brief.
Schedule and publish the whole set across TikTok, Reels, Shorts, X, LinkedIn, and more from one queue.

Frequently asked questions

What is Gemini Omni Flash?

Gemini Omni Flash is a video generation and editing model from Google, launched in public preview via the Gemini API on June 30, 2026. Its defining feature is conversational editing — you generate a clip, then keep chatting to refine it, and each turn builds on the previous result while preserving what you did not change.

How long can Gemini Omni Flash videos be?

Clips are capped at 10 seconds in the launch preview, with longer durations described by Google as coming soon. Output is available in 16:9 (default) or 9:16 vertical.

How much does Gemini Omni Flash cost?

It is priced at $0.10 per second of video output — the same rate as Veo 3.1 Fast — so a full 10-second clip costs roughly a dollar in raw generation. It is available via the Gemini API, Google AI Studio, the Gemini app, and Google Flow.

What can you make with Gemini Omni Flash?

Short 10-second clips from text, an image, or a reference video; iterative edits to a generated clip via conversation; image-to-video motion; and 9:16 or 16:9 output for social and landscape. Preview limits include no audio references, no multi-video referencing, and occasional character-consistency drift across scene changes.

How do I turn a Gemini Omni Flash clip into posts across platforms?

Omni Flash generates the clip but does not publish it. Bring the export into Kompozy to add branded captions, reframe it per platform, and schedule and publish it across TikTok, Reels, YouTube Shorts, X, LinkedIn, and more — then fan the same clip out into a quote card, text posts, and a thread in your voice.

Related tools

Nano Banana 2 Lite — Google's fastest, cheapest Nano Banana image model — a 4-second generator built for high-volume creation.
Runway — The AI video platform behind the Lionsgate partnership — cinematic text-, image-, and video-to-video generation with consistent characters and scenes.
Higgsfield — AI video and image platform known for cinematic camera-motion control.
ByteDance Seedance 2.5 — AI video model that generates a 30-second clip in one pass — no stitching.
HeyGen — AI avatar video platform that turns a text script into a talking-head video — in 175+ languages.
