A few weeks ago someone posted a video walking through a restaurant where the menu photos, the dishes, the whole vibe were obviously AI-generated. The caption: “everything in this restaurant is AI generated and it’s freaking me tf out.” It got 6,279 likes, and the replies were brutal — “slop ads, slop food,” “looks cheap and lazy,” people questioning whether the kitchen was even clean. One chef summed up the fear perfectly: “I cannot trust your cooking. I need to see what it looks like, not what a data center thinks it looks like.”
So let’s be clear about what we’re doing here, because it’s the opposite of that. AI video got cheap enough that a café owner can make a real promo clip in twenty minutes for the price of a coffee. That’s a genuine gift. But done lazily — faking dishes you don’t serve, gluing on garbled text, over-processing a real photo until the latte looks radioactive — it doesn’t just flop. It makes customers trust you less. The good news is that the honest way to use AI video is also the way that actually works. And it starts with a photo you already have.
What changed (and what to ignore)
Two years ago a decent promo video meant a videographer, a half-day shoot, and a four-figure invoice. Now Google’s Veo 3.1 — built into Gemini and Google’s Flow tool — can turn a single still photo into a short, moving clip with natural light and motion. One bean of coffee can “become” a whole café scene. A flat photo of your croissant can get rising steam and a slow camera push.
Quick housekeeping so you don’t waste time: Sora is gone. OpenAI shut its consumer Sora app down on April 26, 2026. If a tutorial tells you to use Sora, it’s out of date. The live tools that matter for a small business are Google’s Veo and Gemini Omni, with the cheapest real entry point being a Google AI subscription (more on the $4.99 Google AI Plus plan and whether it’s worth it below).
The honest 20-minute workflow
The whole trick is this: you animate a photo of your actual food. Not a generated dish. Not a stock bowl. The real thing you’ll hand the customer. That single rule keeps your ad honest, keeps you on the right side of the FTC’s truth-in-advertising rules, and — not a coincidence — produces the clip that converts.
Step by step:
- Shoot your real dish. Phone camera, 1080p, plain surface, near a window. This photo is both your raw material and your honesty insurance.
- Open Gemini or Google Flow, start a video generation, and attach the photo as the reference image. This is “image-to-video” — the dish stays anchored to reality instead of being invented.
- Write two short lines: one action, one camera move. For a coffee shot, that’s literally: “Rising steam from a fresh latte, wisps curling slowly in warm morning light. Slow push-in, shallow depth of field, no text, no hands.” Keep the subject’s motion and the camera’s motion as separate, simple instructions.
- Generate eight seconds on the “Fast” setting first. It’s a cheap draft. Eight seconds is the hard ceiling for one coherent clip anyway, so don’t fight it. Check the motion looks natural before you spend on a higher-quality render.
- Trim it, then add your text in a normal editor like CapCut or the Instagram/Reels editor. This is the step everyone skips and shouldn’t: never let the AI render your price, café name, or address — it garbles letters into nonsense. Type the text yourself, on top.
- Post it vertical (9:16), 1080p, 7–15 seconds. Drop your hashtags in the first comment.
The motions that read as natural for food are the semi-random, hard-to-fake ones: rising steam, a slow push-in, a liquid pour, a sauce or honey drizzle, a golden-hour light shift. Those hide the small imperfections AI still has. Lean on them.
What this means for you
If you run a café or coffee shop: Steam is your best friend. A latte, a fresh espresso pour, a cinnamon roll with curling steam — these animate beautifully and look real because steam is genuinely random. Start there.
If you own a restaurant: Animate the hero dish you’re known for. A slow push-in on a real plate of pasta with a faint rising warmth beats any generated scene. Post one a week, rotate the dish, and let customers see exactly what they’ll get.
If you’re a local shop or boutique: Same playbook, different subject — a product on a clean surface with a slow rotate or a soft light sweep. The “one good photo becomes a scroll-stopping video” idea works for a candle or a bag just as well as a croissant.
What AI video still can’t do
- It can’t write text. Prices, your café name, “OPEN LATE” — AI renders letters as garbled nonsense. Add every word yourself, in the editor, after.
- It can’t do hands or faces well. Fingers merge, faces go uncanny. Keep people out of the AI clip — if you want a human, film that part for real and cut it in.
- It can’t keep your logo consistent. Your cup design or sign will morph frame to frame. Add the logo as a static overlay in editing, not in the generation.
- It can’t run longer than about 8 seconds per clip cleanly. Want 20 seconds? Stitch a few clips in your editor. Don’t ask one generation for a mini-movie.
- It can’t make fake food honest. This is the big one. An AI-glossed bowl that looks nothing like the real plate isn’t clever marketing — under FTC rules it’s deceptive advertising, and customers punish it harder than no ad at all. Animate your real photo. Don’t generate a dish you don’t serve. A tiny “#AIGenerated” caption costs you nothing and buys trust.
The bottom line
The café that got roasted online didn’t fail because it used AI. It failed because it used AI to lie about the food. Do the opposite. Take a real photo of the thing you’re proud of, let AI add eight seconds of steam and a gentle push-in, type your own words on top, and post it. Twenty minutes, a few cents, and a clip that looks like your place — because it is.
Want the full setup — the exact prompts, the camera-motion cheat sheet, and how to batch a month of clips in an afternoon? Our AI Video Creation course walks you through it for a small business, start to finish. And if you’re deciding which AI plan to actually pay for to get the video tools, here’s the honest breakdown of Google AI Plus at $4.99.
Sources
- Veo 3.1 — Google DeepMind
- The ultimate prompting guide for Veo 3.1 — Google Cloud
- What to know about the Sora discontinuation — OpenAI Help Center
- OpenAI is shutting down Sora — The New York Times
- Truth in Advertising — Federal Trade Commission
- The State of AI Video Generation 2026: what works and what doesn’t — IS4.ai