15-second energetic Indian street food cinematic montage, hyper-realistic documentary style, ultra-wide-angle handheld shots, vibrant roadside atmosphere, chaotic urban energy, lively Indian street market aesthetic, constant movement, fast-paced editing, authentic cooking ASMR mixed with upbeat Indian music.
Main subject:
Hardworking Indian street food vendor cooking at a busy roadside stall beside heavy city traffic. Sweaty face, fast hands, realistic street kitchen, giant wok of boiling oil, colorful sauces and spices everywhere.
Music:
Energetic Indian street-style music starts immediately from the first second — upbeat dhol drums, tabla rhythm, festive brass, fast Bollywood-style percussion, nonstop lively rhythm throughout the entire video.
[00:00-00:02]
HOOK SHOT. Ultra-wide-angle selfie-style close-up of the vendor at the frying station while oil violently sizzles beside him. Camera distortion from wide lens creates immersive realism. Loud Indian music immediately starts.
Traffic rushing behind him.
People shouting food orders.
[00:02-00:04]
Fast handheld shots:
— batter dropped into boiling oil
— flames bursting under giant wok
— vendor rapidly flipping snacks
— close-up of bubbling oil exploding
ASMR cooking sounds layered with music.
[00:04-00:06]
Wide-angle tracking shot of customers crowding the stall, exchanging cash quickly, grabbing hot food plates, constant movement everywhere.
People talking in Hindi.
Motorcycles passing loudly.
[00:06-00:08]
Rapid montage:
— green chutney poured dramatically
— chopped onions flying onto hot plate
— spice powder thrown in slow arc
— tea steaming
— customers eating immediately beside the road
Music intensifies.
[00:08-00:10]
Dynamic low-angle shot from inside the stall looking outward toward the busy street. Vendor works nonstop at incredible speed while multiple customers wait impatiently.
Sweat shines realistically under afternoon sunlight.
[00:10-00:12]
Ultra-fast cuts synchronized perfectly to music beats:
— frying
— plating
— customers smiling
— money exchanged
— wok tossing food
— steam hitting the camera lens
— close-up of crispy street snacks
[00:12-00:15]
Final cinematic wide-angle shot of the entire roadside stall during peak rush hour. Vendor still cooking rapidly while traffic, smoke, people, and music create overwhelming vibrant energy.
Camera slowly pulls back into the chaotic Indian street atmosphere.
Style:
Netflix food documentary mixed with Bollywood street energy, hyper-realistic cinematic footage, handheld realism, immersive street photography, authentic Indian urban atmosphere, vibrant warm colors, dynamic editing.
Audio:
lively Indian music nonstop, dhol drums, tabla, frying ASMR, street chatter in Hindi, honking, motorcycles, metal utensils, customers ordering food loudly.
Negative prompts:
luxury restaurant, quiet environment, staged cooking, modern clean kitchen, low energy, slow pacing, blurry footage, cartoon visuals.
Create a cinematic text-to-video scene featuring an original non-copyrighted moment where a census worker in a rural region reaches an extremely remote homestead at dusk and discovers an elderly woman who has been living entirely alone for twenty-three years following the death of her husband, who has not been counted in any census since 1999, and who, when asked the census questions, answers with such specific and vivid detail about her daily life that the census worker — whose job is to record data — finds herself simply sitting at the kitchen table listening long past the official requirements of the visit. The mood is richly human, about the lives that exist outside visibility, with a documentary warmth feeling.
The drive is forty minutes on an unpaved road. The homestead is real and functioning — the woman has a vegetable garden, chickens, a woodpile that suggests serious competence. She opens the door without surprise, as if she expected someone eventually. She makes tea without being asked — the kettle was already on. She answers every census question precisely and without elaboration. Then the census worker finishes her form and does not immediately stand to leave.
The woman talks about her winter routine, her garden calendar, a fox that has been visiting the chicken coop for three years that she has not killed because she has come to respect its persistence. She talks about the radio programs she prefers and why. She talks about her husband — not with grief, with the specific warmth of someone who keeps someone present through the act of mentioning them. The census worker misses her last two appointments of the day. She does not realize until she is back in her car in the dark.
Visual tone: hyper-realistic observational drama quality, remote homestead at dusk — the specific quality of rural evening light, the self-sufficient homestead as a world complete in itself, kitchen interior warm against the outside dark arriving, premium domestic detail — the kettle, the cups, the table worn smooth, the census form as an official document in an unofficial conversation, the census worker's face as the scene's emotional register. Camera language: unpaved road approach, homestead arrival, the woman opening the door — no surprise, tea being made, census questions sequence — form and her face answering, form completion and not leaving, the conversation beginning, specific details — the fox story, the radio programs, the husband mentioned and present, the census worker's face moving through professional to something else, window showing dark arriving outside, census worker in her car in the dark — the appointments missed, the form filled, something else also having happened. Include: homestead ambient — the specific quiet of remote rural evening, chickens, the kettle, the kitchen's warmth, the census worker's pen, tea cups, and the sound of a woman talking about her life to someone who came to count her and stayed to hear her.
Use the uploaded storyboard as the exact visual reference. Maintain the same young woman throughout the entire video, preserving identical facial features, hairstyle, skin tone, body proportions, age, and overall appearance. No character changes. Ultra-realistic vlog-style football fan journey, natural camera motion, realistic crowd behavior, cinematic sports documentary quality, 9:16 vertical format.
Scene 1 – Excitement at Home (4–5s)
Modern apartment in daylight. The young woman wears a beige oversized hoodie, black leggings, and white sneakers. She sits on her bed checking football tickets on her phone and becomes visibly excited. Medium shot, phone close-up, facial close-up. She grabs her backpack and prepares to leave.
Scene 2 – Finding the Jersey (3–4s)
Standing in front of an open wardrobe. A Portugal football jersey with Ronaldo number 7 and matching shorts are visible. Over-the-shoulder shot. She reaches for the jersey and smiles.
Scene 3 – Transformation Sequence (4–5s)
Fast cinematic cuts. Close-ups of unfolding the Portugal jersey, putting on the shorts, adjusting her hair, and looking into a mirror. Seamless transformation from casual clothes into the full Portugal supporter outfit.
Scene 4 – Mirror Confidence Shot (3s)
Full-body mirror reflection. She is now wearing the Portugal Ronaldo jersey and shorts. She smiles confidently, gives a thumbs-up, picks up her backpack, and leaves.
Scene 5 – Leaving Home (3s)
Tracking shot from behind as she exits her apartment building and walks toward the match.
Scene 6 – Travel Montage (5–6s)
City streets filled with football supporters. Mix of selfie-vlog clips, walking shots, and wide city views. She records herself while heading toward the stadium, smiling and interacting with the atmosphere.
Scene 7 – First Stadium Reveal (4s)
Wide cinematic shot from behind. The massive football stadium appears ahead. She pauses briefly and admires the venue.
Scene 8 – Outside the Venue (5s)
Selfie vlog shot outside the stadium entrance surrounded by fans and Portugal flags. Looking directly into the camera, she says: "I've reached the venue!" Natural lip-sync. Excited expression. Energetic pre-match atmosphere.
Scene 9 – Walking Toward Entrance (3s)
Tracking shot as she walks through the crowd toward the stadium gates.
Scene 10 – Inside Concourse (4s)
POV and handheld shots inside the stadium concourse. Large screens, fans, and match-day activity. She looks around excitedly while filming.
Scene 11 – Entering the Stands (4s)
Over-the-shoulder reveal. She steps into the seating area and sees the football pitch for the first time. Dramatic stadium reveal.
Scene 12 – Emotional Reaction (3s)
Close-up of her face. Genuine amazement, excitement, and joy as she takes in the atmosphere.
Scene 13 – Fan Atmosphere (5s)
Wide crowd shot. Portugal supporters wave flags and sing. She joins the chants and celebrations.
Scene 14 – Selfie From the Stands (4s)
Handheld selfie with the pitch visible behind her. She records herself enjoying the atmosphere.
Scene 15 – Celebration Moment (5s)
Crowd erupts in celebration. Slow-motion mixed with handheld footage. She jumps, cheers, and celebrates with other supporters.
Scene 16 – Final Hero Shot (5s)
Night stadium under bright floodlights. Close-up selfie with packed stands behind her. She smiles, waves a Portugal flag, and celebrates as the crowd roars around her. Epic sports-commercial ending.
Style: Ultra-realistic, cinematic vlog, sports documentary, football fan experience, natural facial expressions, realistic crowds, handheld smartphone footage, authentic stadium atmosphere, smooth camera movement, high detail, 4K quality, consistent character identity throughout.
Create a cinematic text-to-video scene featuring an original non-copyrighted moment where a seed bank curator in an Arctic facility receives a shipment of seeds from a botanical garden in a conflict zone (the last living samples of seventeen plant species, evacuated by a botanist who stayed behind to do it) and processes them into long-term storage with the specific care and professionalism of someone who understands that what she is putting into the vault today may be the reason something beautiful exists in a hundred years. The mood is quietly heroic by proxy, scientifically tender, about the long-chain human effort of preservation, with a documentary realism feeling.
The shipment arrives with documentation in two languages and a handwritten note from the botanist she has corresponded with for eight years but never met, whose name she knows from dozens of professional emails, who is still in the city that was being shelled when he sent these. The seeds are properly packaged: he did it right, under whatever conditions he did it under. She checks each sample against the documentation. Seventeen species. All present.
She processes them according to protocol: moisture testing, viability assessment, labeling, the specific cold chain steps that are the difference between a seed that can germinate in a century and one that cannot. She works slowly, carefully, with the attention of someone who knows what these specific seeds cost to get here. When the last sample is sealed and logged, she sits for a moment. Then takes his note (handwritten, brief, professional) and puts it in the physical archive file with the seed documentation. So that whoever opens this vault in fifty years knows his name.
Visual tone: hyper-realistic scientific facility quality, Arctic seed bank interior (cold, functional, the specific visual language of long-term preservation), premium seed sample and documentation texture, her hands in processing gloves as primary subjects, the handwritten note as the human artifact inside the scientific process, the vault as the scene's implied destination and meaning, the cold as a constant visual and physical presence.
Camera language: facility exterior (Arctic, remote, the vault in the landscape), receiving the shipment, documentation check, sample verification sequence (seventeen species confirmed, her face on each one), processing work (moisture testing, labeling, cold chain steps in real procedural time), the last sample sealed and logged, sitting for a moment, the handwritten note, reading it close-up, opening the physical archive file, putting the note in (his name now in the vault record), closing the file, the vault door, the facility interior with the work complete, Arctic exterior (the facility in the landscape, holding what she just put in it).
Include: facility ambient (cold air ventilation, the specific quiet of a temperature-controlled preservation environment), processing equipment, her movements in cold conditions, the note unfolding, the archive file, the vault closing, and the Arctic wind outside a building that is keeping something alive for people who are not born yet.
15-second cinematic Turkish wedding celebration, inspired by the attached reference image.
Style:
Ultra-realistic wedding documentary, handheld wedding cinematography, vibrant Turkish village wedding atmosphere, high energy, joyful chaos, authentic celebration, warm evening lighting, festive string lights overhead.
Audio:
Traditional Turkish Halay music from the very first frame.
Fast tempo davul and zurna.
Crowd clapping in rhythm.
Cheering, laughter, joyful shouts.
No romantic music.
No slow moments.
Constant energy throughout.
0:00 - 0:02
Hook shot.
Close-up of synchronized feet performing fast Halay steps on the stone courtyard.
Dust kicks up from the ground.
Powerful davul beats hit immediately.
0:02 - 0:04
Fast cut.
Bride in sparkling white wedding dress with red ribbon belt dances energetically in the Halay line.
Guests smile and shout encouragement.
Camera circles around the group.
0:04 - 0:06
Wide shot.
Long Halay chain moving dynamically across the dance floor.
Men and women holding hands, stepping in perfect rhythm.
String lights glow overhead.
0:06 - 0:08
Rapid close-ups.
Hands linked together.
Feet stomping.
Bride laughing.
Guests clapping loudly to the beat.
Quick energetic cuts.
0:08 - 0:10
Low-angle tracking shot moving alongside the dancers.
Fast synchronized Halay footwork.
Wedding guests cheering from tables in the background.
0:10 - 0:12
Camera dives into the crowd.
Children dancing.
Older relatives clapping and singing.
Everyone moving with the music.
0:12 - 0:14
Fast rotating shot around the bride and groom as they lead the Halay.
Dress swirling.
Crowd erupts with excitement.
Continuous music and celebration.
0:14 - 0:15
Final explosive wide shot.
Entire wedding party dancing together.
Hands raised.
Massive energy.
Music reaches peak intensity as the frame ends.
Visual keywords:
Turkish wedding,
Halay dance,
davul zurna,
celebration,
festival atmosphere,
authentic wedding documentary,
multishot,
close-ups,
wide shots,
handheld camera,
energetic crowd,
warm night lighting,
string lights,
joyful chaos,
constant movement,
cinematic realism.
Negative prompts:
slow motion,
romantic scenes,
kissing,
emotional crying,
empty dance floor,
still camera,
silence,
sad mood,
formal posing,
staged photography,
text,
captions,
watermarks.
Time freezes mid car chase crash on a rain soaked neon city highway at night, shattered glass and water droplets suspended in air, camera orbits around a stunned driver looking exactly like Elon Musk still gripping the wheel in mid impact, high contrast, photorealistic, ultra hyper detail.
Presented in a style that resembles real video footage captured with an unprocessed iPhone handheld camera. All camera settings are automated, with no post-processing coloring or special effects applied. The footage features genuine handheld vibrations and the slight movement due to the operator's breathing. Autofocus occasionally experiences delays and brief loss of focus. Automatic white balance switches rapidly between strong lighting from flashlights and ambient classroom light. The image is overall flat, preserving the real-life lens flare, edge blurriness, slight overexposure, and motion blur. Only natural environmental sounds from within the scene are used, including sounds of chalk writing, page turning, student whispers, and recording notifications from the phone. The footage is captured from a first-person handheld perspective from behind the classroom. The camera movement is natural, with slight shaking occurring occasionally due to nervousness. A tall, short-haired Asian female teacher is depicted in the footage.
Clothing
She stood in front of the classroom blackboard. She had a well-proportioned figure, with a delicate makeup, and wore a pair of black-rimmed glasses. A high-intensity lighting source was placed on the floor in front of the classroom, shining brightly upwards. At 0 seconds, the camera was positioned behind the classroom, focusing on the female teacher as she stood sideways in front of the blackboard, writing with chalk. The high-intensity light from below illuminated her body, creating a strong backlight effect and prominent highlights. At 1 second, the light passed through the semi-transparent fabric of her black net dress, clearly highlighting the contours of her thighs, hips, and body. The autofocus briefly lost focus due to the intense light, but then re-established itself. At 2 seconds, the female teacher continued to write on the blackboard, and the sound of chalk against the blackboard was clearly audible. Fine chalk dust floated in the air of the classroom. At 3 seconds, the student on the left side of the front row already noticed the bright light around them. Excitedly, she took out her phone and started recording the teacher secretly. The reflection on the screen made a brief flash in the video. At 4 seconds, the female teacher finished writing, put down her chalk, and turned to face the students. At 5 seconds, she picked up the dark red textbook on the table, held it with both hands, and stood up straight. At 6 seconds, the female teacher began teaching, holding the textbook in her hands, with her gaze directed towards the back of the classroom. The bright light from below made her silhouette even more distinct. At 7 seconds, the students in the front row already started talking in low voices. The student on the left continued to record secretly, while the student on the right tilted her head to look at the screen. At 8 seconds, the female teacher suddenly noticed that the back of the classroom was unusually quiet. 
Her movements paused briefly, and she slowly closed the textbook, with a slight frown on her face. At 9 seconds, she raised her head, her gaze piercing straight through the black frame of her glasses, towards the back of the classroom. Her expression changed from concentration to obvious confusion and seriousness. At 10 seconds, the female teacher looked directly at the camera, speaking in a low but firm voice: "Wait, what's going on?" Her voice sounded particularly clear in the classroom. At 11 seconds, it could be seen that several students were lowering their heads to avoid her gaze. One student quickly put away his phone. At 12 seconds, the female teacher remained standing in the middle of the classroom. The bright light from below elongated her silhouette. She didn't press further, but simply stared silently at the back of the classroom, her eyes filled with obvious questioning and dissatisfaction. At 13 seconds, she tilted her head slightly, and the bright light highlighted the outline of her black net-style dress. A tense silence filled the classroom. At 14 seconds, the female teacher held the book against her chest. Her voice was calm, but there was a sense of pressure in her tone: "Let's continue with the lesson." At 15 seconds, she turned back towards the blackboard. The scene ended in a tense and awkward atmosphere. The footage was captured using an unprocessed iPhone, with a natural, unrefined quality, without any post-processing adjustments or special effects. All of the camera movements were consistent with the physical characteristics of real-life classroom snaps. Changes in lighting and focus adjustments were all handled automatically.
Presented in the style of authentic, unedited handheld iPhone footage. All camera settings are automatic, with no post-processing color grading or special effects. The image features realistic slight handheld shake and the subtle movement of the operator's breathing; autofocus occasionally hunts, lags, and briefly loses focus, and auto white balance shifts naturally under cloudy natural light. The image is overall flat, preserving authentic lens halos, edge chromatic aberration, slight overexposure, and motion blur. Only natural sounds from within the scene are used, including wind noise, rustling leaves, rustling clothes, and distant birdsong. Shot from a first-person handheld perspective, the camera moves naturally with occasional slight shake due to movement or angle adjustments. A well-proportioned young Asian woman, dressed in a deep blue, loose Taoist robe
(Traditional Taoist robe style, with overlapping collars), layered with purple tight pants underneath, standing barefoot on the outdoor bluestone floor. Her hair is tied up in a bun and secured with a wooden hairpin. Behind her is a mottled gray-white wall with a large blue character "Mo" painted on it, and green trees in the distance. At 0 seconds, the camera frames her in a medium shot. She stands in the center of the frame, feet naturally apart, hands hanging at her sides, eyes calmly looking ahead. At 1 second, she begins slowly, raising both arms forward to chest height, palms facing down, her center of gravity slightly lowered, her movements steady and continuous. At 2 seconds, she performs the "Grasp the Sparrow's Tail" movement, pushing forward with her right hand and drawing an arc backward with her left hand, slowly turning her body as the hem of her robe sways gently with the movement. At 3 seconds, she continues with "Cloud Hands," circling both hands alternately in front of her and slowly swaying her body from side to side. At 4 seconds, she lowers her center of gravity and gradually transitions toward a wide horse stance, bending both knees outward and keeping her body steady. At 5 seconds, she settles fully into the wide horse stance, fists clenched in front of her chest, gaze focused, center of gravity low. At 6 seconds, she holds the wide horse stance for a moment, fists still clenched in front of her chest, breathing steadily, her robe hanging naturally. At 7 seconds, she begins to rise slowly from the horse stance, raising both hands upward in a slow, powerful movement. At 8 seconds, she rises to a half squat and pushes both hands forward to complete the change of pose. At 9 seconds, she continues with the "Single Whip" movement, turning her body to the left, lifting her right hand upward, and pressing down with her left.
At 10 seconds, she performs "Cloud Hands" again, drawing circles in front of her chest with both hands and slowly swaying her body from side to side, her purple pant legs and Taoist robe swaying with the movement. At 11 seconds, she lowers her center of gravity and transitions back into the wide horse stance, clenching her fists in front of her chest. At 12 seconds, she holds the wide horse stance, her gaze calm and her posture steady. At 13 seconds, she begins to stand up slowly, drawing her hands back in front of her. At 14 seconds, she stands fully upright, her hands lowered naturally, and calmly looks ahead. At 15 seconds, she remains standing, the camera shakes slightly, and the frame ends in natural light. The visuals present the authentic, unedited feel of handheld iPhone video, with documentary-level natural imperfections and no post-production color grading or special effects. All camera movement conforms to the physical characteristics of real outdoor handheld shooting and stays slow, steady, and smooth, matching the authentic movement of Tai Chi.
tai-chi
handheld
iphone
documentary
realistic
martial arts
cinematic continuous shot / motivated camera movement / 15s
SCENE
A bustling Elizabethan-era market street in London, circa the late 1500s, at dusk. Timber-framed buildings lean over a narrow muddy street, upper floors nearly touching. Wooden market stalls overflow with bread, fish, herbs, and cloth. Butchers' hooks hang with raw meat, barrels of ale line the road, and livestock (geese, chickens, and a stray pig) wander freely.
Vendors shout in the street, accents rough and lively. Smoke rises from chimneys, mixing with damp fog rolling in from the nearby River Thames. The ground is wet, reflecting torchlight and lantern glow in uneven puddles.
CAMERA CONCEPT
A continuous, motivated camera move where each subject's motion naturally redirects attention—flowing like a living observer through the chaos of the market.
SEQUENCE
0:00–0:03
Close street-level view of a wooden stall stacked with apples and root vegetables.
CAMERA FOCUS: a cloaked woman argues over price with a merchant. She bites into an apple to test it, then reluctantly drops worn coins into his hand.
0:03–0:05
A horse-drawn cart loaded with sacks of grain suddenly splashes through muddy water across the foreground, briefly obscuring the scene.
CAMERA SHIFT: the camera picks up the cart and begins tracking alongside it.
0:05–0:07
The cart grinds through the tight street, brushing against a hanging cloth banner bearing a faded guild emblem. The fabric whips across the lens.
CAMERA SHIFT: as the cloth clears, a group of geese scatter loudly across the road.
0:07–0:09
A ragged street boy darts into frame, chasing the geese with a stick, weaving through townsfolk and baskets.
CAMERA SHIFT: the camera follows him, dodging passersby.
0:09–0:12
The boy rushes past the entrance of a dim tavern. He disappears into the crowd.
CAMERA SHIFT: the tavern door creaks open as a drunk patron stumbles out.
0:12–0:15
The camera glides past him and into the tavern interior. Thick smoke hangs in the air, lit by flickering oil lanterns. Rough wooden tables, spilled ale, murmuring voices.
CAMERA FINAL FOCUS: a silent armored figure sits in the corner—less polished knight, more worn mercenary—mud-stained boots, dented armor, a longsword resting against the bench. He slowly raises his eyes toward the camera.
STYLE
Authentic Elizabethan street life, dense crowd choreography, grounded realism, historically textured details.
LIGHTING
Warm torchlight reflecting in wet mud outside, cool fog diffusion, dim golden lantern glow inside, smoke-heavy atmosphere.
QUALITY
photorealistic, cinematic lighting, historically grounded detail, immersive camera flow, rich 1500s London atmosphere
Cinematic short film. Subject: A 29-year-old extremely handsome athletic Japanese professional street chef. Sharp defined features, intense focused eyes, messy black hair, lean muscular build, small burn scars on forearms. Wearing open black chef jacket, fitted dark apron, rolled sleeves, professional kitchen boots.
SHOT 1 — Extremely low ground-level tracking shot. He is already moving at full speed across the open fire kitchen station. After a violent explosive pan flip sending flames three feet into the air, he barely catches the landing before immediately charging to the next station.
SHOT 2 — Whip pan transition into an absurdly fast knife sequence. Continuous blade movement blurs across the frame while the camera struggles to keep up with his overwhelming speed and precision.
SHOT 3 — Wide moving shot across the entire kitchen line. Multiple rapid dish preparations happen back-to-back with almost no setup time. Cameraman nearly loses balance moving backwards to follow the run.
SHOT 4 — Compressed long-lens shot capturing a massive open fire explosion in slow motion. Extreme body control through the heat and flame, nearly losing his grip before miraculously recovering at the final second.
SHOT 5 — Ultra-low circular tracking shot around an extended plating sequence. Constant high-speed micro adjustments push the presentation to the edge of perfection, yet he calmly wipes the plate edge mid-motion without slowing down.
SHOT 6 — Final shot. He slides the last plate forward at full speed and freezes completely still. Steam and flame trails drift through the air behind him. He slowly straightens up and locks into a completely still final pose under harsh kitchen light. The sound cuts instantly. Camera freezes on his completely calm and emotionless face. Fade.
Style: Ultra-realistic cinematic culinary documentary combined with premium food advertising energy. Warm fire tones, dramatic single light source, heavy contrast. Real human presence.
Use the uploaded image as the first frame. Keep the skateboarder's face, red hair, outfit, skateboard, tattoo, pink ramp, blue sky, bright sunlight, and backyard skatepark fully consistent. Create a hyper-realistic super slow motion cinematic skate video.
The skateboarder is suspended mid-air above the pink ramp while performing a stylish one-hand-supported trick. Start with a very low angle shot close to the ramp surface to emphasize height and impact. The action unfolds in elegant super slow motion. The camera moves in a smooth cinematic arc around her body, focusing on her vivid red hair flowing in the wind, the loose shirt fabric fluttering behind her, the skateboard rotating subtly above her feet, and the bright sun flare glowing behind the board.
Keep the motion graceful, premium, and visually satisfying. Add subtle dust particles, natural shadows, realistic light bloom, and believable action-sports physics. Then she regains control of the board, lowers naturally toward the ramp, lands smoothly with realistic balance, bends her knees softly on impact, and rides away across the pink ramp. End with a smooth follow-cam shot as she rolls away confidently. The final result should feel like a premium slow-motion skate film: polished, cinematic, graceful, and emotionally powerful.
I2V | 10.0s | 16:9 | 24fps | ultra-real live-action | cinematic action thriller | shot on Arri Alexa | 50mm lens | practical lighting | practical stunt combat only | no CGI
Use Image 1 as the exact identity lock, wardrobe lock, first-frame composition, and scene continuity reference.
SUBJECT / IDENTITY LOCK:
Same woman as Image 1 in every frame. Mid-20s. Pale skin. Sharp blue eyes. Short asymmetrical black bob. Strong cheekbones. Athletic feminine build. Same blue-black fitted bodysuit with choker collar, black armbands, glossy practical material. She must remain identical in face, hair, body, wardrobe, and silhouette. Real skin pores, subtle sweat, natural facial asymmetry, no beauty filter, no plastic AI skin.
SCENE:
Dark moody industrial warehouse interior at night. Smoky haze in the air. Warm practical bulbs and dim industrial backlights. Gritty, atmospheric, cinematic, realistic. Two masked assassins in black tactical clothing engage her at close range. Everything must feel like a real photographed action film seen on TV, not stylized art.
ACTION TIMELINE:
[0.0–1.5s]
Start on the same composition as Image 1. The woman finishes a brutal right-arm strike into the assassin on frame right. His head snaps back hard and his shoulders twist from impact. Her bob whips with the motion.
[1.5–3.0s]
The left-side assassin rushes in. She pivots sharply, blocks his arm, grabs his wrist, and drives a fast elbow into his chest. Tight, practical body mechanics.
[3.0–5.0s]
She spins low, sweeps his leg, and sends him crashing to the concrete floor. The right-side assassin recovers and lunges back into frame.
[5.0–7.0s]
She ducks the attack, slams him into a pillar or metal support, then follows with two short brutal body shots and a forearm strike. Hair movement, breath, recoil, and impact must feel real.
[7.0–8.5s]
She steps back into a guarded fighting stance as both masked men hesitate, circling. She stares them down with cold focus.
[8.5–10.0s]
She says a short line with controlled intensity: "Who sent you?" One assassin breathes heavily off to the side. The other shifts, preparing for another attack. End on a tense held stance, ready for the next beat.
CAMERA:
Subtle handheld cinematic movement throughout. Slight push-in and small reactive operator adjustments, like a real on-set action shot. Keep the framing grounded and readable. No wild camera swings, no floating impossible motion.
MOTION RULES:
Realistic anatomy, realistic stunt timing, grounded momentum, visible impact, believable recoil. Slight motion blur only on fast limbs and hair. Preserve facial clarity. No rubbery motion, no morphing, no face drift, no extra limbs.
LOOK:
Premium movie screenshot realism. Evening atmosphere. Natural low-key lighting. Practical highlights on costume. Soft haze. Natural shadow falloff. Real wardrobe texture. Gritty but polished feature-film look. Must feel like a frame sequence from a real TV-broadcast action film.
DIALOGUE:
At 8.8s, the woman says clearly and intensely: "Who sent you?"
MUSIC:
Dark pulsing cinematic action score. Low electronic pulse, tense percussion, deep braam hits, metallic impacts, rising string tension, aggressive rhythmic drive. Music should build across the 10 seconds and leave a suspenseful unresolved ending.
SOUND EFFECTS:
Heavy punches, cloth movement, boot scuffs on concrete, body impacts, sharp exhales, masked grunts, light reverberation in the warehouse, subtle metal rattles, smoky room ambience, distant electrical hum, and a final tense silence under the last line.
NEGATIVE:
No CGI look, no cartooniness, no superhero physics, no glossy fake skin, no over-choreographed dance-fighting, no slow motion, no fantasy effects, no morphing, no facial instability, no extra attackers appearing.
Create an ultra-realistic cinematic dinosaur sequence set in a prehistoric world untouched by humans. The video must feel like a fusion of Hollywood blockbuster filmmaking, BBC Earth realism, IMAX nature cinematography, and next-generation photoreal VFX.
Scene opens at sunrise inside a colossal ancient jungle valley covered in fog, towering cliffs, volcanic mountains, giant waterfalls, and dense tropical forests. Massive dinosaurs roam through the environment with cinematic scale and realism.
Opening shot:
A gigantic Tyrannosaurus Rex slowly emerges from dense mist while the ground shakes beneath its footsteps. Rain droplets fall from its scarred skin in ultra slow motion. Camera moves low through wet grass and mud with dramatic cinematic depth.
Cut to:
A herd of enormous Brachiosaurus walking beside gigantic waterfalls while flying Pterosaurs soar across the glowing orange sky. Sun rays pierce through clouds creating epic god rays and atmospheric haze.
Suddenly — the jungle becomes silent.
A terrifying roar echoes across the valley.
A colossal Spinosaurus explodes out of dark water in ultra slow motion, creating massive splashes, flying debris, shaking trees, and cinematic shockwaves. The T-Rex turns aggressively.
Final sequence:
Both apex predators charge toward each other through mud, rain, smoke, and fire as volcanic eruptions begin in the background. Camera rapidly switches between extreme close-ups, ground-shaking footsteps, teeth detail, water splashes, flying dust, and ultra cinematic wide shots.
Final frame:
Both dinosaurs freeze face-to-face before impact while lava erupts behind them and lightning flashes across the sky.
TEXT ON SCREEN:
"WHEN TITANS RULED THE EARTH"
Style:
ultra photorealistic, cinematic masterpiece, IMAX scale, volumetric lighting, realistic dinosaur anatomy, dramatic atmosphere, HDR, film grain, shallow depth of field, dynamic camera movement, Hollywood VFX quality, epic color grading, emotional orchestral tension, 4K, 8K, legendary cinematic realism.
Negative prompt:
cartoon, low quality, blurry, bad anatomy, cheap CGI, extra limbs, unrealistic movement, low detail, oversaturated, distorted faces, text glitches, watermark, frame flicker, low FPS.
Ultra realistic cinematic wide-angle portrait, 9:16 aspect ratio, using the exact face from the reference photo, friendly warm smile, calm and charismatic expression, standing confidently in a luxurious Middle Eastern desert village while gently carrying a cute fluffy baby lamb in both arms. The lamb looks adorable and innocent with soft curly white wool, realistic fur texture, tiny ears, expressive eyes. Same face as the reference image.
Behind him stands a tall realistic camel with height nearly equal to the man, decorated with elegant traditional desert ornaments and detailed saddle accessories, calmly standing close behind him.
He wears elegant black traditional Middle Eastern clothing with oversized flowing fabric, layered scarf around the neck, intricate geometric embroidery, luxury gold bracelets, ornate waist accessories, black robe moving naturally with the desert wind, realistic fabric folds, luxury desert warrior aesthetic.
Wide-angle cinematic composition with slightly low angle camera perspective, foreground depth using pottery, ropes, desert plants, sand textures, and blurred objects. Background filled with cube-shaped sandstone Middle Eastern houses, narrow desert streets, tall palm trees, warm desert atmosphere, highly saturated bright blue sky, soft cinematic sunlight, realistic global illumination, atmospheric depth, subtle shadows.
A majestic eagle flying in the sky, with the words “Eid Mubarak” naturally formed from soft white clouds around the bird, elegant and aesthetic cloud typography integrated realistically into the sky. Ultra detailed photorealism, cinematic lighting, shallow depth of field, luxury editorial photography style, warm golden desert tones, sharp focus, highly detailed textures, 8k realism.
Use the attached storyboard image as an accurate visual blueprint to create a 10-second vertical (9:16) ultra-realistic live-action documentary video.
Strictly maintain the storyboard's composition, chronological order, character evolution, and walking direction.
Do not add extra shots or change the sequence.
【Video Style】
Ultra-realistic live-action movie quality.
High-budget documentary footage like BBC Earth, NHK Special, or National Geographic.
Realistic texture like a historical reenactment film.
Natural light-based.
Cinematic lighting.
Photorealistic.
Realistic skin, fur, clothing, stone tools, and natural environments.
Cinematic depth.
Intellectual and majestic atmosphere.
【Video Structure】
Video length is 10 seconds.
Switch through the following 6 scenes from the storyboard in order 1→6 with equal duration:
1. Ape-man (early primates)
2. Australopithecus
3. Homo erectus
4. Neanderthals
5. Cro-Magnons
6. Modern humans
Edit each scene to flow naturally.
Emphasize the sense of gradual evolution over time.
【Character Performance】
Characters walk from left to right continuously in all scenes.
Walking speed is always constant.
Ensure a sense of continuity in the same lineage across all scenes.
With evolution,
* Posture
* Facial features
* Build
* Fur amount
* Clothing
* Tools
* Gait
gradually evolve naturally.
Express it as evolution within the same lineage, not a switch to different people.
【Camera】
Maintain the same side-view composition in all scenes.
Keep camera distance constant.
Always capture the full body of the character.
Smoothly follow the walking motion.
No intense camera work.
Natural filming like a documentary film.
【Background Performance】
Background environments also evolve naturally with each era:
* Forest
* Grassland
* Arid region
* Glacial region
* Mountainous region
* Modern city
Make environmental changes connect naturally.
【Captions】
Display short Japanese captions in each scene indicating the era.
Display content:
* Ape-man (early primates)
* Australopithecus
* Homo erectus
* Neanderthals
* Cro-Magnons
* Modern humans
Caption style:
* Small size
* Elegant
* Non-intrusive to the footage
* Semi-transparent
* White or light gray
* Bottom center display
* Subtle fade-in and fade-out
* In the style of BBC Earth or NHK Special
* Minimal elegant documentary-style typography
Important:
* Do not hide characters with captions
* No flashy animations
* No YouTube-style UI
* No variety show style
【Music】
Majestic and grand classical music evoking the beginning of the universe or human history.
Intellectual and mystical atmosphere.
Emotional build-up like a movie trailer.
【Important】
Strictly reference the uploaded storyboard.
Maintain the panel layout, era changes, and walking direction.
In the final video, do not display:
* Storyboard frames
* Annotations
* UI
* Page information
* Production notes
【Negative Prompt】
Slow motion, unnatural movements, dancing, exaggerated acting, robotic movements, unnatural muscles, deformed limbs, blurry faces, low-quality textures, fantasy effects, sci-fi effects, mutations, intense camera work, static background figures, low resolution, anime-style expressions, comical expressions, unnatural walking, camera angle changes, switching to different people, excessive motion blur.
documentary
evolution
history
cinematic
realistic
storyboard
Place the woman (reference video) realistically into these locations (*). Never change the angle, framing, the woman, or the woman's pose. Never zoom in, never zoom out. Keep exactly the same angle and the same framing. (Only change the outfit.)
* Museum - 1click
* Car showroom - 1click
* Next to the window of a skyscraper - 1click
* Next to the window on a bus - 1click
* Beside a wall in a metro station - 1click
Cinematic ocean documentary film. Subject: A 27-year-old extremely handsome athletic deep-sea fisherman. Strong sharp features, piercing calm eyes, short neat beard, broad athletic shoulders, naturally tanned rugged skin. Wearing heavy yellow waterproof overalls, thick rubber boots, rope coiled over one shoulder. Old rusted fishing vessel.
→ SHOT 1 — Extremely low deck-level tracking shot. He is already moving at full speed across the vessel deck toward a tangled net. After violently hauling the net over the side, he barely catches his footing on the wet deck before immediately pushing forward again.
→ SHOT 2 — Whip pan transition into an extended rope-pulling sequence. Continuous rope friction and ocean spray explode across the frame while the camera struggles to keep up with his overwhelming strength and speed.
→ SHOT 3 — Wide moving shot across the entire vessel deck. Multiple rapid tasks happen back-to-back with almost no setup time. Cameraman nearly loses balance on the rocking boat to follow the action.
→ SHOT 4 — Compressed long-lens shot capturing a massive wave crashing over the bow in slow motion. Extreme body control holding position against the force, nearly swept off before miraculously recovering at the final second.
→ SHOT 5 — Ultra-low circular tracking shot around an extended balance sequence on the slippery deck. Constant adjustment against the rocking ocean pushes stability to the limit, yet he calmly adjusts his grip on the rope mid-motion without slowing down.
→ SHOT 6 — Final shot. A massive wave crashes over the hull completely surrounding him. Spray, foam and momentum trails explode across the deck. He slides into a firm wide stance, slowly straightens up, and locks into a completely still final pose under harsh ocean light. The sound cuts instantly. Camera freezes on his completely calm and fearless face. Fade.
Style: Ultra-realistic National Geographic ocean documentary combined with premium sports advertising energy. Cold desaturated tones. Heavy ocean atmosphere. Real human strength and presence.
Photorealistic cinematic 15 second video of a small, ultra-detailed songbird with vibrant lime green upper body and wings, soft white underbelly with subtle brown accents, sharp black eye, and delicate pointed beak flying gracefully through a deep navy blue void.
The bird is covered by a semi-transparent glowing white wireframe mesh that perfectly conforms to its body and wings, with fine knitted engineered fabric texture visible especially on the wings. Bright lime green glowing coordinate points with dynamic floating text labels ("x: XXXX y: YYYY") appear and track across the bird's head, eye, beak, chest, wings, and legs in real time, like advanced AI motion capture and digital scanning.
Thousands of small glowing white and soft pink particles float in the dark space, connected by thin glowing lines forming an elegant, slowly shifting neural network or data web that the bird moves through.
Crystalline water droplets and translucent liquid-glass splashes dynamically form on the feathers and mesh, trailing behind the wings with realistic refraction and motion.
[0s–4s]: Medium three-quarter front shot. The bird hovers and begins slow, powerful wing flaps. Wireframe mesh fully activates and glows. Coordinate points appear one by one and start tracking. Camera slowly orbits right toward a clean side profile while gently pulling back. Subtle forward drift of the bird.
[4s–9s]: Smooth side-profile tracking shot. Bird flies forward with elegant wingbeats and realistic feather motion blur. Camera tracks alongside at medium distance, slightly rising. More particles interact with the bird. Liquid splashes become more dynamic on wingtips. Coordinate labels continuously update and move with the anatomy.
[9s–13s]: Camera continues orbiting while slowly dollying out. Bird rotates its body slightly, revealing more of the wireframe structure and underbelly. Stronger crystalline liquid effects and particle trails. Dramatic rim lighting highlights the glossy mesh and wet feathers.
[13s–15s]: Final powerful wing flap. Bird flies slightly toward camera as camera pulls back smoothly. Mesh and coordinate points reach maximum intensity and glow. Particles swirl more actively. Cinematic slow-motion feel on the final wing movement. Clean ending pose with beautiful light interaction.
Style: hyper-realistic CGI mixed with premium VFX, ultra-detailed feathers and mesh, cinematic lighting, high contrast, volumetric atmosphere, sharp focus on the bird, 8K quality, commercial technology visualization aesthetic. Dramatic cool color grade with emerald green, white, and deep navy tones. Realistic physics, natural inertia on wings and particles, beautiful motion blur.
Single continuous cinematic shot, no cuts, no text, no logos, no watermarks.
Full-frame cinematic scene in the style of a 1970s academic TV drama. An elderly professor teaches at a chalkboard inside an old lecture hall. Warm late-afternoon sunlight pours through tall windows, casting long golden beams across worn wooden desks. Floating chalk dust is visible in the light, gently moving through the air.
The professor writes slowly on the chalkboard, pausing to explain with calm, deliberate gestures. Students sit quietly, listening with focused attention, some taking notes, others simply observing. The room feels lived-in, intellectual, and grounded in realism.
Subtle cinematic camera movement: slow push-in from the back of the lecture hall toward the professor, with shallow depth of field emphasizing chalk texture and facial detail. Natural film grain, soft contrast, warm analog color grading.
No voiceover, no text overlays, no graphics; only natural classroom ambience, faint chalk sounds, and distant street noise filtering through the windows.
A cinematic emotional story about memory, nostalgia, and the passage of time.
0–3s: Late evening in a quiet living room. A mother sits alone watching old home videos projected onto a wall — childhood birthdays, laughter, family vacations, warm golden memories flickering softly in the dark.
3–6s: Close-up shots of emotional details — dust floating through projector light, trembling hands holding an old tape, tears reflecting scenes of her young child running through sunlight years earlier.
6–9s: Reality begins blending with memory. The child from the videos briefly appears moving through the present-day house, laughing and running past the mother as if time itself is overlapping.
9–12s: The mother walks slowly through these living memories — touching walls, hearing echoes of old conversations, seeing younger versions of her family around the home in soft glowing light.
12–15s: Final emotional shot. The memory fades as the projector stops rolling. The mother smiles gently through tears while dawn light enters the room, leaving behind warmth instead of sadness.
Style: ultra cinematic realism, emotional nostalgic atmosphere, soft golden lighting, film-grain memory aesthetic, seamless memory-to-reality transitions, intimate camera movement, heartfelt orchestral mood, film-grade color grading.
Create a 25-second ultra-realistic cinematic product ad video using the uploaded image as the main product reference. Start with a dramatic close-up shot of the product in a dark luxury environment with soft lighting and smooth camera movement. Add fast-paced cinematic transitions, glowing reflections, premium shadows, and detailed textures to make the product look expensive and high quality.
Scene 2 shows the product rotating slowly with dynamic light streaks and modern commercial-style background music. Add realistic motion blur, depth of field, and AI-generated luxury ad aesthetics. Include short cinematic text overlays like "Next Level Quality", "AI-Powered Ads", and "Turn Photos Into Viral Videos".
Scene 3 shows social media style product showcase shots with energetic transitions and engaging commercial visuals. Add realistic human interaction or lifestyle cinematic atmosphere matching the product theme.
Final 5 seconds should show the Pollo AI app interface/tutorial demo using the "Picture to Ad Video Agent" feature. Add glowing UI animation and ending text:
"Created using @itsPolloAI"
"Turn Any Picture Into Winning Ads"
Style: Hyper realistic, cinematic, premium commercial advertisement, 4K quality, smooth motion, realistic lighting, modern ad aesthetics, engaging social media reel format, 16:9 widescreen.
A stylish person stands before a large mirror in a modern boutique, observing their reflection. The camera slowly moves around the subject, capturing different angles. Each perspective reveals new reflections in the mirror, showing the subject from multiple sides simultaneously. Ultra-detailed rendering of fabric textures, realistic skin detail, and soft studio lighting. The mirror reflections are physically accurate, showing the correct perspective based on camera angle. High-quality 4K imagery with cinematic depth of field, shallow focus on the subject, sharp background details.
A colossal, photorealistic tsunami towers over a modern coastal city at stormy dusk, an immense wall of dark blue water curling forward with unstoppable force, filled with swirling debris, dense foam, and wind-blown mist as lightning flashes through heavy storm clouds; a single continuous 15-second cinematic slow-motion shot with a smooth forward-moving camera as the wave rapidly advances, city lights flickering beneath the looming water mass, buildings beginning to buckle under pressure, and streets disappearing under rising floodwater; ultra-realistic fluid dynamics with detailed foam and particle simulation, volumetric lightning illumination, rich cinematic color grading in deep blues and stormy grays, subtle motion blur, highly detailed environmental destruction, no artifacts, seamless motion consistency.
Cinematic ultra-realistic 3D animated cat daily vlog, cozy rainy evening aesthetic, adorable chubby orange-and-white British Shorthair cat acting like a human commuter, wearing purple headphones, purple scarf, purple tote bag, and purple Crocs throughout the entire video, consistent character design in every scene. Soft rainy city atmosphere, reflective wet streets, cinematic depth of field, warm indoor lighting, highly detailed fur physics, realistic rain droplets, smooth Pixar-style animation mixed with realistic textures.
Scene 1 — 17:31 GET OFF WORK
The cat exits a modern office building during rain while holding a transparent umbrella, walking confidently on wet reflective pavement, city lights glowing softly, cinematic tracking shot.
Scene 2 — 17:47 THE CAR HAS ARRIVED
The cat waits at a rainy bus stop with umbrella, bus headlights approaching through misty rain, soft ambient traffic reflections, emotional urban vibe.
Scene 3 — 18:07 I'M SO SLEEPY
Inside public transport, the cat sits sleepily near a rainy window, eyes closed while listening to music, soft vehicle motion, cozy evening commute feeling.
Scene 4 — 18:27 ARRIVED AT THE COMMUNITY
Rear view of the cat walking through a quiet residential community in rain, umbrella dripping water, playground and apartment lights blurred in background.
Scene 5 — 18:31 WAIT FOR THE ELEVATOR
The cat stands patiently in front of an apartment elevator holding the folded transparent umbrella, warm hallway lighting, realistic reflections on elevator doors.
Scene 6 — 18:40 FINALLY HOME
The cat enters a cozy apartment hallway, removes the tote bag and umbrella, relaxing after work, warm cinematic home atmosphere.
Scene 7 — 18:52 TAKE A BATH FIRST
Cute bathroom scene, cat removing scarf before shower, soft white lighting, realistic steam and bathroom reflections, adorable expression.
Scene 8 — 19:12 THE TAKEOUT HAS ARRIVED
Food delivery person hands takeout to the cat at the apartment door, cat now wearing soft purple pajamas, cozy nighttime mood.
Scene 9 — 19:20 EATING SHOW TIME
The cat sits at a small table eating spicy noodles and fried chicken while watching TV, soda beside the meal, cozy apartment ambiance, fluffy duck plush on sofa in background, satisfying mukbang vibe.
Ultra detailed fur rendering, realistic motion animation, smooth camera transitions, TikTok reel style, vertical 9:16 aspect ratio, cinematic audio atmosphere, wholesome relaxing storytelling, trending cute AI animal vlog aesthetic.
A realistic TV broadcast camera cuts to the character sitting in the courtside premium seats. The character is smiling and clapping, wearing a team jersey. The background shows the basketball court during a timeout, hyper-realistic 4K resolution.
Create a 15-second ultra-realistic cinematic influencer vlog set during a live nighttime Japanese baseball game. Blend authentic iPhone vlog realism with televised sports-broadcast aesthetics and emotional Gen-Z fan energy.
Main character: authentic Japanese woman, 21 years old, fair natural skin, glossy lips, long dark-brown hair with soft bangs, natural kawaii makeup, oversized baseball jersey, pleated mini skirt, loose socks, white sneakers, cute stadium accessories. Realistic Japanese appearance only — not anime or CGI.
Camera: iPhone 14 Pro HDR realism, handheld selfie footage mixed with sports-broadcast telephoto shots, humid summer-night atmosphere, LED stadium glow, motion blur, crowd cheering, shallow depth of field, broadcast compression artifacts, cinematic documentary realism. Soft emotional background music inspired by "bye" by Ariana Grande.
CUT SCENE 1 — OUTSIDE STADIUM (0–4s)
Front-camera handheld selfie outside a glowing Japanese baseball stadium at night. Crowds in jerseys walk behind her under neon lights while wind moves her hair naturally.
Dialogue (Japanese-accented English): "Today I am going to watch live baseball show!"
Quick whip-pan transition.
CUT SCENE 2 — INSIDE THE STADIUM (4–8s)
Televised sports-broadcast shot inside a packed stadium. She sits among cheering fans with the glowing baseball field behind her. Fans wave thundersticks and drinks.
Dialogue: "We actually got such a good view!"
Add realistic Japanese/Korean-style sports overlays: scoreboard graphics, inning count, sports-channel watermark, pitch-speed display.
CUT SCENE 3 — FAN CAM MOMENT (8–11s)
The giant fan-cam captures her. She laughs shyly and makes a Korean finger-heart gesture toward the camera while nearby fans cheer loudly.
Dialogue: "Oh no… the camera found me!"
Sports-broadcast zoom-lens framing with realistic crowd motion blur.
CUT SCENE 4 — ENDING SHOT (11–15s)
Dreamy close-up with LED stadium bokeh behind her. She places her right hand beside her cheek, thumb and index finger forming half of a heart shape, smiling softly toward the camera.
Dialogue: "Byeee~"
Slow cinematic push-in with emotional stadium ambience fading out.
Style tags: photorealistic, Japanese baseball stadium, live sports broadcast, Tokyo Gen-Z influencer vlog, candid fan cam, cinematic documentary realism, realistic crowd atmosphere.
Negative prompt: anime, cartoon, CGI, over-smoothed skin, studio portrait lighting, empty stadium, distorted anatomy, unrealistic beauty filters, esports aesthetic.
Main Character:
A beautiful Korean high school girl wearing a realistic Korean summer school uniform (하복). Natural skin texture with no beauty retouching. Hair becomes messy during combat, with expressive emotional acting. Facial proportions remain fully consistent throughout all shots. She begins timid and shocked, then gradually becomes determined and defiant.
0–1.5 seconds:
The female lead quietly studies at her desk. Four delinquent schoolgirls surround her and begin bullying her. They mock her for studying, aggressively sweep her books off the desk, and shove her shoulders. Wide-angle handheld camera movement. Books fly in slow motion. Tense classroom atmosphere. Realistic school bullying energy, shaky camera motion, cinematic realism.
1.5–3 seconds:
Close-up of the protagonist's face. She slowly stands up. Her expression shifts from fear to cold determination. The bullies remain blurred in shallow depth of field behind her. A 0.5-second moment of silence. Slow cinematic push-in shot. Silence except for ambient classroom sound and tense breathing.
3–5 seconds:
First confrontation. One bully throws a punch; the protagonist blocks and counters with a strike to the stomach. Another attacker rushes from the side; she dodges and retaliates with a spinning elbow strike. Handheld tracking shots follow the motion closely. Dynamic motion blur, impact camera shake, realistic fight choreography. No supernatural effects.
5–8 seconds:
The remaining two bullies attack simultaneously with punches and kicks. The protagonist uses quick footwork and evasive movement to avoid hits. Dynamic 360-degree rotating camera movement. Rapid chained kicks and elbow attacks knock the attackers down. Classroom desks and objects shift from the impacts. Intense cinematic action pacing.
8–10 seconds:
The final attacker charges toward the protagonist. The female lead leaps high into the air. Low-angle shot from the ground. 30% slow motion. Hair and skirt flow naturally. Dramatic cinematic lighting. Floating dust and airborne particles drift slowly through the air.
10–12 seconds:
Midair 360-degree spinning kick. Slow-motion impact directly hits the final bully's chest. Extreme close-up of the collision. The bully is launched backward into the classroom wall. Debris and dust explode outward. All bullies collapse onto the floor. Immediately after landing, the camera speed snaps back to normal for dramatic impact.
12–14 seconds:
Victory moment. The protagonist stands alone in the center of the classroom, breathing heavily. The four bullies lie defeated around the room. The camera slowly and dramatically pushes toward her face. Soft cinematic bokeh background. Her expression is determined yet emotional.
14–15 seconds:
Freeze-frame close-up. The protagonist stares directly into the camera and calmly says in Korean:
("I need to get into college.")
Delivery is realistic and emotionally restrained. After the line ends, she returns to looking like an ordinary student. Calm, emotional ending. The film emphasizes the intense academic pressure faced by Korean students.
Style References:
Korean action cinema, ultra-realistic cinematography, cinematic handheld action, emotional realism, grounded fight choreography, realistic Korean classroom atmosphere, high-budget Netflix K-drama aesthetics, cinematic lighting, dramatic silence beats, powerful female protagonist, grounded emotional tone.
Negative Prompt:
Cartoon, anime, CGI-looking textures, fake skin, extra limbs, distorted faces, exaggerated fantasy armor, unrealistic physics, low quality, blurry faces, overexposed lighting, comedic tone, childish style, fantasy classroom, male protagonist, bad anatomy, unrealistic body proportions, supernatural effects, glowing eyes, energy auras, magic.
Create a cinematic travel vlog of a person visiting a tropical island. The video should capture the journey from arrival to exploration, featuring turquoise waters, white sandy beaches, and lush palm trees. Show the protagonist walking along the shore, swimming in crystal-clear water, and relaxing in a hammock. Include aerial drone shots of the coastline and sunset timelapse. Maintain natural color grading with warm tones and handheld camera feel. Add ambient sounds of waves and birds.
Create a cinematic text-to-video scene featuring an original non-copyrighted moment where a competitive swimmer, who retired from the sport fifteen years ago after a career-ending injury, returns alone at midnight to the pool where she trained for twenty years, now a public facility she has avoided since her retirement, and swims one length, alone, in the dark. The mood is privately enormous, physically reclaiming, about the relationship between a body and what it was trained to be, with an intimate observational drama feeling.
She has a key, or used to have one, and when she checks, it still works. The pool is dark except for the underwater lights, which she does not turn off. The water is that specific competitive-pool blue-green. The smell hits her at the door: chlorine and human presence and something older than that, decades of sweat and tears and achievement and failure and youth and middle age, all of it still present in the water.
She stands at the edge in a one-piece suit and old goggles, fifteen years of body changed and unchanged, and she dives into the lane. The first stroke is rusty, her shoulder complaining, her breathing wrong, the timing completely gone, she is not the swimmer she was, she cannot be. But somewhere in the second length it starts to come back, rhythm, breathing, the feel of the water, her body remembering what her mind tried to forget, muscle by muscle. By the third length something impossible happens, she is not swimming like she used to, she is swimming like she is now, a new body with the old knowledge, and it is better than the old way, more interesting, more particular, harder earned, more hers.
She finishes the length and hangs at the wall in the dark looking up at the ceiling she has not seen in fifteen years, breathing hard, shoulder on fire, alive in a way she had forgotten was possible.
Ultra realistic UGC skincare advertisement featuring a handsome young man sitting in a modern apartment talking naturally to camera while promoting a premium face wash product. Cozy daylight lighting, clean masculine aesthetic, realistic skin texture, natural hand gestures, luxury bathroom and table setup, product placed clearly in foreground, soft cinematic depth of field, influencer-style commercial, realistic facial expressions and lip movement, subtle camera motion, premium skincare campaign vibe similar to modern TikTok and Instagram ads. The male model picks up the face wash, applies it naturally, smiles confidently, and shows refreshed glowing skin. Photorealistic, cinematic lighting, realistic shadows, high-end grooming advertisement, 4K, vertical 9:16, 10 second commercial.
A lone underground fighter in a black tactical hoodie, face bruised and sweating, athletic build moving with terrifying precision, eyes completely locked in combat focus
Walks through a crowded underground parking garage before being ambushed by an elite squad of armed mercenaries, triggering an ultra-violent hand-to-hand fight with insane Matrix-style choreography, disarming enemies, dodging bullets at close range, using walls, cars and pillars for impossible acrobatic combat while bodies crash through glass and concrete
Modern urban underground parking structure at night with flickering neon lights, rainwater dripping from ceilings, smoke, shattered glass and luxury cars reflecting the chaos
Starts with slow cinematic handheld tracking behind the fighter walking calmly through the garage, subtle tension from footsteps echoing in silence, sudden whip pan into the first attack, hyper-dynamic continuous tracking shot, orbit shots around impossible slow-motion dodges, crash zooms during brutal impacts, FPV-style movement weaving through gunfire and close combat, realistic sparks, shell casings and debris hitting the lens, time-slice moments freezing punches and bullets mid-air before snapping back into brutal speed, cold realistic urban lighting mixed with flashing headlights, ending with the fighter standing alone in the destroyed parking garage while surviving enemies collapse around him simultaneously, rain dripping through broken concrete as the camera slowly pulls backward revealing complete devastation and silence after the massacre
Highly realistic fansign event scene featuring a Western woman with blonde hair (use uploaded reference image for identity consistency). She is styled like a global pop idol, sitting at a signing table interacting warmly with a fan. The fan is visible only from behind or out-of-frame (face not shown).
Soft pastel lighting, dreamy indoor concert hall atmosphere, shallow depth of field, cinematic but natural documentary feel, slight handheld camera movement with gentle zoom-in, realistic skin texture, subtle crowd bokeh in the background.
Scene Action
The blonde idol leans slightly forward toward the fan, smiling warmly while talking naturally. She forms a cute bunny heart gesture (fingers shaped like bunny ears + small heart pose) beside her face.
She laughs softly and says:
"You're so sweet… I really appreciate you coming today."
She then tilts her head playfully and adds:
"Did you enjoy the show?"
Soft crowd noise, camera gently zooms closer, capturing intimate emotional connection.
The moment feels natural, warm, and slightly dreamy but still realistic (not overly staged or over-glamourized).
Audio Style
Soft indoor crowd ambience
Light camera rustle (handheld realism)
Idol speaking gently and clearly
No music or heavy cinematic score
Negative Prompt
No exaggerated K-pop styling, no plastic skin, no beauty filter, no anime look, no over-saturated glow, no dramatic posing, no exaggerated expressions, no CGI appearance, no text overlays.
A couple of days ago, someone asked for a rainy day prompt: a realistic video prompt (15-second full version - pure first-person POV). Presented in the style of unprocessed, handheld, unstable iPhone video footage, all camera settings are automatic, with no post-processing color grading or special effects. The footage shows realistic breathing and slight irregular hand shake; autofocus frequently experiences intense searches, brief out-of-focus periods, and delayed recovery; automatic white balance naturally adjusts with the dim lighting of the rainy day. The overall image is flat and slightly washed out, retaining realistic lens flare, rain blur, edge watermarks, and slight overexposure. Faint fingerprints and rain artifacts occasionally appear at the bottom of the frame. Only natural ambient sound effects are used (sound of heavy rain, dense rain hitting the ground and umbrella, the sound of wet clothes rubbing, suppressed breathing); the microphone is slightly distorted at louder sounds. The entire video uses a pure first-person POV perspective (student's subjective viewpoint), with camera movement completely following natural head movements and eye gaze. The composition is occasionally imperfect, showing realistic breathing tremors and slight shaking during moments of tension. From 0 to 4 seconds, the camera, from a first-person perspective, stands at the school gate. A torrential downpour is deafening. You can clearly see yourself in soaking wet black trousers, rain streaming down your hair. The school is almost deserted; you stand under the awning, worried about not having an umbrella. From 4 to 9 seconds, your usually strictest Chinese teacher, dressed in… suddenly approaches from behind, holding an umbrella. Rain quickly soaks the thin, white fabric, which clings tightly to her body, revealing glimpses of skin and lace. She stops in front of you and whispers, "No umbrella?… I'm going that way, let me walk you home." Her voice is unusually gentle. 
The autofocus searches for the teacher's soaking wet white jumpsuit and the rain streaming down her collarbone. From 9 to 15 seconds, the teacher takes your arm, naturally leaning close to warm you, the rain pattering against the umbrella. She placed one hand on your shoulder and adjusted her coat with the other, saying, "Teacher's a little cold. Could you come closer?" The damp white fabric clung to her body, her warmth palpable. The scene unfolds naturally in the rain, the two close together, a tension mixed with warmth. The footage presents a realistic, unprocessed handheld video quality, a documentary-level natural imperfection, without any post-production color grading or special effects. All camera actions conform to the physical characteristics of iPhone automatic shooting.
Create a cinematic 10-second ultra-realistic luxury cosmetic commercial in a high-end skincare advertisement style. Use warm champagne lighting, glossy beauty-film aesthetic, shallow depth of field, macro beauty cinematography, smooth cinematic camera movement, and premium fashion-commercial visual language. The entire video features a consistent female model in her mid-20s with glowing glass skin, silky dark hair softly tied back, and wearing an ivory satin robe. Maintain strict identity and appearance consistency throughout all scenes.
Scene 1 (0–1s): Extreme macro shot of transparent serum droplets falling in slow motion onto reflective water. Golden highlights ripple outward, creating a pure luxury skincare ambiance under ultra-clean studio lighting.
Scene 2 (1–2s): Medium close-up of the woman slowly turning toward the camera. Soft rim lighting defines her flawless skin texture as she holds a calm, confident expression in slow motion.
Scene 3 (2–3s): Floating product hero shot of a serum bottle gently rotating in mid-air. Liquid ribbons and suspended glass particles surround it with dramatic studio reflections and premium commercial energy.
Scene 4 (3–4s): Close-up beauty shot as she applies serum onto her cheek using her fingertips. The camera slowly pushes in, revealing natural hydration glow and radiant skin texture.
Scene 5 (4–5s): Side-profile tracking shot with floating water particles drifting around her face. Soft luxury lighting enhances cheekbone highlights and luminous skin.
Scene 6 (5–6s): Slow-motion liquid splash elegantly wraps around the serum bottle placed on reflective black glass. Macro lens detail emphasizes smooth, realistic fluid motion.
Scene 7 (6–7s): Extreme close-up of her eyes and glowing skin. Subtle facial movement under warm cinematic lighting enhances a premium hydrated beauty effect.
Scene 8 (7–8s): Artistic mirror reflection sequence. Slow dolly movement as she gently touches her skin while gazing at her reflection in a dreamy luxury atmosphere.
Scene 9 (8–9s): Hero product shot of the serum bottle surrounded by floating gold particles and soft mist. Strong cinematic backlight builds a high-end cosmetic campaign feel.
Scene 10 (9–10s): Final payoff shot. Slow push-in toward the woman confidently facing the camera beside the glowing serum bottle. Elegant typography fades in over warm bokeh highlights, ending in a calm, premium cinematic finish.
Style: ultra-realistic live-action only, no illustration, no sketch, no CGI look, no animation style.
Ultra-realistic 10-second boxing fight between two women inside a small underground gym. Both fighters look naturally athletic with realistic skin texture, sweat, bruises, and detailed facial expressions. One woman wears black boxing shorts and red gloves, the other wears dark gray sportswear with blue gloves. The fight feels raw and authentic, like real professional sparring footage.
The camera moves handheld around the ring at close range, capturing fast punches, defensive movement, realistic footwork, and heavy breathing. Sweat sprays naturally through the air after impacts. The women exchange quick combinations, dodge punches, and aggressively counterattack with believable body movement and physical weight.
Dim overhead gym lights create realistic shadows on their faces and bodies. The background contains trainers, gym equipment, ropes, mirrors, and a few spectators reacting naturally. No slow motion, no dramatic movie effects, no exaggerated choreography. Everything feels like genuine live fight footage recorded on a high-end cinema camera. Realistic motion blur, natural skin detail, subtle camera shake, grounded physics, authentic combat energy, ultra realistic documentary-style sports cinematography.
15s ultra-realistic cinematic video of a tall Japanese fashion model walking a luxury outdoor runway in bright sunlight. Emerald green long coat over a fitted white shirt, cream wide-leg trousers, black platform heels. Starts backstage, steps into sunlight, confident runway walk with flowing fabric, pauses for photographers, smooth turn, audience applause, final smile walking backstage. Vogue fashion week atmosphere, smooth handheld tracking shot, vivid colors, realistic fabric physics, 60fps.
Create a seamless cinematic single-take action scene inside a sterile, dimly lit public restroom.
REFERENCE USAGE:
Image 1 = strict full-character reference for the blonde woman.
Use Image 1 as the only reference for her face, hair, skin tone, body proportions, outfit, accessories, and overall appearance.
Image 2 = strict character reference for the black-haired female attacker.
Do NOT recreate any character sheet layout, panels, borders, labels, text, or collage structure. Use the references only for identity, hair, face, skin texture, body proportions, outfit silhouette, and overall appearance.
BLONDE WOMAN:
Preserve Image 1 closely:
- blonde / dark-blonde shoulder-length soft waves
- pale fair skin
- blue-gray eyes
- elegant brows
- eyeliner
- delicate nose
- full glossy lips
- refined jawline
- calm but dangerous expression
- slim graceful body
- silver satin evening gown
- jeweled bustline
- thin straps
- draped satin construction
- high slit
- necklace and earrings
She must feel like a glamorous gala woman, not a tactical fighter.
BLACK-HAIRED ATTACKER:
Preserve Image 2:
- long straight black hair
- bright blue eyes
- pale skin
- slim athletic body
- sharp cheekbones
- fitted sleeveless black catsuit-like outfit
- cold assassin expression
- fast ruthless movement
SCENE:
Public restroom, white subway tiles, large wall mirror, ceramic sinks, metal stall doors, flickering fluorescent lights, wet reflective floor, cold blue-gray neo-noir mood. One continuous uninterrupted shot, no cuts.
ACTION STRUCTURE:
[0:00–0:02]
Start with an extreme close-up of the blonde woman's face. Her blue-gray eyes are focused, blonde hair frames her face, and fluorescent light flickers across her skin. Behind her, blurred in the background, the attacker rushes in.
[0:02–0:04]
Dramatic slow motion: the attacker throws a punch directly toward the blonde woman's face. The fist rushes toward the lens. At the last second, the blonde woman leans sharply backward and avoids the punch by inches. Her hair and silver satin dress react naturally.
[0:04–0:06]
Slow motion ends, action snaps back to full speed. The blonde woman instantly counters, grabs the attacker, uses her forward momentum, and throws her hard onto the wet restroom floor. Water splashes across the tiles.
[0:06–0:08]
The attacker quickly pushes herself up from the floor. Camera pulls wider to show both women. The attacker rises aggressively and runs toward the blonde woman again for a second attack.
[0:08–0:11]
The blonde woman stays calm, pivots at the last second, catches the attacker's movement, and violently redirects her into the large wall mirror. Ultra slow-motion impact: the attacker crashes into the mirror, a huge spiderweb crack spreads across the glass, and shards explode toward the camera. Both women appear briefly in fractured reflections.
[0:11–0:13]
Slow camera pull-back. The attacker drops to the wet floor, stunned and nearly unconscious. The blonde woman remains standing, composed and in control. Broken mirror shards multiply her reflection.
[0:13–0:15]
Medium close-up to close-up on the blonde woman. She calmly lifts one hand and fixes her hair, smoothing it back into place as if nothing happened. Her expression is cold, elegant, and untouchable. Camera lingers on her face, the shattered mirror, the fallen attacker, and flickering fluorescent light. Fade to black.
STYLE:
Photorealistic neo-noir action thriller, fluid handheld camera, realistic close-combat choreography.
Found footage documentary style. Handheld handycam with natural jitter and shake. Wide-angle consumer lens. Golden hour natural daylight, slightly blown-out highlights, grainy film stock aesthetic. Spatial field audio recording.
A bizarre hybrid creature, with the enormous papery wings, slender segmented body and delicate thread-like antennae of a monarch butterfly combined with a massively oversized American bison head featuring thick shaggy dark brown fur, a broad domed skull, dense curved horns, a matted beard and enormous flaring nostrils, drifts and lurches erratically through a sunlit open meadow blanketed in dense wildflowers and swaying tall grass.
The shot starts in extreme close-up on a cluster of golden wildflowers before the shaky handheld camera whip-pans left and tilts up unsteadily to reveal the creature. Focus on the movement. Its vast paper-thin monarch wings beat in slow, labored, asymmetric strokes as the creature struggles to stay aloft — drifting sideways in the breeze before abruptly overcorrecting — while the colossal shaggy bison head bobs and sways with devastating inertia beneath it, dragging the delicate body downward with every gust. The creature produces a deep resonant bellowing grunt from the bison head mixed with the dry papery flutter of enormous butterfly wings.
Include plenty of appropriate detail in the background — rolling meadow depth, long golden grass catching afternoon light, scattered wildflower clusters in orange and violet. No dialogue.
Detailed textures of translucent orange and black veined wing membrane contrasting with coarse matted bison fur and the rough dark leather of its muzzle. Realistic creature movement physics combining the slow drifting flutter of butterfly wing mechanics with the catastrophic gravitational pull of the bison skull. Gritty low-quality amateur documentary look. Accurate handheld motion blur and natural camera shake. Photorealistic hybrid anatomy. Single continuous scene.
A cinematic slice-of-life montage showing different human emotions unfolding in the same coffee shop across time.
0–3s: Cozy coffee shop in early morning light. Steam rises from espresso machines. Soft jazz plays. A barista opens the doors as sunlight spills across empty wooden tables.
3–6s: Rapid montage begins — a student studying intensely with headphones on, a businessman stressed on a call, a writer staring at a blank page, a couple laughing over coffee. Same space, different emotions.
6–9s: Midday rush. The shop becomes alive — quick cuts of people entering and leaving, coffee being served, conversations overlapping, reflections in glass windows showing the city moving outside.
9–12s: Emotional shift — a lonely person sits by the window watching rain outside, while across the room a reunion happens with tears and laughter. The coffee shop becomes a silent witness to both sadness and joy.
12–15s: Final cinematic shot. Night falls. The shop is now calm and nearly empty. The barista wipes the counter as lights glow warmly. Camera pulls back to show the same space that held hundreds of untold stories in one day.
Style: ultra cinematic realism, warm cozy aesthetic, soft natural lighting, shallow depth of field, emotional slice-of-life storytelling, smooth time-lapse editing, film-grade color grading, gentle ambient music atmosphere.
Use the uploaded reference image as the exact identity reference for the subject. Create a hyper-realistic live IPL television broadcast crowd-shot sequence during a high-energy playoff cricket match in a packed Indian stadium.
The subject from the reference image must remain fully consistent throughout: same face, same hairstyle, same outfit, same skin tone, same lighting, same seating position, and same crowd environment. Preserve identity accuracy strongly across the entire sequence.
The video should feel exactly like a genuine Star Sports IPL crowd cutaway captured during a real live match — NOT cinematic, NOT influencer-style, NOT vlog-style.
Show realistic live broadcast camera behavior:
quick crowd cutaways,
natural stadium lighting,
authentic TV zoom lens movement,
slight camera shake,
realistic audience reactions,
energetic IPL atmosphere,
LED advertisement boards,
match scoreboard overlays,
cheering fans around the subject.
The subject should react naturally to the match:
smiling, clapping, looking tense during close moments, celebrating boundaries/wickets, and occasionally looking toward the field.
Maintain realistic Indian stadium ambience with thousands of spectators, team jerseys, flags, chants, floodlights, and authentic IPL playoff energy.
Ultra realistic skin texture, natural motion, realistic hair movement, accurate facial consistency, broadcast-quality detail, shallow depth of field, true live sports telecast aesthetic, 4K realism, highly detailed crowd environment.
Negative prompt:
cinematic lighting, music-video style, slow motion, overacting, beauty filter, influencer aesthetic, vlog framing, AI face distortion, cartoon look, unrealistic expressions, fantasy colors, excessive blur, duplicate faces, identity drift, studio lighting, posed acting, fake crowd.
Found footage documentary style. Handheld handycam with natural jitter and shake. Wide-angle consumer lens. Natural daylight, slightly overexposed, grainy film stock aesthetic. Spatial field audio recording.
A bizarre hybrid creature, with a glistening, wet octopus body and a distinct green frog head with large blinking eyes, undulates through tall garden grass. Octopus tentacles pull the creature forward
Focus on the movement. Octopus suction cups on the tentacles grip the blades of grass as the creature moves with a strange, creeping motion. The frog head on the front looks around intently
The hybrid creature reaches the edge of a muddy garden pond
The frog-headed octopus plunges fully into the dark pond water
Detailed skin textures (both wet cephalopod and amphibian). Realistic creature movement physics based on natural reference. Gritty documentary look. Accurate handheld motion blur. Photorealistic hybrid anatomy.
Presented in the style of unprocessed, handheld, shaky iPhone video footage, all camera settings are automatic, with no post-processing color grading or effects. The footage captures the realistic breathing of the operator and slight, irregular hand shake. Autofocus frequently exhibits intense searching, brief out-of-focus periods, and delayed recovery. Auto white balance naturally shifts between warm and cool tones as the cool fluorescent lights inside the public transportation vehicle mix with the light from outside the window. The image is generally flat and slightly washed out, retaining realistic lens flare, slight motion blur, and optical imperfections such as watermarks at the edges. A natural orange retro film-like timestamp "06 05 92" appears in the lower left corner of the image. Only natural ambient sound effects (low rumble of a subway/bus, slight vibrations in the carriage, and the sound of fabric rubbing) are used, with no background music. Microphone distortion is slight at louder frequencies. A pure first-person POV perspective (the subjective viewpoint of the voyeur) is employed, with camera movement entirely following the operator's instinctive reactions. The composition is occasionally imperfect, showing realistic breathing tremors and slight shaking during moments of tension. From 0-2 seconds, the camera focuses on a first-person perspective from a seat opposite the female protagonist, lingering on a medium shot inside a public transportation vehicle (subway/bus). A young Asian woman with long, straight, flowing black hair is shown sitting on a blue seat, her arms naturally crossed over her chest. Her clothing is described.
An orange retro timestamp "06 05 92" appears naturally on the left side of the frame. The background shows a bright yellow textured handrail and cool-toned fluorescent lighting. The autofocus is stable and locked on the woman, with slight hand-shake and vehicle vibrations. From 2-5 seconds, the woman realizes she is being filmed and suddenly looks directly at the camera. She slowly raises her left hand, grasps her collar, and pulls her top up to a more concealing style, the movement fluid and natural. Her gaze shifts from calm and composed to slightly provocative, but ultimately reveals concern. She wears large silver hoop earrings, and a black phone and a black leather bag with a metal chain rest on her lap. The autofocus briefly searches and locks on the woman's movement as she pulls up her collar, the image slightly shaky, yet clearly capturing the subtle sound of fabric rubbing against skin. The deep rumble of a vehicle continues throughout. The footage possesses a realistic, unprocessed handheld video quality, a documentary-level natural imperfection, without any post-processing color grading or special effects. All camera behavior conforms to the physical characteristics of iPhone automatic shooting.
Ultra-realistic cinematic 15-second emotional short film set in a real modest home environment. A hardworking woman quietly performing daily chores—cleaning the house, cooking simple food in a small kitchen, folding clothes, and caring for her family. Natural morning sunlight enters through a window, dust particles visible in warm light rays, soft realistic household textures.
The woman shows subtle emotions of fatigue, sacrifice, and quiet strength—gentle expressions, silent determination, no dialogue. Quick cinematic cuts: washing dishes with running water sound, stirring food in a pot, wiping a table, helping a child get ready, placing food on a simple dining table.
Handheld cinematic camera style with shallow depth of field, focusing on her hands and face details. Soft ambient background sounds of home life. Emotional storytelling tone, slightly slow-motion transitions between tasks, highlighting repetition of daily struggle.
Final shot: she pauses for a moment, looks out the window with calm resilience, soft light on her face, symbolizing silent strength and endurance. Fade out with warm cinematic lighting, ultra-detailed, 8K realism, film grain, natural color grading, deeply emotional mood.
Create a cinematic, photorealistic 15-second Coca-Cola advertisement in 4K, 60fps, high-budget commercial style with vibrant red-and-white color grading, golden-hour sunlight, and dynamic camera work. The entire video must be exactly 15 seconds long with perfect timing.
0-2s: Extreme close-up of a frosty contour Coca-Cola glass bottle covered in condensation droplets. A hand opens it with a crisp satisfying 'psssht' sound as ice-cold bubbles rush upward. Camera slowly pushes in on the iconic white cursive 'Coca-Cola' logo glistening under sunlight.
2-5s: Cut to an energetic rooftop party at golden hour overlooking a vibrant city skyline. A diverse group of joyful young adults (multi-ethnic, ages 20-35) laugh, dance, and toast with Coca-Cola bottles and cans. Slow-motion capture of them clinking bottles, pouring Coke over ice, and taking refreshing sips with big genuine smiles. Bubbles fizz dramatically. Upbeat modern pop track with a subtle nod to the classic Coca-Cola jingle plays.
5-9s: Fast-paced, rhythmic montage (quick 0.6-0.8s cuts):
• A skateboarder in a sunny park grabs a Coke from a cooler and takes a big sip, eyes lighting up.
• A happy family at a beach picnic shares one giant bottle.
• Office colleagues celebrating a win in a bright modern workspace.
• A young couple on a tropical beach watches the sunset while sipping.
Every person shows instant refreshment and pure happiness after drinking. Lens flares, sparkling bubbles, and sweat droplets on skin emphasize the heat-to-relief contrast.
9-12s: Hero moment — a confident, beautiful woman in casual summer clothes holds a Coca-Cola bottle toward camera, smiles playfully, and takes a slow sip. Camera smoothly orbits 360° around her as refreshing mist and light rays highlight the bottle. Elegant text fades in: 'Taste the Feeling'.
12-15s: Seamless transition to a clean product beauty shot — the iconic Coca-Cola contour bottle spins slowly in center frame against a rich red gradient background with floating ice cubes and rising bubbles. Sparkling light effects accent the logo. Warm male voiceover (energetic yet friendly): 'Coca-Cola. Open Happiness.' Large 'Coca-Cola' logo animates in with a sparkle, followed by 'Taste the Feeling' and 'Share a Coke' text. Final frame holds the full logo and tagline with subtle particle effects.
Overall style: ultra-realistic, premium commercial quality, saturated yet natural colors, perfect branding accuracy, no text or elements outside official Coca-Cola guidelines. Sound design includes crisp bottle opening, fizzy pour, and uplifting music that builds to a joyful peak. Masterpiece-level 15-second spot.
Ultra-realistic cinematic 15-second video. A young girl walks confidently toward the camera at the center of a long, empty modern bridge.
She moves with a calm, natural presence and a soft, innocent expression. Hands are casually in her pockets.
As she walks forward, a large flock of pigeons suddenly bursts into the air from the bridge surface and railings.
The birds scatter in all directions, filling the frame with dynamic motion and energy.
The bridge is symmetrical with repeating railings and street lamps fading into deep perspective.
Strong leading lines guide the view toward the horizon.
Cool blue-hour lighting creates a soft, slightly moody atmosphere.
Gentle haze in the distance adds cinematic depth.
Camera uses a low eye-level tracking shot, centered composition, slow forward movement matching her pace.
Shallow depth of field keeps the subject sharp while birds show subtle motion blur.
Cinematic color grading, ultra-realistic textures, high dynamic range lighting, film-quality realism.
Use the first uploaded image as the main reference for the school uniform, body proportions, pose, posture, background, camera angle, framing, and overall composition.
Use the second uploaded image as the identity reference for the face and hairstyle.
Create a realistic Korean influencer-style school uniform portrait where the person from the second image naturally appears wearing the school uniform from the first image, photographed in the same studio setting.
Important:
Keep the school uniform, blazer, shirt, tie, skirt or pants, and overall outfit design from the first image.
Keep the body proportions, standing pose, hand placement, posture, camera angle, framing, and studio background from the first image.
Replace the face with the person from the second image.
Also preserve the hairstyle from the second image, including bangs, hairline, hair part, hair length, hair framing around the face, and overall hair silhouette.
Do not use the hairstyle from the first image if it differs from the second image.
Identity:
The face from the second image must remain clearly recognizable.
Preserve the second person's face shape, eyes, nose, lips, skin tone, jawline, and overall facial impression.
Do not turn the face into a generic attractive face.
Do not beautify too heavily.
Preserve the person's recognizable identity, but do not copy the face too rigidly.
Reinterpret it naturally so it looks like a realistic photo of the same person in this new school-uniform scene.
Keep the same overall facial impression and identity while allowing natural refinement and seamless adaptation to the lighting, angle, and mood of the target image.
Hair:
Follow the hairstyle from the second image.
Preserve the second person's bangs, hairline, hair part, hair texture, hair length, and overall hairstyle impression.
Only adapt the hair naturally so it fits the pose, lighting, and composition of the first image.
Korean influencer mood:
clean modern Korean influencer portrait
polished but natural beauty
soft photogenic expression
subtle editorial mood
stylish, slightly chic, youthful, and confident atmosphere
refined but believable skin texture
clear eyes with soft catchlights
naturally pretty, not over-retouched
avoid stiff ID-photo mood
Lighting:
soft Korean beauty lighting
gentle facial brightness
clean skin tone
soft natural highlights on the face
natural shadow transition
subtle glow, but realistic skin texture
avoid harsh flash
avoid flat passport-photo lighting
avoid dramatic studio glamour lighting
Style:
realistic photography
clean studio portrait quality
Korean influencer-style school portrait mood
natural skin texture
high detail
seamless face and hair integration
polished but believable
Negative prompt:
no identity loss
no generic attractive face
no over-beautified face
no first-image hairstyle if different
no awkward face blending
no mismatched skin tone
no mismatched hairline
no distorted facial features
no blurry eyes
no deformed hands
no extra fingers
no change to the school uniform
no change to the body pose
no change to the background
no cartoon style
no anime style
no text
no watermark
A cinematic, realistic 14-second morning-to-evening routine video of a beautiful young woman in her mid-20s with long wavy dark hair, fair skin, and elegant features. She has a natural, fresh-faced look with minimal makeup. Smooth, fluid transitions with a mix of intimate close-ups, wide shots, and subtle fisheye/wide-angle perspectives. Soft natural lighting, clean modern aesthetic, slight film grain, warm morning tones shifting to cooler office tones.
Scene-by-scene breakdown:
0-1s: Close-up of a black iPhone lying on rumpled white bedsheets showing lock screen at 06:50. Gentle morning light.
1-2s: Wide fisheye shot — the woman sits on a neatly made bed in a bright minimalist bedroom wearing light pink short-sleeve pajama set with black piping. She stretches, hugs a large white pillow, then gets up with natural grace.
2-3s: Extreme close-up of her brushing teeth in the bathroom, white toothpaste foam on her lips, focused expression, looking slightly upward.
3-4s: POV from inside an open refrigerator — she peers in wearing pajamas, reaches for slices of bread in the foreground.
4-5s: Close-up of a black frying pan on the stove — a hand cracks an egg, bright yolk lands perfectly, spatula gently adjusts it.
5-6s: Medium shot at a round white kitchen table — she sits in pink pajamas, takes a bite of toast with both hands, chews thoughtfully, then looks up and to the side with a calm, reflective expression.
6-7s: Quick close-up of her face as she stands up from the table, still in pajamas, looking contemplative.
7-8s: Dynamic low-angle fisheye transition — she is now fully dressed in professional outfit: white crop top, black blazer, high-waisted black trousers. She stands in the kitchen holding a black leather handbag in one hand and her phone in the other, confident and ready, slight movement as if heading out.
8-9s: Close-up of her hands tying the laces of black leather Doc Martens-style boots on a wooden floor.
9-10s: Wide fisheye street shot on a rainy overcast day in a New York-style brownstone neighborhood — she walks up the steps of a classic brick building, coat on, bag over shoulder.
10-11s: Inside a subway/train — medium shot of her hand holding a yellow pole, wearing her black blazer. A "no sitting" sign is visible. Subtle motion of the train.
11-13s: Modern bright office environment — she sits at a clean desk, focused, typing on a MacBook Pro. Over-the-shoulder shots show the laptop screen with email/chat interface open. Professional and concentrated mood.
13-14s: Final shot returns to the bedroom at night — soft blue ambient light. She stands on the bed in her pink pajamas again, reaching toward the window or curtains with a tired yet peaceful expression, ending the day.
Style directives: Realistic live-action feel, high detail, natural skin texture, subtle camera movement, smooth motion, elegant pacing, contemporary lifestyle vibe, high production quality like a premium commercial or Instagram Reels cinematic edit. 16:9 aspect ratio, 24fps.
A shaky, panicked handheld recording from the balcony of a sea-facing hotel during the day. The person filming suddenly spots something in the distance, does a jerky zoom-in, then quickly refocuses as a massive meteor streaks across the sky and crashes into the ocean with enormous force.
The impact creates a colossal tsunami that rushes toward the shore. The wave violently annihilates the beach, shacks, and entire coastline in a brutal display of nature's power. The person filming is in complete panic — camera extremely shaky, breathing heavy.
Water surges dramatically, rising rapidly until it reaches just a few floors below the balcony. A water droplet splashes onto the camera lens. The building rocks and shakes violently from the shockwave. In the background, people are screaming in terror while emergency sirens blare with back-and-forth intensity.
Cinematic, hyper-realistic, intense documentary-style footage, natural daylight, high tension, raw and chaotic energy.
Cinematic urban lifestyle sequence of a young female newspaper courier delivering papers across a busy sunny city morning, wearing a navy waterproof windbreaker, black cycling shorts, sneakers, and wireless headphones. Golden sunrise light reflecting off skyscraper windows, energetic downtown atmosphere, realistic city hustle, traffic movement, pedestrians crossing streets, coffee shops opening, cinematic realism with warm tones and dynamic camera motion.
Scene 1: Close-up under warm sunrise light, organizing stacks of newspapers beside a city kiosk, subtle steam from coffee carts, detailed facial focus, cinematic depth of field.
Scene 2: Riding bicycle through busy Manhattan-style streets, sunlight streaks between buildings, taxis and buses moving beside her, realistic urban energy, stabilized tracking shot.
Scene 3: Side-angle action shot delivering newspapers into apartment mailboxes and storefront stands, quick hand movement, shallow depth of field, cinematic motion blur.
Scene 4: Fast cycling through intersections during morning rush hour, pedestrians crossing, traffic lights glowing, dynamic drone shot following movement.
Scene 5: Portrait close-up while listening to music through headphones, confident focused expression, sunlight hitting hair and skin naturally, luxury fashion-commercial aesthetic.
Scene 6: Rear tracking shot riding through tree-lined avenues and modern city blocks, long shadows on pavement, realistic reflections, smooth cinematic camera movement.
Scene 7: Detailed close-up of newspapers sliding into a blue mailbox, realistic paper textures, warm sunlight highlights, premium commercial cinematography.
Scene 8: Wide cinematic ending shot of her cycling toward downtown skyline as the city fully wakes up, glowing golden-hour atmosphere, inspirational urban lifestyle ending, ultra realistic, 35mm cinematic photography, high detail, natural motion blur, smooth transitions, campaign film aesthetic.
Create a 15-second ultra-realistic cinematic influencer vlog video. Use the same consistent character throughout all shots.
Authentic iPhone 14 Pro TikTok vlog realism, handheld camera imperfections, front-camera selfies mixed with back-camera stadium footage, natural motion blur, autofocus breathing, slight rolling-shutter wobble, realistic HDR, and loud crowd audio distortion. 24 fps, 26 mm lens equivalent.
Scene 1 (0–3s) — Stadium Intro
Front-facing selfie outside Emirates Stadium before kickoff. Fans stream behind her.
She says: "Guys, I'm at Arsenal vs Burnley and the atmosphere is absolutely insane!"
Scene 2 (3–6s) — In the Stands
Selfie from her seat with the pitch glowing behind her.
She says: "We've got such a good view. Arsenal look really sharp already!"
Scene 3 (6–9s) — Match Action
Back camera shows Arsenal F.C. attacking Burnley F.C.
She shouts: "Come on Arsenal… shoot! SHOOT!"
Scene 4 (9–12s) — Goal Moment
Back camera captures Arsenal scoring.
She screams: "GOAL! Oh my God, I got it!"
Scene 5 (12–15s) — Celebration Selfie
Front-facing selfie as she jumps and laughs with celebrating fans behind her.
She says: "I actually caught the goal on camera! Best moment ever! Come on Arsenal!"
Final Tone: Raw, emotional, high-energy football vlog with authentic stadium atmosphere and a euphoric goal reaction.
Cinematic continuous side-scrolling sequence, 16:9, 15 seconds. One unbroken shot — the camera tracks smoothly from left to right without cutting, following a single person through their entire life as the environment transforms seamlessly around them. Opens in a hospital delivery room — a newborn baby crying, bright sterile lights, doctors and a mother. The baby crawls right and the world transforms. A colorful chaotic kindergarten — finger paintings on walls, blocks scattered everywhere, the child running and laughing loudly, falling over, getting back up. Keeps moving right — a school playground, kid sprinting full speed chasing friends, shouting, scraped knees, backpack bouncing. Right again — a teenage bedroom, loud music thumping, jumping on the bed, laughing with friends, energy everywhere. Right — a college campus, rushing between classes, coffee in hand, stressed, alive. Right — a first job, city streets, rushing through crowds, briefcase, purpose and chaos. Right — a wedding, spinning a partner, confetti exploding, crowd cheering. Right — a park with young children running around them, chaotic and beautiful. Right — a quieter home, slower pace, reading glasses, warm light. Right — very old now, walking slowly with a cane, white hair, soft smile, grandchildren pulling at their hands. The environment dims gently, pace slows, the figure sits in a chair by a window, golden light, closes their eyes. Sound design matches every stage — hospital beeps, children screaming and laughing, playground chaos, thumping music, crowd cheering, city noise, birds, silence. Photorealistic, IMAX cinematic quality, ultra sharp, vivid colors throughout each life stage, seamless environmental transitions, deeply emotional and cinematic.
Style: Hyper-realistic live TV news broadcast, authentic breaking-news cinematography, handheld broadcast camera, shallow depth of field, realistic emergency lighting, compression artifacts, imperfect live framing, subtle zoom drift, real television color grading.
Duration: 12 seconds
Aspect Ratio: 16:9
IMPORTANT:
The scene must look EXACTLY like a real live news interview broadcast.
Characters:
— emotional elderly woman with messy gray hair and scarf
— holding a fluffy gray British Shorthair cat
— professional field reporter holding microphone
Environment:
— apartment fire aftermath at night
— flashing firetruck lights
— firefighters and smoke in background
— wet streets reflecting emergency lights
Broadcast details:
— LIVE overlay
— BREAKING NEWS lower thirds
— scrolling ticker
— news channel watermark
— timestamp graphics
— realistic TV compression and interlacing
IMPORTANT CAMERA DIRECTION:
The camera begins as a medium interview shot.
Throughout the interview, the camera VERY SLOWLY and SUBTLY zooms closer toward the cat's face without drawing attention to it.
By the end, the cat dominates the frame.
[00:00-00:02]
Live interview begins. The elderly woman looks emotional and shaken while tightly holding the cat.
Reporter asks:
"Ma'am, can you tell us how the fire started?"
Emergency lights flicker realistically across their faces.
[00:02-00:05]
The woman answers emotionally:
"I smelled smoke first… then the kitchen suddenly exploded into flames… it spread so fast…"
The camera slowly inches closer toward the cat while maintaining natural live-broadcast framing.
[00:05-00:07]
Reporter asks:
"And how did you manage to escape?"
The cat looks around calmly with huge eyes while distant sirens echo.
[00:07-00:10]
The woman strokes the cat gently and says:
"My little Murmur woke me up… if not for him, I don't think I'd be standing here…"
The zoom continues subtly. The cat now fills much more of the frame.
[00:10-00:12]
The camera is now very close to the cat's face.
The cat slowly turns toward the lens with intense eye
Audio:
Realistic live-news ambience, distant sirens, radio chatter, crowd noise, crackling fire sounds, emotional interview dialogue, subtle microphone handling noise.
Negative prompts:
No cinematic movie look, no perfect framing, no dramatic soundtrack, no fake acting, no cartoon cat behavior, no static camera, no subtitles, no exaggerated horror effects.
REALISTIC CINEMATIC 15s PROMPT — Japanese Wing Chun Martial Artist
Ultra-realistic cinematic martial arts sequence.
A 35-year-old Japanese martial artist trains intensely with a traditional wooden Wing Chun dummy inside a Japanese outdoor courtyard.
Lean athletic build, sharp focused eyes, defined jawline, short slightly messy black hair, subtle facial stubble, calm disciplined expression, realistic skin texture with sweat and natural imperfections. Wearing a fitted black training shirt and loose dark martial arts pants.
Environment: traditional Japanese courtyard with textured stone pavement, bamboo plants, wooden architecture, paper sliding doors, atmospheric dust particles, soft daylight, cinematic shadows, subtle wind movement, shallow depth of field, grounded realism.
Style: high-end live-action realism inspired by modern Japanese and Hong Kong martial arts cinema. Physically accurate movement, realistic muscle tension, cinematic contrast, subtle film grain, immersive atmosphere, dynamic camera movement, authentic choreography. No anime, no CGI look, no digital painting, no commercial aesthetic.
Sequence progression:
— Hard-cut opening, close 35mm follow shot. The martial artist is already moving rapidly through continuous Wing Chun trapping hands, chain punches, compact strikes, and quick front kicks against the wooden dummy. Fast but controlled footwork across stone pavement. Realistic sweat and fabric movement.
— Match-cut to 50mm lateral tracking shot. Smooth transitions between straight punches, slapping hands, elbow strikes, turning techniques, and compact leg attacks. Strong wooden recoil and realistic impact feedback.
— Close-up orbit shot on focused eyes and breathing. Hands flow continuously between offense and defense. Sunlight catches sweat on the face while strikes create subtle vibration through the wooden dummy.
— Full-body 28mm push-in shot. Efficient footwork, chain punches, elbows, turning movements, coordinated kicks. Bamboo leaves rustle softly while dust reacts naturally to movement.
— Extreme close-up handheld realism. Rapid arm transitions between blocks and strikes. One powerful straight punch lands heavily on the wooden dummy with realistic muscle tension and sleeve movement. Dust shakes loose from impact.
— Smooth mirror reflection transition back to frontal medium shot. Continuous flowing Wing Chun combinations with calm precision. Hair and clothing react naturally during movement.
— Final cinematic shot, slow push-in then static hold. The wooden dummy sways slightly after the final strike. Dust settles through sunlight. Bamboo movement slowly stops. The martial artist raises his eyes toward camera with calm confidence, holding a traditional Wing Chun stance in silence.
Sound design: fabric friction, realistic wooden impacts, air swishes, breathing, stone footwork, subtle ambient wind, bamboo rustling.
15 seconds, cinematic pacing, grounded realism, highly detailed live-action martial arts choreography, immersive camera movement, authentic physical performance.
Create a 15-second ultra-realistic cinematic video of a young professional girl driving home from the office in a modern car during heavy rain. The scene is set at dusk with a moody, beautiful atmosphere. The sky is dramatic with dark storm clouds and occasional flashes of lightning illuminating the clouds.
Start with a wide shot of a rain-soaked highway reflecting neon city lights and headlights on the wet road. Transition to inside the car: the girl is calm and focused, wearing office attire, one hand on the steering wheel, soft ambient light from the dashboard illuminating her face.
Raindrops continuously slide across the windshield as the wipers move rhythmically. Lightning briefly lights up the sky, creating dramatic reflections on the glass. The sound of rain and distant thunder enhances the mood.
Camera slowly shifts between exterior cinematic tracking shots of the car moving through the rainy road and intimate close-ups inside the car. The final shot shows her approaching home in the distance with warm lights visible, contrasting the stormy sky.
Style: ultra-realistic, cinematic lighting, 4K detail, shallow depth of field, smooth camera motion, emotional and peaceful mood.
CRITICAL INSTRUCTION: The reference image contains a 9-step chronological cooking storyboard. Animate the chef seamlessly through these exact 9 steps in order. Start at Step 1 (Flour Well), flow into Step 2 (Crack Eggs), then Step 3 (Mix). Continue the chronological progression through Kneading, Resting, Rolling, Cutting, and Boiling, finishing perfectly on the final plated dish (Step 9). Prioritize the strict sequence of actions.
No music. No subtitle.
Location: Rural Persian kitchen
15 seconds, 16:9, realistic, cinematic, tasty, natural camera movement.
Create a 16:9 ultra-realistic cinematic tennis lifestyle video ad with fast-paced editing and luxury beauty aesthetics.
The video begins with a woman entering the frame in a cream-colored casual dress with a subtle sporty presence. She transitions into a tennis-themed narrative.
Cut to a close-up of hands opening a light pink duffel bag, revealing realistic packaged pastries (Renoise Canva-style croissants in authentic bakery packaging, abundant and neatly arranged) along with a rolled pink fabric.
She then puts on a white visor with a pink ribbon, and her outfit transitions into a pink tank top and leggings tennis look.
On a tennis court, she holds a racket, shifting from relaxed poses to athletic readiness. She performs quick tennis practice movements (2–3 fast strokes) with a green tennis ball, in hurried, energetic motion.
The camera cuts to her looking upward as if tracking a ball, then leaning on the net in a grounded, slightly exhausted but stylish pose.
She quickly rests, opens her duffel bag again, takes out one packaged croissant (Renoise Canva realistic packaging style), eats it joyfully, wipes her lips, showing her acrylic nails clearly, then immediately picks up the tennis ball again to continue playing.
MUSIC: “Gimme More Jersey Club” plays throughout, synced tightly with cuts, transitions, and motion beats.
STYLE: Ultra-realistic, cinematic lighting, handheld sporty camera feel, luxury athletic fashion commercial, fast rhythmic editing, shallow depth of field, natural skin tones, premium ad aesthetics.
Create a hyper-realistic 15-second 4K cinematic football broadcast video using the uploaded reference image as the exact facial identity of the main player. Maintain perfect identity consistency in every frame — identical face shape, hairstyle, eyes, skin texture, beard details, proportions, and expressions from the reference image. Do not stylize or alter the person's appearance.
Scene setup:
Night-time FA Cup final atmosphere inside a massive sold-out stadium under powerful white floodlights. Authentic live sports broadcast presentation inspired by modern ESPN+ football coverage. Intense crowd ambience with chants, drums, whistles, and roaring supporters echoing across the arena.
On-screen scoreboard overlay:
"MNC 1 - 1 CHE | 88:34"
Opening sequence:
Low-angle sideline tracking shot following the reference player sprinting forward with the ball during a tense late-game attack. Manchester City players wear bright sky-blue kits with realistic fabric physics and sweat marks. Chelsea defenders in dark-blue kits attempt to close down space aggressively. Camera movement feels like a real shoulder-mounted broadcast sports camera with subtle shake, autofocus breathing, and realistic motion blur.
Build-up:
Fast tactical passing around the edge of the penalty area. Crowd noise rises dramatically with every touch. Cleats scraping grass, players shouting instructions, and heavy breathing audible in the mix. The reference player receives a sharp through-ball, takes one controlled touch, then cuts inside past a defender.
At exactly 9 seconds:
The player unleashes a powerful curling strike into the top corner of the goal. Net explodes backward realistically. Stadium erupts instantly with deafening cheers. Camera shakes slightly from crowd vibration.
Commentator audio — Professor:
"Unbelievable! Manchester City have produced magic in the dying moments!"
Celebration sequence:
Instant cinematic zoom-in on the reference player sliding across the wet grass in pure emotion. Face glowing under floodlights, sweat and rain droplets visible in extreme detail. Huge emotional smile, shouting triumphantly, eyes filled with adrenaline and disbelief. Teammates sprint toward the player from behind while fans wave scarves wildly in the blurred background.
Visual style:
Ultra-realistic sports cinematography, dramatic depth of field, realistic skin pores and facial textures, vibrant stadium colors, high-contrast floodlighting, authentic TV broadcast overlays, premium football documentary feel, immersive crowd audio, cinematic realism, 24fps, true-to-life motion physics, elite sports production quality.
Cinematic photorealistic sequence set in a hazy, war-torn urban street with damaged concrete buildings and muted overcast lighting. Two young male soldiers wearing tan military uniforms stand facing each other holding realistic dark rifles. In a seamless surreal transformation, one rifle smoothly morphs into a bright colorful plastic water gun with a yellow tank and green barrel.
Close-up shot of the second soldier wiping a drop of water from his face in disbelief before breaking into genuine laughter. The tense atmosphere shifts into playful childlike joy as both soldiers laugh hysterically and spray each other with oversized colorful water guns, water streams arching through the air.
The environment gradually transforms from a grim battlefield into a brighter nostalgic atmosphere while retaining cinematic realism. Muted gray tones at the beginning contrast with the vibrant neon colors of the water guns.
Shot with cinematic 35mm lens aesthetics, shallow depth of field, realistic textures, emotional facial expressions, dynamic camera movement, soft misty atmosphere, high fidelity lighting, ultra-detailed photorealism, 8K cinematic quality, 16:9 ratio.
Mood transition: serious and tense → joyful and playful.
Negative Prompt: gore, blood, violence, explosions, distorted anatomy, cartoon style, low quality, blurry faces, unrealistic expressions.
35mm and anamorphic lenses, high-contrast midday lighting, vibrant cinematic action color grade. Immersive spatial audio with high-fidelity sound design.
[IMAGE REFERENCES / LEGEND]
@ini: Exact starting frame and main character reference. Maintain identical facial features, crimped/braided hairstyle, black tank top, jewelry, and "BAKU-CRUNCH" holographic snack bag design across all shots.
@story: the sequence to implement.
[TIMELINE SECOND BY SECOND]
0-2.5s: [Medium Shot] Exact match to @ini. 35mm lens, slightly low angle, static camera. Main character holds the BAKU-CRUNCH bag confidently, looking directly into the camera. [SFX: Tokyo street ambience, subtle dramatic drone]
2.5-5s: [Extreme Close-Up] 100mm macro lens, front angle, rapid camera push-in to her mouth biting into a textured chip. High-impact crunch animation. [SFX: Crisp, bone-shattering crunch sound effect with stereo echo]
5-7.5s: [Profile Long Shot] 28mm lens, profile angle panning right. A powerful sonic shockwave from the crunch shatters nearby glass building facades and sends stylized debris flying through the air. [SFX: Glass exploding, deep structural rumble]
7.5-10s: [Worm's-Eye Wide Shot] 18mm lens, low angle tilting up. Street framing warps slightly from the kinetic energy; character looks around with a cool, playful expression of surprise. [SFX: Whoosh, building distortion sounds]
10-12.5s: [Close-Up] 50mm lens, slight low angle pushing in. Character looks forward, holds a single chip up, and smiles mischievously. [SFX: Rising electronic bass sweep]
12.5-15s: [Wide Shot Climax] 24mm lens, slight low angle with a smooth dolly-back movement. Character walks confidently toward the camera in slow motion as parked cars flip and fire hydrants burst into massive water plumes behind her, then finally shows the snack bag. [SFX: cinematic explosion]
[STYLE & QUALITY BOOSTERS]
Photorealistic 8K, ultra-detailed textures, cinematic commercial lighting, perfect fluid motion blur, flawless character consistency, movie-level physics.
Hyper-realistic handheld mobile phone video, slightly shaky documentary style, natural daylight. A regular 30-year-old woman with messy blonde hair in a loose bun, wearing a slightly worn black strappy summer dress, small tattoo on her upper arm, and a thin necklace, is in a dirty, unkempt garage area. Overgrown grass and weeds everywhere, dusty concrete floor, old tires stacked with dirty water collected inside them.
She bends down behind the tires, reaches in, and excitedly pulls out a massive, baby anaconda-sized realistic centipede (dark brown, hundreds of legs, hairy texture, very detailed and lifelike). The giant centipede immediately wraps around her neck and shoulders. She smiles widely, holds it up to the camera and says cheerfully, "Have you seen this? This is the guy I was talking about!"
She gently strokes the centipede and talks to it affectionately, "How have you been, little one?" She then reaches for a half-drunk beer bottle sitting on a rusty table nearby, brings it to the centipede's head and tilts it so the centipede appears to "drink." Laughing softly she says, "You must have been thirsty… how many days since we last met?"
Natural sweat on her skin, imperfect lighting, real dirt and grime, completely unpolished and authentic backyard/garage feel.
Ultra-realistic live Premier League TV broadcast screenshot captured during a packed 2025/2026 Arsenal home match at Emirates Stadium at night. Identity preserved exactly from the reference image. A young woman sits naturally among the crowd wearing the official 2025/2026 Arsenal home jersey with jeans and relaxed match-day styling. The jersey must look like a normal authentic football jersey — not cropped, not tied, not rolled up.
She is caught unexpectedly by the live stadium camera with a surprised emotional reaction, slightly leaning back in her seat, lips slightly parted, eyes focused toward the pitch. The moment should feel accidental and authentic like a real viral football broadcast moment.
The crowd around her reacts intensely:
fans standing up cheering, people screaming in disbelief, hands on heads after a dramatic goal moment, phones recording, scarves waving in the air, emotional reactions everywhere. Keep the crowd realistic and mixed — some Arsenal jerseys, some hoodies, jackets, neutral outfits, casual football fashion.
Visual atmosphere:
bright floodlights, glowing stadium haze, realistic rain mist, LED advertisement boards, subtle TV broadcast compression, natural digital noise, authentic scoreboard graphics in top corner, imperfect live-TV framing, realistic skin texture, cinematic crowd depth.
Camera Style:
long-range telephoto sports broadcast camera from far away in the stadium stands, realistic zoom compression, shallow depth of field focused on the woman while emotional fans remain visible behind her. NOT selfie, NOT portrait orientation, NOT close-up beauty shot.
OUTPUT:
Horizontal 16:9 broadcast frame, ultra-realistic televised football atmosphere, documentary sports realism, authentic Premier League crowd shot, 4K TV broadcast quality.
VIDEO PROMPT:
Hyper-realistic live sports broadcast video during a dramatic Arsenal night match at Emirates Stadium. A young woman wearing the official 2025/2026 Arsenal jersey sits naturally among thousands of fans when the stadium camera suddenly zooms toward her after a shocking moment on the pitch.
Nearby fans explode emotionally:
people jump from seats, scream, wave scarves, celebrate wildly, hold their heads in disbelief, laugh, point toward the camera, and film the moment on phones like a real viral football broadcast clip.
Camera Movement:
realistic live-TV telephoto zoom-in from far away, subtle handheld sports-camera shake, authentic focus breathing, natural motion blur, rapid broadcast reframing during crowd reactions.
Atmosphere:
stadium chants echoing loudly, floodlights cutting through rain mist, glowing scoreboard graphics, cinematic smoke haze, realistic crowd movement, emotional football energy.
Style:
ultra-realistic sports broadcast realism, documentary football cinematography, authentic televised Premier League crowd reaction, cinematic live-TV atmosphere, realistic facial expressions, broadcast imperfections, emotional viral sports moment, 4K horizontal
Use the attached THE PANCAKE DAD storyboard image as the exact visual reference.
Create a 12-second 16:9 animated pancake-making sequence that follows the 8-shot storyboard exactly. Preserve the same Pixar-style dad, casual blue top, messy morning hair, warm bright home kitchen, child’s drawing on the fridge, and golden Saturday morning light throughout.
Rules:
•Follow the sequence exactly from 1 to 8
•One shot per panel, approximately 1.5 seconds each
•No skipped steps, no extra characters beyond the storyboard
•Maintain dad and girl character continuity throughout
•Emphasize the flour cloud opener, the egg yolk drop, the mid-air flip, the syrup cascade, and the girl’s first bite reaction
Shot sequence:
1.Dad stands at the kitchen counter, pours large flour bag into the mixing bowl — generous white flour cloud billows up into warm morning sunlight, he laughs and waves it away from his face. Wide shot, Saturday morning energy established instantly.
2.Extreme close-up — dad cracks fresh egg over the bowl already containing flour and milk, golden yolk drops in slow motion into the batter. Shell splits cleanly. The moment it all comes together.
3.Medium shot with dad fully visible — he whisks the batter in smooth circular motions, mixture becoming smooth and pale yellow, tiny bubbles forming. Morning light catching the whisk. Relaxed and unhurried.
4.Close-up side angle — batter poured from bowl onto hot buttered pan, spreads into a perfect circle, edges immediately setting, tiny bubbles forming across the surface. Sizzle on contact.
5.Low angle dramatic — spatula slides under the pancake, dad flips it with one confident motion, pancake suspended perfectly mid-air, golden underside fully revealed, dad’s face lit up with pure joy behind it. The hero frame.
6.Wide medium with dad visible — he slides the finished pancake onto a plate already holding two others, perfect golden stack building, steam rising from each layer. Dad looks genuinely proud.
7.Beauty close-up — maple syrup poured from above in a slow golden arc over the stack, cascading down the sides, pooling at the base, fresh blueberries on top and around the plate. Liquid gold catching the morning light.
8.Wide warm shot — small girl in bright pink t-shirt sits at the kitchen table, eyes wide, mouth open, fork mid-bite into the stack, blueberries on the plate, syrup everywhere. Dad stands in the background arms crossed, warm proud smile. Saturday morning complete.
Camera:
•Wide shot for the flour cloud opener — flour billowing into morning light
•Extreme close-up slow motion for the egg crack yolk drop
•Medium shot with dad fully visible for the stir
•Close-up side angle for the batter pour and sizzle
•Low angle dramatic for the flip — pancake perfectly mid-air
•Wide medium for the stack with dad visible and proud
•Beauty close-up for the syrup cascade
•Wide warm shot for the girl eating with dad in background
Style:
•Warm golden Saturday morning light throughout
•Fluffy pancake golds, rich amber maple syrup, fresh blueberry blues, bright pink t-shirt, creamy batter whites
•Pixar CGI vivid expressive animation — warm, bright, joyful energy throughout
•The girl’s reaction in panel 8 must be fully Pixar expressive — eyes wide, mouth open, pure uncontained joy
•Smooth warm cuts matching the relaxed Saturday morning energy
Sound design:
Warm upbeat acoustic morning music — light guitar or piano — plays from the very first frame to the very last. Continuous, uninterrupted, same track throughout all 8 shots without any breaks or silence. Underneath the music, natural kitchen sounds layered in softly: flour pouring and cloud whoosh, dad laughing, egg cracking, whisk on bowl, batter sizzling as it hits the pan, spatula sliding under the pancake, the whoosh of the flip, pancake landing back in pan, stack being built, syrup pouring slowly, fork hitting ceramic plate, girl’s delighted gasp.
A white semi-truck hauling a large cargo container barrels down a sunlit highway, kicking up dust. The camera tracks it from a low front angle as it veers and collides violently with another vehicle, sending debris flying. Bright daylight, action-movie car chase energy, dramatic crash physics, wide telephoto lens compression.
Ultra-realistic FIFA World Cup Final live broadcast at a massive floodlit stadium at night. Use the uploaded image ONLY as inspiration for stadium atmosphere, crowd density, sports broadcast framing, cinematic lighting, and telephoto camera style.
Visual style: authentic FIFA World Cup television broadcast with live scorebug, running match clock, LIVE indicator, sports ticker, subtle TV grain, slight chromatic aberration, HD compression artifacts, LED stadium reflections, realistic crowd atmosphere. Multiple 400mm–600mm telephoto broadcast cameras with extreme telephoto compression, creamy cinematic bokeh, flattened background depth, razor-thin depth of field, slow telephoto push-ins, and realistic handheld sports-broadcast motion.
0:00–0:04 — The woman sits beside her boyfriend near the front-row pitch barrier, laughing and talking naturally while stadium chants echo around them. Wind softly moves her balayage lob haircut beneath the floodlights. Crowd heavily compressed into creamy telephoto bokeh.
0:04–0:07 — A football suddenly deflects toward the crowd at high speed. Nearby fans react in shock. She instantly notices the incoming ball.
0:07–0:10 — She catches the football smoothly while seated near the barrier. Her boyfriend reacts with disbelief and excitement. Crowd gasps while commentators shout realistically: “WHAT A CATCH!”
0:10–0:15 — She confidently climbs over the advertising barrier and lands elegantly onto the pitch. Broadcast cameras quickly follow her movement like a real live FIFA production. She begins lightly dribbling as crowd intensity rises.
0:15–0:19 — She dribbles forward with realistic professional football control. A defender rushes toward her. She performs a sleek body feint and smoothly dodges him while grass particles rise naturally beneath her cleats.
0:19–0:23 — Dynamic live broadcast cuts as she performs realistic football skills: step-over, nutmeg, heel-tap direction change, quick acceleration burst. Players react with surprise while realism remains grounded.
0:23–0:27 — She enters the penalty area. Goalkeeper charges forward. She fakes a shot, shifts left elegantly, and curls the football beautifully into the top corner. Net ripples realistically as the stadium explodes.
0:27–0:30 — Heroic broadcast close-up. Entire stadium erupts. Her boyfriend celebrates wildly from the stands. She laughs breathlessly near the goal while cameras surround her under dramatic floodlights. Final slow telephoto push-in toward her smiling face.
Audio throughout: realistic football crowd chants, whistles, commentators, player shouts, cleats on grass, football impact sounds, natural British dialogue, crowd explosions after the goal, authentic live sports broadcast energy.
Expression transition from cold to shy smile, slow and natural, 8 seconds. Start: The young woman sits courtside, poker-faced, cold and aloof expression, staring straight ahead, completely unaware of the camera. No smile, very calm. Middle: She slowly turns her head toward the camera. Her eyes shift softly. Her facial expression begins to soften. End: She notices the camera. Her cold expression breaks into a shy, subtle, gentle smile. Lips slightly parted. A natural blush appears on her cheeks. She makes brief eye contact with the camera, then looks down shyly before glancing back up with a soft smile. Camera: Static broadcast camera, slight depth of field, ESPN live TV texture. Style: Ultra-realistic, natural skin texture, authentic human micro-expressions, organic facial muscle movement, no AI smoothing, no morphing, no flickering, no distortion. Live arena lighting, broadcast color grading, subtle compression artifacts. 16:9. Negative prompt: Exaggerated smile, laughing, open mouth wide, crying, anger, robotic movement, CGI, cartoon, overacting, unnatural blinking, face swapping, glitching.
First frame must match the @Image 2. Create a realistic live PSL (Pakistan Super League) broadcast crowd cutaway during an intense Karachi Kings vs Lahore Qalandars playoff match inside a packed cricket stadium in Pakistan.
The subject must look exactly like the @Image 1: same face, hairstyle, black t-shirt, facial features, lighting, seating position, and stadium crowd environment. Preserve identity strongly throughout.
The subject is a female named Fatima.
Karachi Kings are batting. The atmosphere is intense. The video should feel like a real live TV sports broadcast, not cinematic. Use authentic PSL broadcast framing, telephoto stadium camera feel, floodlights, slight TV grain, compression artifacts, energetic Pakistani crowd movement, and realistic live match atmosphere.
The camera should be dynamic with broadcast-style zooms, pans, reframing, quick reactions, and natural shot changes.
Timeline:
0–1 second:
Start with a wide live-broadcast stadium shot showing Karachi Kings batting against Lahore Qalandars. The stadium is packed, floodlights are glowing, fans are cheering, and the match tension is high.
1–3 seconds:
Cut to the pitch as Babar Azam hits a powerful six. Show the ball flying high into the night sky toward the stands. The crowd instantly rises with excitement.
3–5 seconds:
Cut to the crowd section where Fatima (from @Image 1) is seated. The cricket ball comes toward her area. She reacts with shock and excitement, jumps up excitedly, and catches the ball with both hands. Nearby female fans jump, cheer, clap, and point at her. Surrounding crowd in her section is mostly girls celebrating and reacting together.
5–8 seconds:
The broadcast camera quickly zooms in on her. Fatima looks surprised and extremely happy, holding the cricket ball proudly in her hand. She laughs naturally and shows the ball to her female friends beside her. Keep the reaction realistic, not overacted.
8–11 seconds:
Fatima takes out her phone and takes a quick selfie with the cricket ball in her hand. She smiles excitedly while a group of nearby girls lean in and cheer behind her. No boys are present in her immediate selfie circle. The moment should feel like a real spontaneous PSL crowd moment.
11–13 seconds:
Cut to the giant stadium big screen/Jumbotron showing her live with the cricket ball and phone. The crowd gets louder after seeing her on the big screen. Add a slight broadcast-style zoom into the big screen.
13–15 seconds:
End with a wide stadium shot showing the packed crowd, waving flags, bright floodlights, and the electric PSL playoff atmosphere. Fatima is still visible in the crowd area celebrating with the ball among cheering female fans.
Broadcast Graphics:
Add authentic PSL-style broadcast graphics and keep them consistent:
Karachi Kings vs Lahore Qalandars playoff scorebug
PTV Sports / A Sports-style logo watermark
PSL Playoffs branding top left
Bottom score banner in live broadcast style
Karachi Kings batting context
No unrealistic animated graphics
Add lower-third early when the camera focuses on her: "Fatima - Gen AI Artist" (appears like a real broadcast identifier).
Audio / Commentary:
Add loud realistic PSL crowd ambience throughout.
Use two Pakistani male commentators speaking in a live sports broadcast tone.
Commentary lines:
"What a six from Babar Azam!"
"Fatima has caught the ball!"
"What a moment in the stands tonight!"
"She's taking a selfie with the match ball — the crowd is loving it!"
Important Constraints:
Preserve Fatima's identity strongly from @Image 1
Keep same face, hairstyle, black t-shirt, lighting, and crowd environment
Make the ball catch realistic, not overdramatic
No cinematic movie look — must feel like real PSL broadcast
No overacting or influencer-style posing
End with a wide stadium shot
Surrounding selfie group must be only girls
A cinematic 15-second video of a peaceful morning routine. Scene opens with soft golden sunlight entering a cozy, beautifully decorated bedroom through sheer curtains, gentle breeze moving them. A young person slowly wakes up, stretching comfortably in bed with a calm expression. Cut to a window view of a serene landscape with fresh morning air, trees gently swaying, birds flying. The subject walks to the window, enjoying the peaceful view. Smooth transition to getting ready — washing face, light grooming, natural morning routine. Final scene shows them sitting by the window holding a warm cup of tea, steam rising softly, relaxing in the calm atmosphere. Warm tones, soft lighting, dreamy aesthetic, natural sounds, cinematic mood, ultra-realistic, high detail.
Ultra realistic 10 second cinematic video in the style of a modern parkour advertisement with blockbuster movie quality. A young man with a face similar to the reference photo, slightly long wavy black hair, thin mustache and goatee, wearing simple black and white clothing: an oversized black jacket, plain white t-shirt, loose black pants, and black and white sneakers.
The scene begins on a Japanese residential street during daytime with a bright sky, electric cables, small houses, and a realistic urban atmosphere. A low angle camera close to the asphalt focuses on a Le Minerale bottle in the middle of the road.
The young man walks casually toward the camera with a calm and confident expression. After that, he starts running quickly toward the Le Minerale bottle. With realistic parkour movement, the young man jumps and balances standing on top of the Le Minerale bottle cap for a moment. The camera moves dramatically with cinematic handheld movement and natural motion blur.
The young man then performs a backflip in the air with realistic rotation and a short slow motion effect. After that, he lands on the street with a hero landing and small realistic dust effects. The young man immediately grabs the Le Minerale bottle in a cool stylish way while giving a slight smile to the camera.
At the end of the scene, the young man walks away while drinking Le Minerale with a relaxed and cinematic atmosphere. The camera follows from behind until fade out.
Ultra realistic style, realistic human motion, cinematic lighting, dynamic camera movement, highly detailed skin texture, realistic clothing physics, natural sunlight, dramatic perspective, smooth parkour animation, realistic shadows, depth of field, anamorphic lens look, commercial advertisement aesthetic, 4K, highly detailed, realistic environment, smooth motion, cinematic color grading.
Use the uploaded reference image as the primary identity anchor. Preserve the exact adult Japanese woman's face, pale smooth skin, large expressive eyes, glossy lips, delicate nose, and long curly blonde hair with soft bangs.
Ultra-realistic AI-generated KBO baseball night-game broadcast scene. She sits beside her boyfriend in the crowd wearing casual baseball jerseys among cheering fans. Natural candid behavior only — smiling, laughing softly, watching the game, holding yellow cheering sticks and lightly tapping them together.
The boyfriend briefly leans in and they share a subtle 1-second kiss. Immediately after, she notices the stadium camera focused on them on the giant screen. She becomes shy and slightly embarrassed, then gives a soft cute smile toward the camera while trying not to laugh.
Style: authentic Korean baseball TV broadcast, telephoto sports lens, shallow depth of field, slight handheld motion, realistic stadium lighting, subtle motion blur, natural skin texture, imperfect framing, live sports atmosphere, 16:9 composition, genuine candid crowd energy.
Create an ultra-realistic full-body studio fashion portrait of me. Preserve my exact facial identity, skin tone, facial proportions, hairstyle, body proportions, and recognizable beauty. Show me wearing a glossy red patent leather trench coat with a belted waist and matching red pointed heels. I am standing in a clean white seamless studio, shown in full body, leaning elegantly against a giant glossy red high-heel shoe prop that is the same height as me. The shoe should be tall and dramatic but still proportionate, with a sleek pointed toe, ultra-glossy patent finish, and a stiletto heel ending in a devil-trident shape. Pose me slightly behind and beside the shoe, with one hand resting near the top edge and the other hand touching the side, facing the camera with a calm, confident editorial expression. Style my hair in a soft elegant updo with loose face-framing strands, and give me soft glam makeup with glowing skin, defined lashes, and nude glossy lips. Use soft diffused studio lighting, subtle floor shadows, crisp reflections on the patent surfaces, and a polished luxury fashion campaign look. Canon EOS R5, 85mm lens, f/1.8, ISO 100, 1/250s, 8K HDR RAW, ultra-realistic skin texture, sharp focus, realistic patent leather shine, premium editorial photography.
Negative Prompt: blurry, wrong identity, cropped body, half body, oversized shoe, tiny person, distorted face, bad anatomy, extra fingers, deformed hands, warped shoe shape, low quality, cartoon, plastic skin, text, logo, watermark.
Create an ultra-realistic full-body studio fashion portrait of me. Preserve my exact facial identity, skin tone, facial proportions, hairstyle, body proportions, and recognizable beauty. Show me standing confidently in front of a giant glossy red high-heel shoe prop that is the same height as me, with the shoe positioned behind me and slightly to the side. The shoe should have a sleek pointed upper, ultra-glossy patent leather finish, tall thin heel, and a devil-trident-shaped heel base.
Dress me in a fitted deep red couture gown with a structured corset bodice, wide shoulder straps, sculpted bust detail, defined waist, vertical seam lines, and a long floor-length mermaid silhouette. Pose me full-body facing the camera with a calm powerful editorial expression, arms relaxed naturally at my sides, standing tall and elegant while the giant red heel frames me dramatically from behind. Style my hair in a soft messy elegant updo with loose face-framing strands, with soft glam makeup, glowing skin, defined lashes, sculpted cheeks, and nude glossy lips.
Use a clean white seamless studio background, soft diffused lighting, realistic floor shadows, glossy reflections on the red shoe, detailed fabric texture, and a polished luxury fashion campaign look. Canon EOS R5, 85mm lens, f/1.8, ISO 100, 1/250s, 8K HDR RAW, ultra-realistic skin texture, sharp focus, realistic red fabric, premium editorial photography.
Negative Prompt: blurry, wrong identity, cropped body, half body, tiny person, shoe much taller than subject, wrong scale, distorted face, bad anatomy, extra fingers, deformed hands, warped shoe, stiff pose, cartoon, plastic skin, text, logo, watermark.
A hyper-athletic urban explorer with shaved sides, sweat-soaked hair, and lightweight tactical running gear covered in dust and grease stains.
Gigantic abandoned underground transit tunnels beneath a megacity. Endless maintenance corridors, flooded rail systems, giant ventilation fans, flickering emergency lights, electrical arcs, and collapsing concrete passageways.
Dark pulsating techno with aggressive low-end rhythm. Echoing footsteps, dripping water, electrical crackling, metallic reverberation, and heavy breathing dominate the audio.
Ultra-realistic chase cinematography mixed with dystopian thriller energy and underground documentary realism.
SHOT 1:
Ground-level tracking shot sprinting inches above flooded rails while tunnel lights streak overhead.
SHOT 2:
Whip pan into a dangerous slide beneath a closing blast door.
SHOT 3:
Wide shot revealing gigantic tunnel scale while maintenance trains race past dangerously close.
SHOT 4:
Long-lens compressed shot capturing a massive leap across an electrified rail gap.
SHOT 5:
Rotating handheld tracking shot around unstable scaffolding traversal above deep tunnel shafts.
SHOT 6:
Final impossible sprint through collapsing maintenance corridors as sparks rain from the ceiling. The runner bursts into daylight covered in dust and steam while distant sirens echo behind them. Music cuts instantly.
Create a cinematic text-to-video scene featuring an original non-copyrighted moment: a librarian in a small town in a contested border region has, for thirty years, maintained the only library that serves both communities. She has done it quietly, without political statement, by simply never closing to anyone. On the day a new checkpoint goes up on the road outside, she opens the library two hours early and props the door wide. The mood is quietly radical, stubbornly ordinary, deeply principled, and human in the most specific possible way, with an observational drama feeling. She arrives before dawn. The checkpoint construction began last night: concrete barriers, temporary fencing, official vehicles. She can see it from the library steps. She unlocks the door, goes inside, turns on the lights, makes tea in the back room, and props the front door open with the same brick she has used for thirty years.
The first patron arrives, a farmer from the eastern community who has been coming every Thursday for twenty years. He nods. She hands him the book she has been holding for him. The second patron arrives from the western side, a teenage girl with a school bag. She comes in without looking at the checkpoint. She is here for the same reasons she is always here.
Visual tone: hyper-realistic observational quality, small-town library interior worn but loved, every surface evidence of use and care, morning light arriving through windows as the day opens, premium detail in books, in the librarian's hands, in the faces of patrons for whom this is simply routine, the checkpoint visible through the window as background fact never made foreground drama.
Camera language: pre-dawn library exterior, door unlocking close-up, interior lights coming on wide shot, tea-making in back room, door propping with brick close-up, librarian standing on steps looking at the checkpoint with her face making no speech, first patron arrival wide shot, book handoff close-up, teenage girl arrival, library interior with both patrons and librarian on an ordinary morning, checkpoint through window as soft-focus background, wide shot of open door with light coming in.
Include: morning quiet, the specific sounds of a library opening (lights clicking on, a kettle, a door propped open, pages), and the absence of any sound that announces what this woman is doing and why it matters.
Use the attached photo as the exact facial reference. Create an ultra realistic 16:9 Argentina football broadcast crowd shot during a summer evening match at Camp Nou. Preserve the original face and hairstyle naturally without over-editing. Wide shot from an eye-level spectator perspective using a super telephoto broadcast zoom lens, then zoom in on her face. The person is sitting comfortably, naturally watching the match.
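Generate a story about a Chinese gymnast performing an extremely difficult skill in the balance beam final at the Paris Olympics. Precisely capture the gymnast executing an extremely complex, physically plausible sequence of flips on the beam. After the opening stance in the first frame, use a Body Mount shot to rapidly follow the gymnast as she completes a high-difficulty combination on the beam: a side aerial connected to a triple tucked front somersault. Then use an ultra-slow-motion close-up to capture her extreme body control in the air, the movement of her white hair, and the details of her team uniform, ensuring gravity and the laws of physics are rendered realistically. She finally sticks the landing, and the camera pushes in on her confident face. Cinematic realism, 2.35:1 widescreen. Add the cheering and thrilling ambient sound of the live Paris Olympics broadcast.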
Ultra-realistic live basketball broadcast still of a glamorous woman sitting courtside in a packed indoor arena during a night playoff game, wearing an elegant deep emerald off-shoulder silk dress and minimal silver hoop earrings, shoulder-length honey blonde hair styled in soft layered waves. She is casually eating loaded nachos with one hand while holding a clear plastic cup of sparkling soda in the other. Around her are passionate fans wearing bright red and white basketball jerseys, hoodies, and foam fingers, creating strong team-color contrast throughout the crowd. The scene feels candid and cinematic, captured mid-game from a professional TV broadcast camera angle with shallow depth of field. Include realistic arena seating, crowded audience atmosphere, LED ribbon boards, energetic spectators reacting to the game, broadcast overlay graphics in the top-left corner showing a live basketball score, quarter, and game timer, and a sports network watermark in the top-right. Natural indoor arena lighting, detailed skin texture, realistic reflections on fabric and drink cup, sharp focus on the woman, slightly blurred background crowd, authentic live sports broadcast aesthetic, ultra-detailed realism, 16:9 composition.
She begins standing still, facing camera with a calm confident expression. Slowly she raises one hand to lift her hair off her neck, then turns away from the camera revealing "RAPHINHA 11" printed in gold on the back of her Barcelona jersey. She gazes out over the stadium pitch. Her hair lifts and settles naturally in a light wind. The giant Barcelona tifo mosaic on the pitch fills the background. Camera holds a steady handheld medium shot with subtle organic sway. Overcast daylight, cool diffused light, cinematic 4K, realistic hair movement, authentic pre-match stadium energy.
GPT Image 2 prompt: Photorealistic photograph of a young woman with fair skin, medium-length straight dark brown hair, and natural minimal makeup standing in the lower stands of a large football stadium. She wears an FC Barcelona home jersey — navy blue and deep burgundy vertical split, gold Spotify logo on chest, FC Barcelona crest. Dark trousers. She faces the camera directly with a calm, confident, composed expression. Overcast daylight, soft diffused natural light on her face. Behind her: open-air stadium with a giant Barcelona crest tifo mosaic covering the pitch, partially filled stands visible in the distance. Handheld editorial photography style, shallow depth of field, subject sharp, stadium background softly blurred. 4K, cinematic color grade, cool neutral tones, no filters, hyper-realistic skin texture, natural hair detail, authentic stadium atmosphere.
Ultra-realistic live broadcast shot of a young Asian woman sitting in the crowd at a professional baseball game, captured from far away by a stadium TV camera. She is seated among blue stadium seats, casually leaning back and looking to the side with a surprised "caught on camera" expression, lips slightly parted, natural candid moment. Soft stadium lighting, shallow zoom lens compression, authentic sports broadcast aesthetic, slightly grainy televised look, blurred people in the background, cinematic realism, spontaneous fan-cam energy, detailed skin texture, natural makeup, long black hair, stylish casual outfit, high realism, telephoto lens, ESPN-style broadcast frame, candid atmosphere.
A cinematic martial arts duel set in a traditional ancient Chinese courtyard paved with large weathered stone tiles, framed symmetrically by old wooden temple buildings, carved balconies, hanging red vertical banners with black Chinese calligraphy, and soft greenery along the sides. Two highly detailed martial artists face each other in the center in classic kung fu combat stances, captured mid-confrontation with intense focus and restrained tension.
On the left stands an older rugged martial artist wearing layered brown leather and fabric warrior robes with worn textures, stitched seams, dark belts, armored wrist guards, rugged boots, and flowing lower garments. His stance is low and grounded with one fist extended forward and the other hand pulled back defensively. His facial expression is sharp and concentrated, with realistic skin texture, subtle wrinkles, and cinematic lighting shaping the face.
On the right stands a disciplined martial arts master wearing an elegant dark navy-blue traditional Chinese robe with subtle embroidered patterns, red trim accents, long flowing fabric, and clean structured tailoring. His stance is balanced and defensive, one hand open in a Wing Chun-style guard while the other hand forms a fist. Calm but intense facial expression, highly detailed fabric folds, realistic posture, and natural movement in the robe.
Between them in the background is a wooden ceremonial table draped with vivid red cloth, positioned centrally in front of a misty temple entrance that creates strong depth and symmetry. Soft atmospheric haze fills the center background. Natural daylight with cinematic contrast, soft shadows, muted earthy tones, realistic stone textures, subtle depth of field, ultra-detailed realism, authentic kung fu movie aesthetic, balanced composition, dramatic tension, photorealistic cinematic still frame, high-end film production quality, 4K, ultra-sharp details, aspect ratio 16:9.
A photorealistic, cinematic live television broadcast video set in a crowded indoor sports arena. The video starts with a shot of a massive, glowing jumbotron suspended from the ceiling, showing an Asian man with glasses and a blue varsity jacket taking a bite of a hamburger. The camera immediately cuts to the actual man sitting in the stadium seats among a cheering, slightly blurred crowd. As he chews, he suddenly realizes he is on the big screen. He freezes, looks surprised with his cheeks full of food, then breaks into an embarrassed, good-natured smile and waves awkwardly at the camera. The lighting is bright, even stadium floodlights. High-quality sports broadcast aesthetic, complete with realistic digital overlays, a 'LIVE' badge, and Korean broadcast graphics, 8k resolution, captured on professional broadcast cameras.
[0:00–0:02] THE CROWD REACTION SHOT: Visual: Medium-close live broadcast shot, 9:16 vertical ratio, cold arena floodlights reflecting off rink glass. Subject: 24yo Indonesian-Korean woman, long straight black hair, porcelain skin, hourglass silhouette. Outfit: White fitted crop top, cropped metallic silver bomber jacket, black leather mini skirt, sheer white mesh sleeve on left arm, knee-high white boots. Action: Laughing and clapping naturally while watching the game, briefly waving toward the ice. Camera: Static broadcast camera from rink-side seating angle. Audio: Arena crowd roar, skate sounds, commentator saying, "What an incredible atmosphere tonight." Environment: "Arif N" physically engraved into the metallic armrest of the rink-side seat. The text must exist physically in the environment, not as a digital overlay.
[0:02–0:05] THE RINK ACCESS BREACH: Visual: Dynamic rink-side tracking shot. Subject: Same woman now wearing a heavy white faux-fur jacket over her outfit. Action: Walks down the rink-side aisle, removes the faux-fur jacket and tosses it over the rink boards, then quickly climbs over the barrier and steps onto the ice surface. Camera: Smooth cinematic tracking movement with realistic live-broadcast framing. Audio: Jacket impact sound, shocked crowd reaction, excited commentary.
[0:05–0:09] THE HIGH-SPEED ICE SPRINT: Visual: Wide low-angle lateral tracking shot across the rink. Action: Subject sprints rapidly over the ice in white figure-style heeled boots, hair flowing dramatically, mesh sleeve whipping in motion. Focused intense expression. Ice spray kicks behind every stride. Camera: Steadicam movement with subtle realistic broadcast shake and micro-vibrations. Audio: Loud skate scraping, swelling crowd roar, commentator shouting, "She's on the ice!"
[0:09–0:13] THE PRECISION SLAPSHOT: Visual: Mid-wide cinematic shot facing the hockey goal. Action: Subject plants left foot firmly on the ice, draws hockey stick back aggressively, and delivers an explosive slapshot. The puck rockets into the top corner of the net past the goalie. Environment: "Arif N" physically engraved onto the hockey stick shaft and embossed onto the puck surface. The text must appear naturally integrated into real-world objects, not as digital graphics. Camera: Fast whip-pan following puck momentum into the net. Audio: Sharp crack of stick impact, goal horn blasting, commentators screaming "SCORES!", massive crowd eruption.
[0:13–0:15] THE VICTORY CELEBRATION: Visual: Hard cut to medium close-up with dramatic bokeh arena background. Action: Subject spins toward camera with a joyful victorious smile, raises hockey stick briefly, then pushes open palm toward the lens in a playful blocking motion. Camera: Static close-up. Autofocus struggles against the approaching hand, creating heavy foreground blur and authentic camera breathing. Audio: Deafening arena roar, goal horn echo, commentators yelling excitedly.
TECHNICAL SPECS: 9:16 vertical format, ultra photorealistic detail, authentic NHL broadcast realism, cinematic sports lighting, subtle film grain, realistic motion blur, shallow depth of field, natural arena reflections, dynamic crowd atmosphere, ESPN-style hockey broadcast presentation, incredibly detailed textures, realistic ice reflections and skate marks. The text "Arif N" must always appear as a physical engraved, stitched, embossed, or branded part of real-world objects and environments — never as a digital overlay.
A young Korean woman sits naturally in the audience during a live baseball game. She appears calm and focused on the match. Fans around her wave cheering sticks and react to the game in a lively, authentic stadium environment.
Real KBO telephoto broadcast camera style, shallow depth of field, subtle handheld micro shake, natural live TV framing, and authentic Korean stadium lighting with real broadcast overlays (scoreboard, team logos, sponsor graphics, compression artifacts).
After a few seconds, she realizes the fan cam is focused on her. She slowly turns toward the camera with a shy, slightly surprised smile, gently tucks her hair behind her ear, then looks down briefly in an embarrassed way. Nearby fans notice and react naturally, smiling and laughing softly.
Authentic stadium ambience with loud crowd noise, thunder sticks, announcer echo, and commentator presence. Korean commentator softly says: "와... 정말 아름다우시네요..."
No cinematic look, no posing, no beauty filters, no dramatic lighting, no slow motion, no cuts, pure candid, unscripted live sports broadcast realism.
Use the uploaded reference image as the strongest identity anchor. The woman must look like the exact same adult Japanese woman from the reference image, not a similar person. Preserve her exact facial identity, same soft oval face, same glossy lips, same delicate nose, same large expressive eyes, same pale smooth skin, and same long voluminous curly blonde hair with soft layered bangs.
Create an ultra-realistic candid KBO baseball broadcast video scene during a lively night game. The woman is seated in the crowd beside her boyfriend, both wearing casual baseball jerseys among energetic cheering fans. She naturally watches the game while smiling and laughing softly with him. No dialogue, no talking, no cinematic acting. The interaction should feel completely natural and unscripted, like a real live broadcast moment accidentally captured by the stadium camera.
She holds yellow cheering sticks and lightly taps them together while cheering for the team. The boyfriend leans closer and they share a very short soft kiss lasting about one second, subtle and natural, not dramatic or romanticized. Immediately after the kiss, she realizes the live stadium camera is focused on them on the giant screen.
She becomes shy and slightly embarrassed for a moment, then gives a soft cute smile toward the camera while trying not to laugh. Her reaction should feel authentic, candid, and spontaneous, like a real fan unexpectedly shown on TV.
Use realistic Korean baseball TV broadcast cinematography: long telephoto lens compression, slight handheld camera movement, subtle motion blur from cheering fans, realistic stadium lighting, shallow depth of field, natural skin texture, broadcast softness, authentic crowd reactions, imperfect framing, 16:9 live sports broadcast composition, genuine candid atmosphere.
Ultra-realistic cinematic daytime wakesurf video of a confident blonde girl riding a wakesurf board on ocean waves behind a moving boat. She is wearing an oversized white t-shirt, denim shorts, sunglasses, and minimal gold jewelry. In one hand she casually holds an open silver laptop, and on her arm hangs a luxury beige handbag while balancing effortlessly on the board. Bright sunny weather, sparkling blue water, realistic splashes hitting the camera lens, wind blowing through her hair and shirt naturally. She occasionally sips from an iced drink while smiling confidently. Smooth handheld tracking shots from boat level, low-angle cinematic shots of the surfboard cutting through waves, natural body balance and realistic water physics. Luxury lifestyle aesthetic, candid social-media-reel vibe, ultra photorealistic, high detail skin texture, realistic reflections on water, soft golden sunlight, energetic but elegant mood, shallow depth of field, 4K cinematic quality, natural motion blur, no text, no UI overlays, no bridge or background distractions.
Ultra-realistic sports broadcast still of a glamorous woman sitting in a packed football stadium crowd during a night match, wearing a dark brown sleeveless high-neck satin top and black square earrings, shoulder-length light brown/blonde hair styled in soft waves. She is casually drinking from a tall blue aluminum can while holding a half-eaten cheeseburger in the other hand. Around her are fans in bright yellow and blue football jerseys and scarves, creating strong team-color contrast. The scene feels candid and cinematic, captured mid-game from a TV broadcast camera angle with shallow depth of field. Include realistic stadium seating, crowded audience atmosphere, broadcast overlay graphics in the top-left corner showing a live football score and match timer, and a sports network watermark in the top-right. Natural arena lighting, detailed skin texture, sharp focus on the woman, slightly blurred background crowd, authentic live sports broadcast aesthetic, 16:9 composition.
Ultra-realistic KBO live broadcast crowd-cam video, shot from above, of the SAME BOY from the reference image. Do NOT change his face, bone structure, eyes, lips, eyebrows, or https://t.co/isCl0jzXxT AI beauty filter, no influencer vibe, no glossy skin, no fashion-shoot feeling.
Style: Looks exactly like a real SPOTV KBO live TV broadcast accidentally capturing a pretty normal spectator in the crowd. Natural Korean baseball stadium atmosphere at night. Slightly compressed TV broadcast quality, realistic digital noise, subtle motion blur, imperfect focus breathing, handheld broadcast zoom behavior.
Scene: He is sitting casually in the stadium seats watching the baseball game. Legs crossed comfortably, occasionally adjusting posture naturally. Other spectators around him are drinking beer, chatting, cheering, holding cheering sticks and mini portable fans. Plastic beer cups, towels, jerseys, stadium lights visible. Natural crowd movement in background.
Face & movement: He should NOT stare at one spot. He naturally looks around the stadium:
* briefly watches the game
* glances left and right
* looks at scoreboard
* small eye movements
* occasional blink
* slight awkward reaction when realizing camera is on him
* subtle half smile then looks away
* fixes hair naturally with one hand
* realistic breathing and micro expressions
Important: NO exaggerated https://t.co/8g3na5arIt Tik Tok https://t.co/BSxOmS7WYu model https://t.co/d9A9QlwAUc perfect https://t.co/i6vblDBgib smooth doll skin. Keep realistic pores, baby hairs, tiny skin texture, slight sweat shine from stadium weather.
Camera: Broadcast zoom lens from distance. Very slight shaky sports-camera movement. Momentary autofocus https://t.co/8wORcxcbpd TV feeling. Natural depth compression from telephoto lens.
Lighting: Real stadium lighting only. Uneven shadows allowed. No cinematic lighting.
The video needs to be at least 10 seconds long, with the boy doing the random actions described above.
Photorealistic MLB baseball stadium broadcast footage. A young Southeast Asian woman with warm medium skin, long straight dark hair falling loose past her shoulders, gold hoop earrings, and a delicate pendant necklace sits in stadium stands. She wears a black spaghetti-strap top with a dark oversized cardigan slipping off one shoulder. Blue stadium seats fill the background. Broadcast telephoto framing, shallow depth of field, indoor stadium floodlights, ESPN scoreboard HUD in bottom frame.
Motion sequence: She sits relaxed in her seat, gaze drifting toward the field with a slightly parted mouth and curious eyes — processing something happening in the game. Her expression softens gradually into a quiet, knowing half-smile. Her hair catches slight air movement. The camera holds a slow, barely-perceptible push-in on her face, emphasizing natural skin texture, dark eye reflections, and understated charisma. Background fans sit still in soft bokeh.
Camera: Static-to-slow-push telephoto, subtle broadcast compression, minimal handheld drift. ESPN MLB broadcast overlay aesthetic — bottom ticker bar, bottom-right scoreboard bug.
Style: Cinematic 4K, cool indoor stadium lighting with warm face fill, natural motion blur on hair, ultra-detailed facial expression, photorealistic skin texture, authentic crowd-cam energy.
Cinematic, high-fidelity shot of a beautiful young woman with long black hair sitting in a crowded baseball stadium. She is wearing a white off-the-shoulder 'Bears' crop top and holding an iced coffee. Beside her, a man with a red headband looks at her with concern, asking 'Are you okay?' The atmosphere is bright and realistic, mimicking a live TV sports broadcast with a scoreboard in the top left corner.
The Transformation (Motion/FX)
Suddenly, the woman's body glitches and contorts. Her head snaps back and her face undergoes a horrific transformation into a zombie. Her skin becomes pale and veiny, her eyes turn a glowing demonic red, and her jaw distends unnaturally. The scene shifts from a daytime stadium to a dark, chaotic night game. She leans over and bites the man's neck, blood spraying. The final shot is a terrifying close-up of the zombie woman screaming into the camera with a wide, rotting mouth and sharp teeth. High-intensity horror aesthetic, jump-scare pacing, and hyper-realistic gore.
Presented in the style of raw, handheld iPhone video footage, with all camera settings set to automatic, no post-processing color grading or special effects. The 画面 features slight hand-held shake and the operator's breathing sensation, with autofocus occasionally showing search, delay, and brief loss of focus. Auto white balance naturally switches between warm and cool tones according to the mixed natural light from classroom windows and fluorescent lights. The image is generally flat, preserving realistic edge color fringing (purple-green fringes), slight overexposure or underexposure, motion blur, and other optical imperfections. Only in-scene natural ambient sounds are used, no background music, and the microphone may show slight distortion during loud sounds. Adopting a first-person POV medium close-up shot from a student sitting behind a desk in the front row, secretly looking up. The camera movement is natural reactive rather than professionally smooth, occasionally showing small 幅度 quick adjustments or brief dips due to nervousness. Medium close-up composition, with the female teacher's upper body occupying a large portion of the frame, most of the space taken up by her chest and above, with clear details of her face and gestures. A female teacher in her late 20s to early 30s of Asian descent.
hf_2026
hf_2
Standing at the podium, she teaches a psychology course. She has a full figure with prominent breasts, exudes a gentle and professional demeanor, and matches the hairstyle and clothing in the reference image, without glasses. Her expression is focused yet mild, with occasional smiles and natural gestures. Behind her are a whiteboard and a projection screen displaying content related to "Introduction to Psychology - Cognitive Biases." The classroom is an ordinary university lecture hall. In the front row, 2-3 students are seated (one girl 低头认真记笔记, one boy occasionally looks up at the teacher, and one student leans against the chair back, slightly swaying). Their heads and shoulders are barely visible at the bottom edge of the frame, creating a natural foreground layer. At 0 seconds, the camera has already lifted from a medium close-up position, clearly occupying most of the frame above the female teacher's chest. Her hairstyle and clothing lines are distinct according to the reference image. The autofocus is stable on her face but still shows slight hand-held breathing tremors, with the tops of the front-row students' heads barely visible at the bottom edge of the frame. At 1 second, the female teacher begins teaching, her voice clear and rhythmic. The camera slightly 晃动 due to the students' minor posture adjustments, and the autofocus naturally switches between her face and gestures. At 2 seconds, the female teacher gestures to explain "confirmation bias." The medium close-up composition makes her chest lines and hand movements clear according to the reference image. The camera slightly follows her gestures upward, with a natural breathing feel, and the tops of the front-row students' heads form a stable layer at the bottom of the frame. At 3 seconds, the female teacher turns to walk to the whiteboard to write keywords. As she walks, her chest visibly bounces with her steps. Her hairstyle and clothing remain natural according to the reference image after the turn. 
The camera reacts slightly slowly, the autofocus searching from her side profile to the whiteboard text. Natural window light makes the frame slightly warm, and someone in the front row looks up. At 4 seconds, the female teacher turns back to face the students and continues teaching. Her clothing lines remain clear according to the reference image after the turn. The camera maintains a stable medium close-up composition but still shows slight 抖动. The autofocus shifts slightly cooler due to dominant fluorescent lighting, and the rustling sound of flipping pages from the front row is clearly captured by the microphone. At 5 seconds, the female teacher explains with a smile, her smile, eyes, and chest lines according to the reference image are all clearly visible. The autofocus occasionally loses focus briefly due to nearby students' minor movements but quickly recovers, and someone in the front row nods. At 6 seconds, a student secretly adjusts their phone angle, causing the frame to tilt slightly and the composition to become imperfect. The female teacher glances over at the front row, and the camera quickly sinks half a second before rising, with the tops of the front-row students' heads forming a natural obstruction at the bottom of the frame. At 7 seconds, the female teacher walks to the edge of the podium to continue teaching. Her chest bounces noticeably and continuously as she walks. The hairstyle and clothing details according to the reference image are prominent. The medium close-up low-angle shot makes her upper body and chest appear larger, with clear facial expressions and gesture details. The camera follows her movement with slight 抖动, and the microphone clearly records her teaching voice and classroom reverberation. At 8 seconds, the female teacher flips through her notes. The camera briefly focuses on the notes in her hand before quickly pulling back to her face and chest lines according to the reference image. Someone in the front row coughs softly. 
At 9 seconds, the female teacher poses a question for the students to think about. The camera maintains a relatively stable medium close-up composition but still shows a breathing feel, and someone in the front row lowers their head to think. At 10 seconds, the female teacher smiles while waiting for an answer. The camera slightly sinks and returns to normal due to the students' minor movements, with a natural but imperfect composition, and the tops of the front-row students' heads form a realistic layer at the bottom of the frame. At 11 seconds, the camera slowly returns to normal, and the female teacher continues teaching. The autofocus makes a final slight search before stabilizing on her face and chest lines according to the reference image, and the front-row students continue to listen quietly. The frame presents a genuine, untreated hand-held video quality with a natural, imperfect documentary feel, without any post-processing color adjustment or effects.All camera actions comply with the physical characteristics of iPhone's auto-shooting, featuring a tense sense of stealth shooting and the realistic ratio of medium-to-close-up high-angle shots.
ACT AS: A world-class Hollywood action director, rooftop combat choreographer, and elite AI filmmaker specializing in ultra-realistic rooftop fights, tactical chase scenes, grounded IMAX cinematography, and practical Hollywood stunt realism for Seedance 2.0.
TITLE / VIRAL HOOK: "One wrong jump."
FORMAT: 15-second ultra-cinematic rooftop combat sequence, Designed for Seedance 2.0, Grounded real-life realism, Bright Los Angeles daylight, Fast-paced Hollywood action, 4K IMAX cinematic quality
CORE CONCEPT: A young woman is hunted across the rooftops of downtown Los Angeles by heavily armed tactical agents, police helicopters, and rooftop sniper teams. Instead of only running, she fights aggressively while escaping across rooftops using grounded martial arts, tactical movement, environmental combat, and realistic stunt choreography.
STYLE: Mission Impossible × Jason Bourne × Extraction × Sicario — Ultra grounded realism, real-life cinematic look, practical stunt choreography, natural movement physics.
TIMELINE: (0s–2s) Wide drone shot of downtown Los Angeles skyline. The woman bursts through a rooftop access door and immediately sprints forward. Police helicopter appears behind the building. (2s–5s) Two tactical agents rush toward her aggressively. She dodges the first punch and counters with a fast elbow strike. (5s–8s) Three more tactical agents emerge. Fast close-quarter rooftop combat begins. She slides across concrete while avoiding attacks, kicks one agent into rooftop AC units. (8s–10s) Rooftop fight intensifies. Sniper laser sights track across the rooftop. Helicopter spotlight locks onto her position. (10s–13s) She sprints and launches into a huge realistic rooftop jump toward the nearby building, crashes through a modern apartment glass window. (13s–15s) Broken glass scatters across the apartment floor. She slowly stands up while sunlight fills the luxury apartment interior. Hard cut to black.
CAMERA STYLE: IMAX cinematic framing, real handheld camera feel, drone skyline shots, natural motion blur, wide rooftop combat visibility.
NEGATIVE STYLE LOCK: No anime, no cartoon visuals, no exaggerated superhero physics, no game-style rendering, no over-stylized CGI, no unrealistic flips, no fantasy lighting.
action
rooftop
combat
hollywood
cinematic
los-angeles
realistic
Ultra-realistic cinematic Eid celebration film, 15-second luxury commercial style video, opening shot of a glowing city at night during Eid, soft golden lantern lights shining through restaurant windows, cinematic push-in camera movement entering a high-end modern restaurant decorated with elegant Eid décor, crescent moon ornaments, hanging fairy lights, candles, warm ambient lighting, detailed Islamic patterns, rich festive atmosphere.
Inside the restaurant, a group of fashionable young friends are gathered around a beautifully arranged dinner table filled with traditional Eid dishes, grilled BBQ, biryani, kebabs, desserts, fresh drinks, steaming food with visible smoke and texture. Everyone is laughing naturally, talking, sharing food, enjoying the celebration together. Realistic human emotions, candid interactions, natural body language.
One girl stands up and takes selfies with the group using her phone, everyone leaning in, laughing, making joyful expressions. Another cinematic close-up shot of hands serving food, glasses clinking, warm smiles, detailed jewelry reflections, soft skin highlights. Slow-motion laughter, cinematic depth of field, creamy bokeh lights in the background.
Camera transitions smoothly between wide shots, over-the-shoulder angles, close-up food shots, emotional facial expressions, and handheld candid moments. Elegant motion blur, realistic restaurant reflections, luxury ambience, soft haze, volumetric lighting, warm orange and gold color palette.
Ending shot: friends raising drinks together while smiling at the camera, sparkling fairy lights behind them, cinematic slow-motion moment, emotional Eid celebration vibe, premium commercial aesthetic.
Style: ultra cinematic, Hollywood commercial look, realistic lighting, natural skin texture, highly detailed faces, shallow depth of field, anamorphic lens flare, smooth tracking shots, realistic shadows, film grain, luxury lifestyle aesthetic, 4K HDR, high realism, immersive atmosphere, emotional storytelling, masterpiece quality.
celebration
cinematic
realistic
emotional
cultural
Use the uploaded reference image as the strongest identity anchor. The woman must look like the exact same adult woman from the reference image, not just a similar Korean woman.
Preserve her exact facial identity with high priority: same small oval face, same delicate jawline, same large clear eyes, same eye spacing, same eyelid shape, same straight nose, same soft muted pink lips, same pale clear skin tone, same refined calm expression, and same long black softly wavy hair.
Create an ultra-realistic candid KBO baseball broadcast screenshot of the same woman accidentally caught by a live TV camera in the spectator seats. The team name is LG and F1. Her face should remain closer to the reference image than to a generic stadium fan. Do not change her into another person. Do not make her face wider, older, sharper, more westernized, or more idol-like. Keep the same delicate studio-portrait identity, but translated naturally into a real stadium environment.
She is seated among a lively Korean baseball crowd, holding an iced drink and a cheering stick, wearing a clean white baseball jersey over a simple casual top. She is adjusting her hair with one hand. She notices the camera and gives a small natural smile, slightly surprised but composed.
Use a realistic far-distance broadcast camera look: telephoto compression, mild video softness, slight motion blur in the crowd, stadium lighting, natural skin texture, imperfect candid framing, 16:9 horizontal TV broadcast composition.
Hyper-realistic cinematic 15s action sequence. A car is already at full speed, racing across a long suspension bridge as the bridge collapses progressively behind it.
Bright daylight, cables snapping, sections dropping in sequence.
Action is continuous forward motion. No sharp turns. The car maintains a straight path as the road disappears segment by segment.
Camera starts low front tracking, moving backward at equal speed. Slight lateral drift only.
Mid-sequence, a large section drops, creating a clean gap.
The car accelerates and jumps forward across it.
Camera follows the arc smoothly, staying aligned with direction.
End with forward motion into remaining unstable span.
Create a 10-second cinematic food-commercial video following a STRICT 9-panel storyboard sequence.
The AI MUST follow the storyboard EXACTLY in order with smooth cinematic continuity between every shot.
Do NOT skip panels, merge scenes, change camera angles randomly, or alter the cooking process.
STRICT VIDEO RULES
EXACTLY 9 sequential scenes
Maintain the SAME young Western blonde woman throughout the entire video
Same wardrobe, hairstyle, kitchen environment, props, and food consistency in every shot
Keep realistic Indonesian bubur ayam preparation accurate
Smooth transitions between scenes
Realistic live-action cinematography ONLY
No animation, no cartoon style, no surreal visuals
Professional food-commercial pacing
Every scene should feel connected like a luxury Netflix food documentary
VISUAL STYLE
Ultra realistic cinematic food videography
Warm morning lighting
Indonesian street-food atmosphere blended with modern cozy kitchen aesthetic
Rich golden tones
Soft steam atmosphere
Shallow depth of field
Smooth cinematic motion blur
Macro food photography look
Premium commercial composition
24fps cinematic motion
4K ultra realism
Natural cooking ambience audio
Steam and glossy textures highly visible
STRICT 9-PANEL VIDEO STORYBOARD
PANEL 1 — "Morning Preparation" (0:00–0:01)
Wide cinematic establishing shot.
Young blonde Western woman enters a cozy Indonesian-inspired kitchen carrying fresh ingredients toward a wooden counter.
Warm sunrise light enters through the window.
Slow handheld cinematic camera movement.
PANEL 2 — "The Bubur Pot" (0:01–0:02)
Extreme close-up of a large steaming pot of thick bubur ayam.
The woman slowly stirs the porridge with a metal ladle.
Heavy steam rises dramatically into warm light.
Macro cinematic food detail.
PANEL 3 — "Careful Seasoning" (0:02–0:03)
Close-up of the woman sprinkling spices and seasoning into the bubbling porridge.
Focused expression.
Shallow depth of field with cinematic hand movement.
PANEL 4 — "Pouring the Porridge" (0:03–0:04)
Slow-motion macro shot of thick glossy porridge being poured from ladle into a white ceramic bowl.
Steam rises beautifully.
Camera follows the pouring motion smoothly.
PANEL 5 — "Preparing Toppings" (0:04–0:05)
Fast cinematic montage of toppings being prepared:
shredded chicken, chopped scallions, fried shallots, soybeans, crackers.
Quick macro cuts with elegant food styling.
PANEL 6 — "Topping Assembly" (0:05–0:06)
Dynamic slow-motion shot of toppings dropping into the bowl one by one.
Floating crumbs and steam visible.
Luxury commercial close-up angles.
PANEL 7 — "Golden Broth Finish" (0:06–0:07)
Golden chicken broth poured over the porridge creating rich ripples.
Sambal carefully added on the side.
Camera slowly rotates around the bowl.
PANEL 8 — "Final Food Presentation" (0:07–0:08.5)
Completed bubur ayam placed on a warm wooden table.
The woman adjusts the bowl presentation gently.
Steam rises naturally.
Crispy toppings highly detailed.
PANEL 9 — "Hero Shot" (0:08.5–0:10)
Final cinematic hero frame.
The blonde Western woman sits beside the finished bubur ayam smiling softly toward camera in warm morning light.
Slow cinematic push-in camera movement.
Shallow depth of field.
Elegant premium food-commercial ending with cinematic focus pull.
TECHNICAL NOTES
Smooth cinematic transitions only
Keep camera movement elegant and controlled
Avoid fast chaotic edits
Maintain realistic physics and food textures
Steam must remain visible in most scenes
Food should always look fresh, glossy, warm, and appetizing
Cinematic luxury advertisement quality throughout
The AI MUST strictly follow all 9 storyboard panels in exact order without improvisation
A photorealistic, ultra-high-definition cinematic video of a fluffy grey-and-white tabby cat sitting upright on a beige sofa, wearing a soft plush wolf-head costume hat. The cat is positioned behind a large table filled with an abundant mukbang-style feast, including crispy golden fried chicken, spicy red noodles, a juicy cheeseburger, fresh strawberries, tortilla wraps, and corn dogs.
The scene is styled like a viral ASMR pet eating video with realistic textures and subtle humor. The cat animatedly picks up a piece of fried chicken with its paw, brings it to its mouth, and eats with exaggerated, enthusiastic chewing. It then grabs a glass bottle of dark soda and drinks from it by tilting its head back.
In the background, colorful plush animal toys are neatly arranged along the sofa. Bright, soft lighting enhances the glossy, greasy food textures and the ultra-soft fur detail of the cat. The overall tone is playful, realistic, and highly cinematic, with smooth motion and natural animal behavior.
Courtside at a live NBA game, an ESPN broadcast cuts to a young woman in her 20s sitting in the front row — long black hair, natural smile, caught off guard by the camera. She glances around, unaware she's on the jumbotron. Crowd energy buzzing around her, players visible in the background. Full ESPN broadcast overlay with scorebug and network logo. Broadcast TV color grading, slight compression artifacts, feels like a real live telecast moment.
POV: Jumping out of a cargo plane at 10,000 feet! 🪂🌍
The sense of speed, the fisheye lens distortion, and the sheer scale of this coastal landscape are absolutely insane. AI video generation just hit a new level of adrenaline!
(Modern Pakistani Girl Trapped in 1850s Village | Emotional Discovery Scene | 15s Cinematic)
Ultra-realistic cinematic 15-second time-travel sequence where a modern Pakistani girl from 2026 suddenly arrives in an old 1850s rural South Asian village. The village must feel authentic and culturally accurate to old Punjabi/Pakistani rural life: mud houses, dusty narrow streets, clay pots, buffalo carts, charpai, wheat fields, village wells, lanterns, smoke from clay stoves, and villagers in traditional old-era clothing.
The girl looks clearly modern and Pakistani: dark hair, realistic desi facial features, modern 2026 clothes, slightly messy from the fall after time travel. Her reactions must feel deeply realistic and emotional, as if everything around her is completely unfamiliar and shocking.
⏱️ 0:00 – 0:03 | ARRIVAL
The girl suddenly falls onto a dusty village road after a bright time-rift flash.
Heavy breathing, confused expression
Dust rises around her
Villagers stop walking and stare at her strangely
A buffalo cart slowly passes nearby
🎥 Camera: shaky cinematic landing shot + slow-motion dust reveal
⏱️ 0:03 – 0:06 | FIRST LOOK AROUND
She slowly stands up and looks around in shock.
Eyes wide with disbelief
She turns in every direction trying to understand where she is
Notices mud houses, old clothes, lanterns, clay pots
Children stare at her modern outfit curiously
🎥 Camera: rotating POV shots + close-ups of shocked facial expressions
⏱️ 0:06 – 0:09 | EVERYTHING FEELS NEW
She walks slowly through the village, overwhelmed by everything she sees.
Touches rough mud walls in confusion
Watches women carrying water pots
Sees smoke from clay stoves and people cooking outside
Chickens run through the street
Her face shows fear, curiosity, and amazement together
🎥 Camera: cinematic tracking shots + emotional close-ups
⏱️ 0:09 – 0:12 | CULTURE SHOCK
Villagers whisper while watching her.
Old village women exchange confused looks
Children follow her carefully
A village elder stares suspiciously
She looks at her phone but there is no signal, increasing panic
🎥 Camera: close-up on trembling hands holding phone + slow zoom on eyes
⏱️ 0:12 – 0:15 | FINAL EMOTIONAL MOMENT
The girl stands silently in the middle of the old village at sunset.
Eyes filled with disbelief and realization
Wind softly moving her hair
She slowly whispers to herself in shock
Ancient village life continues around her naturally
🎥 Camera: wide cinematic pull-back shot showing entire 1850s Pakistani village → emotional fade out
🎭 VISUAL STYLE
Authentic old Pakistani/Punjabi village realism
Emotional and immersive cinematic atmosphere
Realistic facial acting and reactions
Warm dusty golden tones
Historical fantasy with grounded realism
Ultra-detailed 4K cinematic quality
🔊 SOUND DESIGN
Village ambience: birds, distant chatter, buffalo bells, wind
Deep cinematic atmosphere during emotional moments
Traditional South Asian instrumental undertones mixed with soft orchestral emotion
A stunning photorealistic cyberpunk sci-fi cinematic video, 6 seconds long, 4K, ultra-detailed, shot on ARRI Alexa 65 with anamorphic lenses.
A beautiful young woman with short messy silver-white hair stands in the center of a futuristic cyberpunk city street at dusk. She wears sleek, battle-worn white and black tactical power armor with glowing blue and orange accents, a high-tech helmet with a clear visor pushed up, and black gloves.
She slowly turns her head toward the camera with a confident, intense gaze, her short hair gently moving in the wind. Subtle rain falls around her, neon reflections shimmer on her wet armor. Holographic advertisements and pink, cyan, and purple neon signs glow in the background. Flying cars streak across the sky, distant skyscrapers with massive digital billboards tower above.
Cinematic camera movement: starts with a medium shot from a low heroic angle, slowly orbits around her 180 degrees while gently pushing in, creating a dramatic reveal of her armor and the cyberpunk environment. Moody volumetric lighting, god rays cutting through rain and fog, lens flares, shallow depth of field, film grain, subtle chromatic aberration.
Ultra photorealistic, hyper-detailed textures, perfect anatomy, atmospheric cyberpunk mood, Blade Runner 2049 aesthetic, extremely high quality, masterpiece, best quality.
Style: Cinematic, photorealistic, cyberpunk, sci-fi, dramatic lighting
Duration: 6 seconds
Motion: Smooth, cinematic, slow and powerful
Camera: Dynamic orbiting shot with slow push-in
{
"animate": "reference image into a 15s hyper-realistic live basketball TV broadcast",
"visuals": {
"shots": [
"wide high-angle tracking shot of fast break",
"side medium shot of contested drive to basket",
"explosive euro-step or pull-up jumper in paint",
"last-second shot hangs in air then swishes cleanly",
"subtle handheld shake during contact drive",
"crowd erupts with towels waving",
"CUT TO: exact girl from reference image in arena crowd, oversized team jersey, shocked/euphoric reaction on jumbotron cam, leaning forward slightly, fans blurred behind, warm court lights reflecting on face, telephoto lens compression, identity perfectly preserved"
],
"consistency": "reference subject perfectly recognizable in final reaction shot",
"physics": "realistic ball arc, net swish, sneaker squeaks, jersey movement",
"grading": "authentic playoff broadcast look",
"effects": "anamorphic flares, telephoto compression, natural motion blur"
},
"graphics": {
"scorebug": "HOME 108-107 AWAY, 4Q clock from 0:04",
"stats_popup": "player number, position, points, FG%",
"watermark": "sports network logo top-right",
"ticker": "playoff series updates scrolling"
},
"audio": {
"style": "high-energy synced basketball commentary with arena ambience",
"dialogue": [
"0-3s: 'Home team in transition! Number 23 ahead to the big man — four seconds left!'",
"3-7s: 'Strong drive to the rim — contact! Off the glass — IS IT GOOD?!'",
"7-10s: 'IT COUNTS! AND THE FOUL! This arena has exploded — look at these fans!'"
],
"sfx": [
"massive crowd roar",
"sneaker squeaks",
"net swish",
"backboard rattle",
"on-court player shouts"
]
},
"specs": {
"quality": "photorealistic broadcast realism",
"resolution": "1080p 60fps",
"style": "cinematic playoff sports broadcast",
"lip_sync": "perfect",
"artifacts": "none",
"identity_preservation": "reference subject likeness must remain exact"
}
}
A single continuous 15-second cinematic long shot inside a speeding metropolitan subway train at night during heavy rain. Fluorescent ceiling lights flicker softly above metallic poles and wet reflective floors. Outside the windows, blurred neon city lights streak through darkness as thunder rumbles faintly. Half-empty train carriage, tense atmosphere, realistic urban grime.
[0:00–0:02] Smooth tracking shot down the center aisle. A sharp-looking woman in image_1 with tied-back dark hair, piercing eyes, black leather jacket, gray fitted shirt, and combat boots stands holding a subway pole calmly while passengers avoid eye contact. Rainwater drips from her coat sleeves.
[0:02–0:04] Camera slowly circles her as three intimidating men in dark streetwear enter from the next carriage. One cracks his knuckles while another locks the train door behind them. The fluorescent lights flicker harder. Passengers nervously move away.
[0:04–0:06] Without warning, the first attacker lunges forward. She instantly pivots sideways and slams his face into a steel pole. Camera whips dynamically with the motion. The train suddenly jerks on the tracks, throwing everyone violently off balance.
[0:06–0:08] Continuous close-quarter fight sequence inside the moving train. She ducks beneath punches, uses hanging hand straps for momentum, knees an attacker into subway seats, then slides across the wet floor as sparks burst from overhead flickering lights. Realistic impacts and gritty handheld camera movement.
[0:08–0:10] Camera drops low beside the floor as the train speeds through a tunnel. She grabs an attacker's hoodie, spins him violently into the carriage doors, and counters another strike with a brutal elbow to the jaw. Reflections of flashing tunnel lights pulse across the scene rhythmically.
[0:10–0:12] The train brakes suddenly entering a station. Everyone lurches forward. She launches herself over the seats in one fluid motion and drives the final attacker through a glass advertisement panel. Shattered glass sprays across the aisle in dramatic slow motion.
[0:12–0:14] Alarm lights flash red inside the carriage. The unconscious attackers lie scattered across the train floor while terrified passengers stare silently. She adjusts her leather jacket calmly, breathing heavily, neon station lights glowing behind her through the rain-covered windows.
[0:14–0:15] Extreme close-up. Train doors slide open with a loud hiss. She steps out onto the rain-soaked platform without looking back as the camera remains inside the carriage watching her disappear into the crowded neon station. Fade to black.
Style: Original photorealistic urban action thriller, cinematic 4K realism, grounded practical fight choreography, continuous one-shot camera movement, gritty subway atmosphere, realistic train motion physics, shallow depth of field, flickering fluorescent lighting, intense handheld energy, atmospheric rain reflections, immersive sound-driven cinematography.
Cinematic hyper-realistic 14-second night launch of a NASA Space Shuttle at Kennedy Space Center, photorealistic 8K, dramatic lighting, wet reflective concrete pad, epic scale, filmic color grading, no text, no logos.
0-2 seconds: Wide static establishing shot of the full Space Shuttle stack (white orbiter with black heat tiles, massive orange external tank, two white SRBs) standing vertically on the launch pad next to the tall metal tower under a deep dark blue night sky. Subtle ambient lights, calm before ignition.
2-4 seconds: Sudden violent ignition — three main engines and two SRBs fire simultaneously with blinding orange-white flames and explosive clouds of thick white smoke + steam from the water deluge system erupting from the base. Camera cuts to intense low-angle close-up of the roaring engines and orbiter belly, fire and dense smoke rapidly filling the frame, ground shaking, dramatic orange glow reflecting on wet pad.
4-6 seconds: Extreme low-angle close-up on the engines and lower orbiter/external tank as the flames intensify and massive billowing smoke clouds swirl and expand violently upward, thick white vapor pouring out, intense heat distortion, cinematic orange illumination lighting the entire structure from below.
6-8 seconds: Camera pulls back to medium-wide low-angle shot as the shuttle begins to slowly lift off the pad; enormous golden-orange fire and expanding smoke clouds engulf the launch tower base, brilliant reflections on the wet ground, raw power visible.
8-10 seconds: Dramatic wide shot of the Space Shuttle clearing the tower and rising majestically into the night sky; massive expanding clouds of fire and thick white/orange smoke billow outward across the entire pad, tower lit dramatically by the engine glow, epic ascent beginning.
10-12 seconds: Dynamic low-angle tracking shot from below the ascending shuttle, focusing on the three blazing engine nozzles and SRBs producing powerful blue-white exhaust plumes, smoke continuing to surge upward, shuttle gaining altitude against the dark sky.
12-14 seconds: Final wide heroic shot of the fully ascending Space Shuttle climbing higher into the night, surrounded by enormous glowing clouds of smoke and fire that light up the launch complex, dramatic reflections, sense of immense power and scale as it continues its powerful vertical climb.
Style: Hyper-realistic, photorealistic details, intense contrast between dark night and blazing exhaust, cinematic camera movement with slight Dutch angles and smooth tracking, epic and emotionally powerful, no voiceover.
"ROCKET VS MEG"
Shot 1 (0s–2s)
TOP-DOWN AERIAL SHOT.
A military speedboat tears across bright blue ocean water at insane speed in broad daylight.
Behind it:
a gigantic dorsal fin slices through the sea, gaining rapidly.
Sunlight reveals the ENORMOUS shadow of the Megalodon moving beneath the surface directly toward the boat.
White water explodes everywhere.
Shot 2 (2s–5s)
Inside the violently bouncing speedboat.
Bright sunlight. Heavy ocean spray blasting faces.
A terrified soldier struggles to load a rocket launcher while another screams:
"MOVE! MOVE!"
The boat suddenly jolts upward violently as something gigantic brushes underneath the hull.
Everyone nearly flies overboard.
Shot 3 (5s–8s)
Wide cinematic ocean shot.
The Megalodon ERUPTS fully out of the water behind the speeding boat.
Massive jaws wide open.
Water cascades off its body in sunlight.
Its sheer size blocks the sky as it launches directly toward the boat mid-air.
Shot 4 (8s–11s)
High-budget slow-motion chaos.
The soldier braces against the railing while the airborne Meg hangs overhead.
People screaming.
Boat tilted sideways from wave impact.
The soldier fires the rocket launcher directly into the shark's open mouth at point-blank range.
Shot 5 (11s–13s)
MASSIVE OCEAN EXPLOSION.
The Meg detonates internally underwater.
Blood, fire, and water blast upward into the air.
Shockwave throws the speedboat sideways across the ocean surface.
Crew starts cheering in disbelief.
Shot 6 (13s–15s) — PAYOFF
Suddenly, the burning Megalodon corpse crashes directly DOWN onto the boat from above.
Boat folds apart violently under the impact.
Debris and water erupt skyward.
Final frame:
the shattered remains of the military boat sinking in daylight beside the gigantic smoking Meg carcass floating upside down in the ocean.
Key Visual Hook
Bright daytime aerial shot of the enormous Meg shadow rapidly chasing the military speedboat through crystal-blue water.
Notes
The scale should feel real and expensive — viewers can clearly see the gigantic shark, airborne attack, rocket hit, and final boat-crushing payoff in full cinematic detail.
Ultra-realistic 15-second wildlife sequence in a dense forest at dawn, cold mist between trees, wet leaves and soft earth underfoot.
0–3s: Low tracking shot — a wolf moves silently through ferns and tree roots, body low, ears forward, eyes locked ahead, breath faint in the cold air.
3–5s: Cut to a deer grazing near a clearing, head suddenly lifting, ears twitching, sensing danger.
5–7s: The wolf bursts from cover, accelerating through brush, paws kicking up leaves and dirt.
7–10s: Fast side tracking shot — the deer sprints away between trees, muscles flexing, hooves striking mud, branches shaking as it passes.
10–12s: The wolf closes distance, weaving around trunks with powerful strides, focused and controlled.
12–14s: Near-contact moment — the wolf lunges forward, jaws close near the deer's hind leg, but the deer sharply changes direction.
14–15s: Final shot — the deer escapes deeper into the forest as the wolf skids slightly on wet leaves, chase unresolved.
Camera: wildlife documentary style, low angles, fast but readable tracking, slight handheld realism.
Environment: dense forest, moss, roots, mist, falling leaves, natural morning light.
Style: ultra-realistic wildlife behavior, grounded physics, natural motion, no graphic detail, no text, no overlays, stable proportions.
A screenshot from a live NBA game TV broadcast on ESPN. The camera cuts to the audience — a gorgeous Asian woman in her 20s with long black hair, perfect features, and a stunning figure in a tight low-cut top, sitting courtside. She smiles naturally, unaware she's on camera. Full ESPN broadcast overlay: scorebug, network logo watermark, 16:9 aspect ratio. The image looks exactly like a real TV screenshot — broadcast color grading, slight compression artifacts, interlacing grain.
Create a premium cinematic travel film featuring the entire city of London with 15 visually distinct scenes.
The video should feel like a Netflix-quality urban documentary mixed with luxury tourism cinematography and emotional storytelling.
Style: ultra realistic, cinematic, photorealistic, high dynamic range, dramatic lighting, rich atmosphere, smooth camera movement, realistic city scale, premium color grading, IMAX-style composition, detailed architecture, subtle film grain, volumetric lighting, realistic reflections, atmospheric weather transitions.
Aspect Ratio: 16:9
Resolution: 4K HDR
Frame Rate: 24fps cinematic motion
Camera Style: drone shots, FPV flythroughs, crane shots, slow-motion tracking, aerial panoramas, dynamic timelapses, stabilized cinematic movement.
Music Direction:
Epic orchestral-electronic hybrid soundtrack with emotional build-up, deep cinematic percussion, elegant strings, atmospheric synths, subtle British cultural influence, powerful drops during skyline reveals, emotional piano during sunset scenes, seamless transitions synced with visuals.
Scene Breakdown:
Sunrise aerial over the River Thames with golden morning fog rolling through London skyline.
Cinematic drone reveal of Tower Bridge with traffic lights reflecting on wet roads after rain.
Hyper-detailed FPV flythrough between skyscrapers in Canary Wharf at blue hour.
Luxury street-level cinematic shot of classic red double-decker buses moving through central London.
Slow-motion cinematic crowd movement around Piccadilly Circus with giant neon screens glowing at night.
Royal cinematic reveal of Buckingham Palace with dramatic cloudy skies and elegant camera crane movement.
Atmospheric rainy-night sequence in Soho with reflections, umbrellas, cafes, and cinematic neon lighting.
Massive aerial establishing shot of Big Ben and the Houses of Parliament during sunset.
Cinematic timelapse of London Underground trains arriving and departing with dynamic motion blur.
Emotional golden-hour sequence inside and around Lord's Cricket Ground, packed crowd atmosphere, cinematic cricket action, cheering fans, dramatic stadium lights turning on.
Wide drone orbit around The Shard piercing through clouds at dusk.
Elegant evening sequence of luxury boats moving along the Thames with city reflections shimmering on water.
Winter-style cinematic fog drifting through historic London streets with vintage architecture and warm street lamps.
Massive night skyline reveal showing the full illuminated London cityscape from above with cinematic cloud movement.
Final emotional ending shot: slow aerial pullback over London at dawn transitioning from night lights into sunrise, ending with a majestic cinematic atmosphere.
Overall Tone:
Grand, emotional, modern, immersive, sophisticated, globally iconic, visually breathtaking, emotionally powerful.
Negative Prompt:
low quality, cartoon, oversaturated colors, unrealistic buildings, shaky camera, distorted faces, bad lighting, low detail, flickering, poor motion interpolation, flat composition, cheap CGI look, text artifacts, blurry skyline, noisy footage.
Create a colorful cinematic video. Feature a realistic young Japanese woman running her cozy modern coffee café throughout the day with natural human movements and realistic environments. Follow the exact sequence of scenes: opening the café, grinding coffee beans, brewing espresso, steaming milk, making latte art, serving customers, taking orders, decorating desserts, cleaning counters, washing cups, restocking ingredients, managing the cash register, decorating the café, evening cleanup, closing the café, and finally relaxing with coffee at night. Use warm cinematic lighting, soft sunlight, café steam effects, shallow depth of field, smooth camera pans, close-up shots, realistic reflections, cozy ambience, seamless transitions, and premium commercial-style cinematography. The girl must look like a real human actress, not anime, not cartoon, not CGI animation. Make the atmosphere aesthetic, relaxing, emotional, and luxurious like a high-end coffee commercial in ultra-realistic 4K quality.
Generate a 3-second ultra-realistic 4K video using start and end frames.
Single continuous handheld push-in.
Begin on her partially zoomed face.
The camera smoothly pushes toward her mouth as it opens wider naturally.
Realistic lip mechanics, natural moisture highlights.
End on an extreme close-up inside the open mouth.
No distortion.
No exaggerated anatomy.
Preserve realism and texture accuracy.
Use reference image as the primary identity lock and keep my face consistent throughout the full video. Create a 15-second ultra-realistic cinematic celebrity arrival scene.
I exit a modern international airport like a famous star. I am wearing a stylish black leather jacket, a fitted dark shirt, dark blue jeans, and elegant coordinated shoes, all highly fashionable, masculine, and charismatic. My outfit must feel premium, balanced, and visually cohesive.
0–4s: Inside the airport exit area, automatic glass doors open and I walk out with calm confidence. Two professional bodyguards notice me immediately and move into position beside me. Medium-wide cinematic shot, realistic airport lighting, subtle crowd motion in the background.
4–8s: As I step outside, people recognize me. Fans and bystanders lift phones and cameras, photographers start shooting, bright camera flashes go off, people turn their heads toward me. I notice the crowd, keep walking, then make a calm respectful gesture: I briefly place one hand on my chest like saying "thank you / I appreciate you," give a small confident nod toward the cameras, then lower my hand naturally. Security keeps space around me. Slow forward tracking shot with strong celebrity energy.
8–12s: I continue walking toward my car with relaxed but important body language. I make slight eye contact with cameras, subtle cool expression, composed smile for one second, then return to serious charismatic focus. My bodyguards escort me on both sides, creating a VIP corridor through the crowd. Dynamic but smooth camera movement, cinematic depth of field, realistic motion.
12–15s: I arrive at a sleek black luxury car parked at the curb. A bodyguard opens the rear door for me. I pause for one final star moment as cameras flash intensely, then get into the car with effortless confidence. End on a polished cinematic hero shot.
Style: photorealistic 8K, premium celebrity documentary realism, ultra-detailed skin and clothing textures, realistic airport exterior, natural daylight, paparazzi flashes, clean sound design with crowd murmur, camera shutter clicks, footsteps, bodyguard movement, car door sound. No subtitles, no text, no logos.
Create a 15-second cinematic short video with a unique emotional storyline.
Scene starts inside a softly lit modern grocery store. A young woman (mid-20s, natural look, casual outfit) walks slowly through the aisles, picking up everyday items like milk, bread, and fruits. Camera follows her in smooth tracking shots, focusing on small details—her hands brushing over products, her thoughtful expressions.
Mid-scene (5–10 sec): She pauses while holding a chocolate bar. A subtle flashback overlay appears: a quick soft-focus memory of her laughing with someone special (suggesting nostalgia or a past relationship).
Final scene (10–15 sec): She gently puts the chocolate back, gives a soft, emotional smile, and walks toward the checkout. Camera lingers as she exits the store alone, but calm and stronger.
Style: cinematic, shallow depth of field, warm lighting, soft background music, emotional tone
Camera: slow motion, close-ups + smooth tracking shots
Mood: nostalgic, peaceful, slightly emotional
Quality: ultra-realistic, 4K, film-like color grading
A hand slowly enters the frame naturally and gently taps the smartwatch screen once. The display softly illuminates with a subtle ripple-style activation animation spreading across the screen surface. Warm sunlight shifts slightly through the curtains, creating delicate moving shadows across the wooden nightstand. Camera begins as a static overhead composition, then slowly pushes into a smooth 2x zoom toward the watch face after the tap interaction. Preserve original composition, watch design, lighting, wood textures, colors, reflections, and Scandinavian aesthetic. Smooth natural motion, calm premium lifestyle realism, soft cinematic atmosphere, aspect ratio 16:9.
A hyper-realistic female athlete in a modern, high-end gym environment, captured during a low-energy warm-up moment. She is standing near a squat rack, slightly bent forward, adjusting her wrist wraps while taking a deep breath. Her expression shows focus and calm determination, with subtle fatigue in her eyes as she prepares for an intense workout.
Appearance: athletic, toned physique, natural skin texture with visible pores and slight sheen of early sweat, minimal makeup, realistic facial features. Hair tied in a practical high ponytail with a few loose strands falling naturally.
Outfit: fitted dark sports bra and high-waisted leggings, breathable performance fabric with slight texture, paired with modern training shoes.
Lighting: bright, soft gym lighting with natural highlights, slightly diffused overhead lights creating gentle shadows on muscles. Subtle rim lighting to separate subject from background.
Environment: vibrant, premium gym interior with blurred background (shallow depth of field), visible gym equipment like barbells, plates, and benches. Clean, modern aesthetic with energetic color accents (reds, blues, neon hints).
Camera: medium shot (waist-up or 3/4 framing), eye-level angle, shallow depth of field (f/1.8 look), sharp focus on subject, softly blurred background.
Details: visible breath, slight sweat forming on forehead and collarbone, hands gripping wrist wrap tightly, veins slightly visible, realistic muscle tension.
Style Keywords: ultra-realistic, cinematic, 4K, HDR, shallow depth of field, natural skin texture, fitness photography, Nike campaign style, dramatic yet soft lighting.
A young woman in a weathered linen dress rows a small wooden boat through a misty river at golden hour, her dark hair loosely falling over her shoulders, expression calm and distant. Soft amber and rose light filters through dense mangrove trees, reflecting in the gently rippling water. Camera slowly pushes forward at low angle, just above the water surface, revealing her silhouette against the glowing fog. Hyper-realistic, cinematic grain, anamorphic lens flare, shallow depth of field, 4K, documentary-style lighting. Mood: quiet, ethereal, melancholic.
Extremely fast paced, realistic, cinematic FPV flying through Disneyland. Low altitude over Sleeping Beauty Castle, parade streets, and fantasy villages. Sharp dives past rollercoasters, spinning teacups, and fireworks launch zones. Gliding above rivers with boats, glowing lights, and crowded themed lands. Realistic textures, reflections, dynamic shadows, steam, and smooth fluid movement. Close passes through tunnels, animatronic sets, and neon-lit rides.
Create a 15-second ultra-realistic cinematic vertical (9:16) commercial video.
Scene Style: Modern skincare advertisement, clean minimal bathroom setting, soft morning natural light, premium commercial look, macro detailing of water, foam, and skin texture.
Sequence:
0–3s: Extreme close-up shot of a young man's hands. He squeezes face wash into his palm—thick gel drops in slow motion, highly detailed texture. Soft light reflects on the product.
3–6s: He rubs the face wash between his palms, forming rich creamy foam. Camera focuses on lather buildup with cinematic macro shots.
6–10s: He applies the foam to his face and gently massages in circular motion. Voiceover begins: "This face wash removes dirt, oil, and impurities…"
10–13s: Slow-motion rinse shot—water flows across his face, washing away foam. Skin appears fresh, clean, and glowing. Subtle cinematic zoom-in.
13–15s: He looks into the mirror with a refreshed, confident expression and says: "…for clear, smooth, and energized skin every day." Final product pack shot appears with soft glow and clean white background.
A hyper-realistic cinematic food preparation scene in a modern ice cream shop. A perfectly chilled stainless steel cold plate sits center frame, surrounded by small metal bowls filled with chocolate chunks, cocoa chips, sauces, and colorful candy-coated sweets (like Skittles). Camera locked in a slightly low, front-facing angle with shallow depth of field.
From above, a thick, glossy stream of creamy white ice cream base slowly pours down in a smooth continuous ribbon. The liquid stretches elastically and folds onto itself as it lands directly on a pile of vibrant rainbow candies at the center of the cold plate. The cream spreads slightly but keeps a soft mound shape, forming layered folds.
Bright studio lighting with soft reflections on the steel surface, clean white tiled background, professional dessert kitchen aesthetic. Subtle motion blur on the flowing cream, highly detailed textures (glossy liquid, matte candies, metallic reflections).
Background slightly out of focus: a red candy box visible on a glass shelf, minimal depth distraction. No hands visible, only the pouring action. No text, no subtitles.
Sound design (optional): soft pouring sound, light ambient kitchen noise.
Camera remains steady with micro cinematic focus breathing. Ultra HD, 4K, commercial food ad style, macro detail, realistic physics, smooth motion.
A cinematic ultra-realistic scene of [subject description], captured in [environment/location]. The subject performs [action/movement] with smooth, natural motion. Dramatic lighting with soft shadows and highlights, creating a moody atmosphere. Camera uses [camera angle, e.g., low-angle / drone shot / close-up] with slow motion effect and shallow depth of field. Background features [details like city lights, nature, fog, neon glow, etc.]. Color grading is [warm/cool tones], highly detailed textures, 4K resolution, realistic physics, cinematic composition, film grain, and smooth transitions.
Raw 35mm handheld cinematic footage, high altitude sun haze, intense lens flare and atmospheric glow, one single unbroken continuous tracking shot, no cuts, no edits, all in real time, 15-second duration. Photorealistic 8K, natural physics, correct fabric motion blur from 350 mph wind, realistic skin and hair movement, zero uncanny valley, zero artifacts, hyper detailed.
The main subject is the exact person from @[yourimage]: same face, same build, same skin tone, same casual expression. He is wearing baggy cargo shorts and flip-flops exactly as shown in @ Image1. He stands perfectly relaxed, casually balancing on top of the wing of a speeding F-16 fighter jet flying at 350 mph at 10,000 feet. The entire audio track is nothing but constant full-throttle jet engine roar mixed with powerful wind blast: no music, no dialogue, no other sounds.
At the 3-second mark the pilot leans out of the open canopy and gives a clear thumbs-up toward the guy on the wing. The guy from @[yourimage] leans forward slightly, smiles, and casually returns the thumbs-up.
At the 7-second mark he performs one completely casual, perfectly clean full backflip (no hands, no grabbing the jet, no assistance), rotating naturally in the air with perfect form and landing exactly on the same spot on the wing without even a single stumble or shift in balance. All motion and fabric physics must perfectly match the body and clothing from @[yourimage].
At the 12-second mark he casually brushes a tiny speck of dust off his shorts with one hand, then gives a bored, almost lazy little thumbs up directly to camera. Hard cut on the final frame.
Use the exact appearance, face, body proportions, and clothing from @[yourimage] for the man throughout the entire video. Ultra photorealistic, raw documentary handheld feel, extreme detail on fabric flapping in the wind, correct motion blur, natural lighting, impossible but believable physics, cinematic yet gritty 35mm texture.
A cinematic vertical 9:16 video set in a vibrant pixel-art RPG version of New York City during warm daylight. The environment is richly detailed in 16-bit/32-bit pixel style with animated elements: water shimmering with soft reflections, clouds slowly drifting, birds flying across the skyline, and subtle NPC movement in the background.
At the center, a fully photorealistic real woman (identical to reference image, unchanged facial features, same hairstyle, same outfit) is seamlessly integrated into the pixel world. She is scaled naturally like a game character (around 25–30% of frame height), walking slowly forward with smooth, realistic motion. Her body movement includes subtle arm sway, natural posture shifts, and slight head turns as if observing the world. Her expression remains soft and neutral.
Camera Motion (Highly Important for Virality)
Start with a slow cinematic push-in (dolly forward) toward the character
→ slight parallax effect between foreground (bench, lamp post), midground (character), and background (city skyline)
→ add a gentle handheld micro-motion for realism
Midway: → smooth side tracking shot as she walks
→ brief focus pull from pixel background to her face
Final moment: → slight orbit camera movement (5–10° arc) around her for depth and immersion
Environmental Animation
Water: subtle wave animation + light reflections
Trees/plants: gentle wind sway
NPCs: minimal looping animations (walking, talking)
Boat slowly moving in background
Floating dust/light particles for atmosphere
Pixel signboards flicker slightly
UI Animation (Game Feel = Viral Hook)
Top-left avatar: subtle bounce-in + health bar pulse
Mini-map: blinking location marker
Quest panel: text types in with soft pop effect
Bottom UI buttons: idle glow + slight hover pulse
Coin counter: small increase animation (+10 flash)
Cinematic Effects
Soft sunlight rays with warm tone
Dynamic shadows matching movement
Depth of field (background slightly blurred during focus moments)
Subtle motion blur during camera movement
Light bloom on highlights
Gentle lens flare when camera shifts
Viral Hook Moment (CRITICAL)
At 2–3 seconds: → a pixel ripple/glitch transition briefly passes through the scene
→ for a split second, the world “reacts” to her presence
→ UI elements pulse + slight sound sync moment
This creates a “wait… was that real?” effect
Suggested Audio Direction
Soft lo-fi RPG background music
Light ambient city sounds (water, footsteps, distant chatter)
UI click sounds synced with animations
Subtle “level-up” or sparkle sound during hook moment
Style Keywords (important for Seedance)
cinematic, ultra smooth animation, parallax depth, photorealistic human in stylized pixel world, seamless integration, warm lighting, cozy aesthetic, immersive, game-like UI, subtle motion, viral aesthetic, high detail
Negative Prompt (to avoid breaking realism)
no face distortion, no stylized face, no anime face, no exaggerated proportions, no oversized character, no floating feet, no mismatched lighting, no blur on subject face, no jittery motion
Create a 15-second ultra-realistic vertical (9:16) cinematic video of a young woman shopping in a modern grocery store.
Scene Style: Bright, clean, and aesthetically pleasing supermarket with soft natural lighting, slightly warm tones, shallow depth of field, and smooth cinematic camera movement.
Sequence:
0–3s: Wide establishing shot of a modern grocery store aisle. Shelves neatly stocked with fresh fruits, vegetables, and packaged goods. Soft ambient store sounds.
3–7s: Medium tracking shot of a young woman wearing a casual, stylish outfit (white shirt, light denim jeans, minimal makeup). She pushes a shopping cart slowly while scanning shelves thoughtfully.
7–11s: Close-up shots:
• Her hand picking fresh apples and checking quality
• Slow-motion of fruits being placed into cart
• Subtle smile as she compares items on a list
11–15s: Cinematic side profile shot as she walks down the aisle. Soft sunlight beams through store windows, creating a dreamy glow. Camera slowly pulls back as she continues shopping calmly.
Mood: Peaceful, everyday lifestyle elegance, slightly cinematic commercial feel.
Visual Quality: Ultra-realistic, 4K detail, smooth motion, natural skin tones, shallow focus, soft bokeh background.
POV of a young office girl running on a crowded city road, checking her phone — she's late for work. Fast cuts — she dodges pedestrians, jumps over a puddle, squeezes through traffic, almost drops her files but keeps running. Background sounds: traffic, footsteps, heartbeat increasing. She sees the bus arriving, sprints at full speed, reaches just in time, grabs the handle and gets in. Ends with her breathing heavily, slight relieved smile. Ultra-realistic, cinematic, motion blur, fast-paced, 4K.
Create a 15-second ultra-realistic cinematic vertical (9:16) wrestling sequence. Intense sports drama with gritty, high-energy atmosphere. Dimly lit underground wrestling arena with harsh overhead spotlights, dust particles in the air, and a roaring crowd blurred in the background. Wet mat reflecting light, sweat and motion emphasized with slow-motion detail. Two powerful male wrestlers with athletic, muscular builds. One in red gear, the other in black gear. Both highly focused, aggressive, and determined.
Style Keywords: cinematic, action, sports, realistic, drama, slow motion
A cinematic scene in the style of a historical war film set 500 years ago, on a massive ancient fortress wall inspired by old imperial-era architecture. The wall is extremely wide (around 15 feet) and stretches endlessly into the horizon, disappearing into mist and mountains. The environment is cold, dramatic, and filled with tension.
All soldiers are dressed in traditional war armor from 500 years ago: heavy metallic chest plates, leather straps, cloth layers, helmets with engraved designs, and battle-worn textures. The armor looks realistic, aged, and authoritative.
On top of the giant wall, a heavily guarded military convoy is moving forward. Five war prisoners are being forcefully escorted by soldiers. The prisoners are struggling and resisting, trying to break free, creating chaos and resistance during the movement.
The escorting soldiers hold sharp swords and tightly grip the prisoners, forcing them forward with strength and discipline. Their expressions are strict, focused, and emotionless, trained for war and control. Every movement shows tension and authority.
Around them, high-security guards stand at regular intervals along the massive wall. They carry long spears (halberds) that are visible even from a distance due to the wide camera shots. The spears reflect faint light, adding to the cinematic atmosphere.
At the far end of the wall, a large ancient war gate or fortress entrance is visible — heavily fortified, made of stone and wood, leading deeper into a military stronghold where the prisoners are being taken.
The prisoners continue to struggle while being dragged forward, creating dynamic motion and tension in the scene. Guards maintain strict formation, pushing them forward without stopping.
The camera slowly pans and zooms to reveal the scale of the fortress wall — emphasizing its massive length, height, and historical power. Mist and wind move across the structure, adding dramatic cinematic depth.
Style: ultra cinematic, historical epic war film, 500-year-old ancient empire aesthetic, realistic textures, dramatic lighting, wide-angle shots, slow camera movement, intense atmosphere, high detail.
Mood: tense, powerful, dramatic, historical realism.
A stunt rider in a matte-black helmet and armored racing suit accelerates a superbike along the narrow arm of a construction crane high above Shanghai's skyline.
At the 2-second mark the crane begins to collapse, cables snapping and steel beams twisting.
The rider hits the end of the crane arm and launches the motorcycle across open air toward a nearby rooftop.
Camera on an adjacent tower captures the full arc of the jump as the collapsing crane falls behind him.
The bike lands on the rooftop helipad and skids through scattered equipment.
Shanghai skyline, collapsing crane stunt jump, rooftop landing momentum, cinematic aerial scale, 4K.
Create a 15-second cinematic vertical (9:16) ultra-realistic fitness video showing a high-intensity gym training montage inspired by a collage-style workout sequence.
Scene Style: Premium modern industrial gym with dark metallic interiors, rubber flooring, and dramatic cinematic lighting. Strong contrast between shadows and highlights with subtle red and blue neon accents. Floating dust particles visible in light beams for depth and realism.
Character: A strong, athletic male/female fitness model wearing sleek performance gym wear (compression top, shorts, training shoes). Visible muscle definition, sweat detail, and natural fatigue expressions showing effort and discipline.
Video Flow (fast-paced montage):
0–3s: Warm-up stretches and mobility drills
3–6s: Heavy barbell squats and controlled breathing close-up
6–9s: Deadlifts and explosive power lifts with floor impact shots
9–12s: Dumbbell curls, cable pulls, and boxing bag strikes (quick cuts)
12–15s: Treadmill sprint finish → slow-motion cool down, deep breathing, head up, victorious look
Cinematic Effects: Smooth motion transitions, whip cuts between exercises, slight slow-motion on key lifts, dynamic camera angles (low angle power shots, side tracking, close-up sweat detail).
Mood: Intense, motivational, discipline-driven transformation energy. Emphasize a "no excuses, only progress" feeling without showing text, unless it appears subtly on a background gym screen.
CRITICAL INSTRUCTION: The reference image contains a 9-step chronological cooking storyboard. Animate the chef seamlessly through these exact 9 steps in order. Start at Step 1 (Flour Well), flow into Step 2 (Crack Eggs), then Step 3 (Mix). Continue the chronological progression through Kneading, Resting, Rolling, Cutting, and Boiling, finishing perfectly on the final plated dish (Step 9). Prioritize the strict sequence of actions.
15 seconds, 16:9, realistic, cinematic, tasty, natural camera movement.
How to use this system:
1. Generate the reference sheet in ChatGPT 2.0
2. Upload the reference image to Seedance
3. Create an animation prompt like the sample above
4. Set motion strength to medium-high + cinematic style
SCENE 1 (0–4s) — Entry & Discovery
A girl enters her bedroom after a long day. The camera follows her from behind as she opens the door. She pauses and looks at a messy, slightly chaotic room with scattered clothes and objects. Soft natural light enters through the window, creating a realistic, slightly dramatic mood.
SCENE 2 (4–8s) — Decision Moment
Close-up shot of her face as she sighs slightly and ties her hair into a neat bun. The camera slowly zooms in. Her expression changes from tired to determined. Subtle cinematic lighting highlights her focus and calm energy.
SCENE 3 (8–12s) — Cleaning Sequence (Speed Montage Style)
Fast-paced cinematic montage of her cleaning the room efficiently. Clothes are folded, items are arranged, bed is straightened. Smooth motion blur transitions, satisfying organization visuals, time-lapse style with soft aesthetic color grading.
SCENE 4 (12–16s) — Peaceful Ending
The room is now perfectly clean and minimalistic. She lies down gently on her bed, relaxed and peaceful, staring at the ceiling with a calm smile. Soft golden lighting, slow camera pull-back, emotional closure, serene atmosphere.
A highly cinematic 12-second video of a lone man walking through a scorching desert under intense sun. His clothes are torn and worn out from the harsh journey, and he carries a wooden stick for support. He looks exhausted from thirst and hunger, but continues walking with determination and silent resilience. The desert is vast, empty, and unforgiving, with heat waves rising from the sand.
As he slowly climbs a sand dune, the scene dramatically transforms on the other side: he discovers a lush green oasis filled with fresh flowing water, fruit-bearing trees, and vibrant greenery. The contrast is breathtaking — from dry desert to paradise.
His face instantly changes from exhaustion to overwhelming joy and relief. He looks up at the sky in gratitude, raising his hands in thankfulness, emotionally overwhelmed. Then he runs joyfully towards the water and greenery, full of hope and happiness.
Cinematic lighting, ultra-realistic style, emotional storytelling, dramatic contrast, smooth camera movement, high detail, 4K quality, film-like color grading.
You are in a real-life war zone captured on a handheld combat camera. Yapper is on the battlefield, engaging in an intense gunfight with trained soldiers. Continuous gunfire echoes loudly as bullets hit the ground, walls, and vehicles. Fighter jets fly low overhead at high speed, dropping powerful bombs that create massive shockwaves and dust clouds. Heavy military tanks move across rough terrain, firing shells and causing large-scale destruction.
Everything looks raw and realistic—natural lighting, real human movement, practical explosions, dust, smoke, debris, and camera shake as if filmed by a war journalist. No CGI or cartoon style—pure live-action realism. Sweat, dirt, and tension visible on faces. Sound design includes gunshots, distant explosions, jet engines, and battlefield chaos.
Yapper moves tactically, taking cover, reloading, and surviving in the middle of the chaos.
Add Yapper watermark/logo in the corner (subtle but visible).
Style: Live-action, ultra-realistic, cinematic war footage, handheld camera, motion blur, natural colors, documentary.
A photorealistic video sequence captures a young boy with messy orange hair and thick-framed glasses, as seen in image_0.png, image_1.png, and other source frames. He is dressed in a black basketball jersey and matching shorts with purple and blue trim, featuring the text "WIZZGEN 23" on the front and "CHICAGO 23" on the back (image_4.png). The setting is an outdoor asphalt city basketball court with green trees and a visible basketball hoop. The action begins with the boy in a low stance, dribbling the ball between his legs (image_0.png through image_3.png), then transitions to him standing taller and performing crossovers (image_5.png through image_7.png), followed by him successfully spinning the ball on his finger (image_8.png), and finally posing with a peace sign while holding the ball (image_9.png). The lighting is soft daylight under an overcast sky.
Generate a high-quality cinematic 15-second vertical video (9:16) of a teenage female street basketball player performing a smooth freestyle routine on an outdoor court. She has a slim athletic build, light tan skin, soft freckles, and wavy dark brown hair tied in a loose ponytail. She wears a cropped oversized jersey with "NEXORA" clearly printed on the front, loose high-waist shorts, crew socks, and stylish high-top sneakers with pastel accents (peach & mint).
Her vibe is confident, effortless, slightly playful — calm but skilled street energy.
0–2s:
She stands relaxed, spinning the basketball lightly in her hand.
Drops into a low stance and starts a controlled dribble, eyes focused.
2–4s:
Smooth in-and-out dribble into crossover, shifting her weight naturally.
Hair and jersey move subtly with motion.
4–6s:
Clean between-the-legs combo → behind-the-back transition.
Footwork tight, rhythm controlled.
6–8s:
She performs a hesitation + quick burst step, as if beating an invisible defender.
Confident expression.
8–10s:
A fluid spin move into step-back dribble, sneakers pivot realistically on asphalt.
Logo "NEXORA" stays visible.
10–12s:
Fast low dribble sequence side-to-side, keeping the ball tight and stylish.
Energy builds slightly.
12–13.5s:
She casually spins the ball on one finger, straightens up, slight smirk.
13.5–15s:
Final pose:
She catches the ball, rests it on her hip, gives a relaxed confident look.
Text fades in:
"Play Smart. Move Different."
🎨 STYLE:
realistic basketball freestyle, smooth street flow, confident female athlete energy, modern sports commercial vibe
🎥 CAMERA:
full-body framing, stable cinematic shot, slight push-in, smooth continuous motion, no cuts, fluid transitions
🌇 ENVIRONMENT:
outdoor street court, asphalt texture, faded court lines, chain-link fence, visible hoop, warm sunset lighting, soft shadows
🧠 QUALITY:
ultra-detailed, realistic ball physics, natural motion, clean composition, readable "NEXORA" text, 4K resolution
Ultra-realistic arctic wasteland at night, blizzard winds, frozen mountains barely visible through whiteout snow. Scientific expedition placing thermal charges across an ancient glacier. Ice begins cracking in glowing lines beneath their feet. Camera pulls backward fast through snow as an enormous humanoid machine rises from beneath the ice, launching frozen slabs into the air. Helicopters struggle in violent wind overhead. One colossal blue eye ignites through the storm. Final frame: titan fully standing, shadow swallowing the camp.
Image1 is the main character; maintain consistent facial features and body type throughout. The main character appears only once in every frame: no duplicates, no red-haired people in the crowd. Cinematic time-freeze short film, 15 seconds, ultra-realistic, Arri Alexa Mini shooting texture, 50mm lens, natural daylight, hard shadows, shallow depth of field.
[0:00-0:03] Busy cobblestone street in an Italian old town, normal time flow. Steadicam front-facing medium shot tracking: the main character wearing a loose linen shirt tucked into high-waisted jeans and white sneakers walks confidently through the crowd. Pedestrians walk, check phones, chat; a flock of pigeons flies across the bright sunny sky in the distance. As she walks, she raises her right hand and snaps her fingers.
[0:03-0:06] At the instant of the snap, a powerful white spherical shockwave bursts from her fingertips, carrying visible air distortion and light refraction, spreading rapidly in all directions...
A battle-hardened space marine in armored exosuit charges across the red dunes of a hostile alien planet under twin suns, sandstorms whipping up around jagged rock formations. The landscape shifts as buried ruins erupt from the ground and biomechanical creatures burrow through the earth. At the 1-second mark, he jet-boosts from a crumbling ledge toward a crashed escape pod. Camera orbits him dynamically as distant explosions light the horizon. He latches onto the pod's thruster, pries open the hatch, and activates shields just as a horde of insectoid aliens swarms the dune behind him. Desert planet skirmish, jet-assisted leap, armored suit sprint, epic sci-fi lighting, 4K.
Cinematic 15-second desert safari experience in the Dubai desert at sunset, composed of 15 rapid 1-second shots, each cut cleanly with smooth visual continuity, ultra-realistic golden sand dunes stretching across the horizon, warm sunset lighting with rich orange and amber tones, soft wind shaping fine sand textures, high-end travel and adventure cinematography style, consistent across all shots.
Shot List Sequence:
1. Aerial establishing shot of vast golden dunes under a glowing sunset sky
2. Smooth drone glide over rolling dunes creating depth and motion
3. Wide shot of a 4x4 vehicle driving across the sand leaving trails
4. Dynamic close-up of dune bashing with sand spraying into the air
5. Low-angle shot of wheels cutting through soft sand
6. Side tracking shot of the vehicle drifting along a dune ridge
7. Slow-motion shot of sand particles blowing in the wind
8. Silhouette of a camel caravan moving across the horizon
9. Close-up of a person riding a camel at sunset
10. Wide shot of a desert camp with traditional tents
11. Action shot of sandboarding down a steep dune
12. Medium shot of people relaxing at the camp
13. Close-up of traditional lanterns glowing in warm light
14. Transition shot as the sky deepens into orange twilight
15. Final hero pull-back aerial showing endless dunes fading into the horizon
Visual and Motion Style:
Fast cinematic cuts, smooth micro camera movements per shot including push, pan, slide, tilt, and orbit, physically accurate sunset lighting with warm tones, ultra-realistic sand textures with wind patterns, dynamic motion for vehicles and sand, soft shadows, no flicker, stable geometry, real-world motion blur, shallow depth of field where appropriate, HDR, ultra high definition, film-quality travel and adventure cinematography.
A photorealistic 16:9 in-game screenshot of a fictional next-gen open-world RPG titled "BULK: A Members-Only Adventure". Third-person over-the-shoulder camera following the player character — a tired suburban mom in yoga pants pushing an oversized flatbed cart down the aisle of a Costco warehouse store. Scene captures hyper-realistic warehouse lighting, towering pallet stacks, a free sample station ahead with an NPC in a hairnet glowing with a yellow exclamation mark above her head. Game HUD overlay: top-left mini-map showing aisle layout with quest markers; top-right stamina bar labeled "PATIENCE" three-quarters full; bottom-left compass with objective text "PRIMARY: Locate Kirkland Almond Butter (Aisle 11)" and below "SECONDARY: Sample 3/5 cocktail meatballs"; bottom-right item quick-slots showing membership card, car keys, snack bar; center crosshair with subtle interaction prompt "[E] Take Sample". Cinematic depth of field, slight chromatic aberration, photo-mode quality. All UI text crisp and legible. Realistic Costco signage in background spelled correctly. No watermark, no real Costco logo (use generic warehouse-club aesthetic).
A seamless, extreme FPV hyper-zoom starting from a wide view of Earth in space, rapidly plunging through the atmosphere and clouds. The camera dives into an aerial hyper-lapse of St. Petersburg, sweeping past the golden dome of St. Isaac's Cathedral. It descends smoothly to skim just above the water of a canal, accelerating towards the Palace Bridge. The camera flies directly through the raised, open spans of the drawbridge.
As it exits the bridge, the camera smoothly pans right and decelerates, seamlessly transitioning into a medium portrait shot of a young man sitting on the granite river embankment. The man has short textured hair with subtle highlights, light stubble, and sharp facial features. He is wearing a relaxed white button-down shirt, dark blue denim jeans, clean white sneakers, and a minimal silver chain bracelet.
He initially looks away toward the horizon, then slowly turns his gaze toward the camera with a calm, confident expression. The background features the open drawbridge against a soft, pastel twilight sky, with reflections shimmering on the water. Cinematic, hyper-realistic, continuous single-take, 8K resolution, photorealistic, smooth motion, natural lighting, ultra-detailed textures.
fpv
cinematic
city
st-petersburg
continuous-shot
realistic
✅Key Visual Prompt
Genre: XX
Brand Name: XX
Using this image as a base, generate a photorealistic poster image that exists nowhere in the world, with ideas that completely deviate from common sense.
Not just an extension, but leap the imagination to a level where "the meaning gets through, but the interpretation is utterly mad."
【Absolute Requirements】
・Photorealistic expression (reproducing the texture like live-action, sense of air, even light particles)
・All language in Japanese
【Elements to Exaggerate/Boost】
■Font Design
・The letters themselves materialize or become phenomena
・Fonts physically interact with the theme
■Text Placement
・Ignore normal layouts and place text into the space itself
・Placement with abnormally strong gaze guidance
■Composition
・Unrealistic perspective, extreme wide-angle, distortion, scale destruction
・Clear layers of foreground, midground, and background, with abnormally high information density
・A central element that grabs the eye in an instant + countless subtle dissonances in the details
■Lighting
・Movie-level cinematic lighting
・Intense backlighting, rim light, neon, particle light, volumetric light
・Light itself carries meaning (functions as leading lines or emphasis)
■Catchphrase
・Short, intense Japanese that makes sense yet comprehension can't keep up
・A copy that feels oddly fitting despite being out of sync with the situation
【Additional Staging】
・A sense of discomfort where reality and unreality coexist simultaneously
・Parts that ignore the laws of physics
・Quality that works as an advertisement (professional level)
Ultimately, make it a poster that is "understandable yet comprehension lags behind" and "demands a double-take."
If the reference image is a character, apply actions like wearing/using the brand, about to eat it, etc.
Do not use any real brand logos, company names, personal names, etc. at all—make it completely original. At final finishing, confirm nothing real is included.
✅Storyboard Prompt
Using this image as a base, create storyboard images for a 15-second video. This image serves as the end frame in a CM style. Use diverse camera angles like frontal, side, diagonal overhead, etc., and avoid duplicating the same framing. 9:16
A highly realistic cinematic scene of a calm indoor environment with soft neutral tones and natural lighting. The camera remains steady with a slow, subtle push-in movement. The atmosphere is peaceful and minimalistic, with gentle shadows and balanced composition. Slight ambient motion is visible — soft light flickering, faint environmental movement, and natural depth of field. The color grading is warm and slightly desaturated, giving a modern cinematic look. Ultra-detailed textures, realistic lighting, 4K quality, shallow depth of field, smooth motion, film-style grain.
A hyper-realistic cinematic product photography shot of a sleek black smartwatch placed on a wet reflective surface, covered with water droplets, during heavy rain. The environment shows a blurred cityscape in the background through a rain-covered glass window, with soft bokeh lights and moody overcast lighting.
The watch is positioned slightly angled, with detailed reflections visible on the wet surface below. Water droplets are visible on both the watch body and strap, enhancing realism. The screen is on, showing a modern minimal watch face with bold numbers and subtle UI elements.
Lighting is dramatic and soft, with cool tones, natural reflections, and high contrast. Depth of field is shallow, focusing sharply on the watch while the background remains blurred. Rain droplets on the glass add texture and atmosphere.
Ultra-detailed, 8K, professional product photography, studio lighting mixed with natural rainy ambiance, sharp focus, realistic reflections, cinematic composition.
Camera Settings: 85mm lens, f/1.8 aperture, ISO 100, shallow depth of field.
Style Keywords: photorealistic, luxury product shot, cinematic lighting, moody, high contrast, water splash, reflections, premium advertisement style.
**Environment:**
A frozen tundra under aurora-lit night skies. Pale green northern lights reflecting across a wide snowfield with icy winds sweeping snow particles across the ground.
**Action:**
15.0s sequence. A giant silver arctic wolf charges across the snow while a rival black wolf emerges from the drifting snowstorm. The two wolves collide in a powerful clash, sliding across the icy surface.
Velocity Ramp choreography: the moment their bodies collide freezes briefly as snow explodes around them before snapping back to full speed as they tumble across the frozen ground.
**Camera:**
Low tracking shot racing through blowing snow alongside the wolves, occasionally capturing the action reflected across icy surfaces.
**Style & Constraints:**
Photorealistic fur simulation, volumetric snow particles, aurora sky lighting, cinematic cold atmosphere, 35mm film grain, 8K.
A premium fast-food commercial product photograph of a gourmet cheeseburger centered against a warm golden-yellow seamless studio background. The burger features a glossy sesame seed bun, fresh lettuce, tomato slices, onion rings, melted cheese, a juicy grilled beef patty, and rich sauce. Soft studio lighting, subtle shadows, mouthwatering texture, sharp focus, ultra realistic food advertisement photography, clean composition, 8K.
Sneaker hands‑on hook – an energetic man holds neon‑green sneakers close to the camera in a skatepark, rotates them, slides his foot in and stomps; shot at golden hour with a handheld iPhone.
Kitchen discovery reaction – woman in sunlit kitchen opens a jar of chilli crisp, sniffs and tastes it; her eyes widen and she laughs while describing the flavor; handheld and natural.
Mirror try‑on – young woman stands before a mirror trying on a cream linen shirt and jeans; she turns to show angles and tells viewers the size and brand; soft window light and no music.
Coffee shop recommendation – a man at an outdoor café sips a flat white, speaks to his phone about life hacks and points to his notebook; blurred greenery and warm morning light make it cozy.
Street interview – multi‑shot prompt where different people on a busy sidewalk shout quick testimonials about a platform; quick cuts, handheld iPhone footage and bright daylight create energy.
ugc
social-media
multi-shot
realistic
advertisement
A cinematic 15-second time-lapse video of a house being built from an empty plot. The scene begins with clear, vacant land under daylight. Construction starts quickly: workers arrive, laying the foundation with concrete and steel. The structure rises rapidly: walls form, bricks are placed, and scaffolding appears. The roof is installed, followed by windows and doors. Exterior finishing and painting happen smoothly. The surrounding area becomes neat and landscaped. The final scene reveals a fully completed modern house standing beautifully on the plot. Smooth time-lapse transitions, dynamic camera movement, realistic construction details, bright natural lighting, high detail, 4K quality.
Ultra-realistic Indian classroom street fight. Single continuous shot, no cuts. Raw handheld mobile footage with natural micro-jitters, slight rolling shutter, no stabilization. Documentary realism.
Audio: No music. Only raw ambient sound: footsteps, desk friction, cloth movement, punches, fan hum, breathing, distant classroom noise. Lighting: Mixed warm + cool cinematic tones. Practical classroom lighting. Moving ceiling fan shadows. Visible dust particles in light shafts.
MAIN CHARACTER (STRICT IDENTITY LOCK): Indian male (17-18), cold emotionless face matching reference image exactly. Black straight side-swept hair, brown almond eyes, sharp jawline, natural Indian skin tone, slim face.
OUTFIT LOCK (NO CHANGE): Black school blazer, white shirt, black tie, black pants. No hoodie/layers. Slightly fitted blazer, loose tie. Must remain identical throughout.
OPPONENT RULES: All opponents have unique faces. No resemblance to protagonist.
ACTION: 0-2s: Protagonist grabs Opponent A from behind → one-hand head slam onto desk. Books/pencils burst outward. Desk drags with friction. A collapses limp.
2-5s: Grabs Opponent B → punch to abdomen → B folds → immediate head slam into desk. Head pressed briefly to tiles. Camera stabilizes briefly. B falls face-down. Heavy breathing, visible sweat.
5-6s: Blazer shifts slightly, shirt + tie visible (tie displaced). C & D enter from back, split and flank. Only footsteps, fan hum.
6-8s: Chaotic fight. D attacks with rapid punches. Protagonist stumbles slightly. C grabs blazer, pulls back. Knee to abdomen. Body bends forward. Heavy breathing. Fan shadows moving.
8-10s: Headbutt to D → D staggers. Protagonist grabs C's collar efficiently.
10-12s (BOARD IMPACT): Protagonist drives C backward into green chalkboard. Back hits first, head snaps slightly. Chalk dust bursts outward, fully visible in warm light. Chalk tray rattles, pieces fall. Micro slow-motion (0.2-0.3s) at impact → back to real-time. C loses tension, slides down leaving chalk smears, collapses. Dust drifts.
12-13s: D charges → protagonist sidesteps → rotational punch to cheekbone. Sweat arcs mid-air. D crashes into desks. Massive collapse, books scatter. D motionless, hand drops. Protagonist barely standing, heavy breathing.
13-14s: Medium-close. Chest rising. Slow turn to camera. Cold eye contact through glasses. Wipes blood from nose (visible smear). Removes glasses → throws aside, spinning impact.
14-15s (FINAL): Ground-level wide shot. Slight edge blur, warm tone, golden dust. Protagonist without glasses removes blazer → throws onto opponent. White shirt + black tie only. Tie loose, slight movement. Walks to exit. Blazer settles. Silent classroom. Dust floating. Fade to black.
Vertical 9:16 | Handheld | Natural Lighting | Soft Luxury Aesthetic
0 to 2s — Product Reveal (Close-Up)
Extreme close-up of a sleek serum bottle held loosely in a woman's hand. The "Kling 3.0" label faces the camera, fully legible. Warm morning sunlight softly catches the glass, creating a gentle gleam. Camera is slightly shaky, casual handheld feel. Background is softly blurred — a bright bathroom or bedroom vanity.
2 to 4s — Dropper Application
She tilts the dropper and squeezes 2 to 3 drops onto her fingertips. The serum catches the light as it drops — slightly golden, slightly translucent. Her fingers come together, spreading the product slowly. A faint relaxed smile begins to form. Mirror is partially visible in the soft background.
4 to 6s — Face Application
She brings her fingers to her cheekbones and begins pressing the serum gently into her skin using light upward motions. Her skin looks clean, bare, naturally glowing. The texture absorbs visibly. She looks calm, unhurried, at peace. Dewy finish begins to appear on her cheeks and nose bridge.
6 to 8s — Mirror Moment
Camera pulls back slightly to reveal her standing in front of a mirror. She looks at her reflection — not posing, just observing. Soft approval on her face. The mirror doubles the warmth of the scene. Natural light pours in from the side.
8 to 10s — Skin Close-Up (Face)
Tight close-up of her cheek and jawline. Skin looks plump, luminous, and hydrated. No filter-like perfection — real texture, real glow. Light bounces naturally off the high points of her face. Camera holds still for just a moment, almost like admiring the result.
10 to 12s — Bottle Detail Revisit
She picks up the bottle again, this time more gently — almost fondly. Fingers wrap around it. The Kling 3.0 label is visible again, facing camera at a slight angle. She tilts it slightly as if out of habit. Warm bokeh background. No words, no voiceover — the visual does the talking.
11 to 13s — Confident Mirror Look
She looks back into the mirror and gives a slow, subtle approving nod. Not dramatic — quiet confidence. Her eyes soften. A small natural smile. She gently tucks her hair behind her ear. Feels like a private morning ritual, not a performance.
13 to 15s — Fade with Product in Frame
Camera slowly drifts back. She sets the bottle on the vanity counter, label still visible. Morning light fills the frame. Scene feels unhurried and complete. Gentle handheld micro-movement keeps it authentic. Slow natural fade to soft brightness.
A young woman sits alone on a park bench at sunset, looking upset while scrolling her phone. A friend joins her and offers a Cadbury Dairy Milk with a light joke, turning the moment playful. She unwraps and breaks the chocolate, sharing a relaxed, warm exchange. After tasting it, her mood lifts as the scene brightens and they laugh together. The final shot focuses on the chocolate bar with a soft cinematic glow, highlighting a simple message of comfort and shared sweetness.
STYLE: photorealistic, high-end cinema, ultra-detailed, 35mm film look, shallow depth of field, dramatic lighting, motion blur, dynamic camera, seamless transitions, surreal continuity, high contrast, rich reflections
0:00 – 0:02 INT. CASINO – ROULETTE TABLE – NIGHT Low-angle cinematic shot. A sharply dressed man in a tailored suit sits at a roulette table, surrounded by scattered chips and beautiful women. Warm golden casino lighting flickers across his face. He bursts into uncontrollable laughter, head thrown back, eyes wild. The camera pushes in. He falls backwards, laughter echoing unnaturally.
0:02 – 0:04 VOID – GOLD BARS He plummets headfirst at extreme speed through a dark void filled with massive rotating gold bars.
0:04 – 0:06 INT. LUXURY JEWELRY STORE – NIGHT He continues falling headfirst at high speed down a pristine jewelry store aisle. Glass cases on both sides explode outward as he passes, diamonds and watches suspended in slow motion.
0:06 – 0:08 EXT. POOL PARTY – NIGHT He falls headfirst at high speed through a glamorous pool party scene. Beautiful women in luxurious dresses dance and laugh on both sides, champagne splashing.
0:08 – 0:10 VOID – CASINO OBJECTS He falls head first at high speed into another void. Floating casino jetons, poker cards, and dice swirl around him in zero gravity. Cards slice past the lens, chips collide in slow motion.
0:10 – 0:12 He falls headfirst at high speed through a void of men in suits fighting over money, boxing each other in angry rage.
0:12 – 0:14 He continues falling headfirst at high speed past a perfectly aligned row of identical versions of himself in suits. They stand in formation: some throw dollar bills, others cry, some are angry, some raise their fists.
0:14 – 0:15 EXT. DIRTY STREET – NIGHT Abrupt impact. He lands onto wet asphalt beside overflowing dumpsters. The lighting shifts to harsh, flickering streetlight. Silence. He now wears torn, filthy clothes—transformed into a beggar. A whiskey bottle loosely hangs from his hand. His laughter is gone.
ENVIRONMENT: Home → bedroom → kitchen → school gate
MOOD: Calm → rush → chaos → composed finish
⚡ SHOTS
Soft morning wake-up
Gentle call to child
Child resisting, sleepy chaos
Blanket pull + playful struggle
Checking time → sudden urgency
Quick bathroom routine rush
Brushing hair while walking
Uniform fixing on the go
Breakfast multitasking
Packing school bag fast
Searching missing item panic
Shoes + socks scramble
Out the door rush
Walking fast / slight run
Child distraction moments
Final adjustments before entry
Quick hug + goodbye
Watch child enter school
Relieved, composed pause
Cinematic, high-quality video of a beautiful young woman with auburn hair in a messy romantic updo, wearing an elegant, flowy, off-the-shoulder white dress. She is sitting gracefully on the cobblestone ground of a sunlit, grand European piazza, gently strumming a white acoustic guitar and singing a song. A fluffy white Ragdoll cat with a bushy tail is walking around her and affectionately nuzzling her leg. The background features classical baroque architecture, a large ornate fountain, and softly blurred pedestrians strolling by in the warm afternoon sunlight. Golden hour lighting, photorealistic, serene, and peaceful atmosphere with a shallow depth of field.
**Environment:**
A vast stormy ocean at night. Thunderclouds swirl overhead while lightning flashes across towering waves. A naval fleet spreads across the water below.
**Action:**
15.0s sequence from the POV of a colossal sea monster rising beneath the waves. The viewer moves through dark ocean water with bioluminescent currents swirling past.
At the 2-second mark the monster breaches the surface.
Warships appear above as massive tentacles crash across the decks.
Ships fire weapons while waves explode around the creature.
Velocity Ramp choreography: a tentacle strike hitting a destroyer slows dramatically — water droplets and sparks suspended in the air — before snapping back as the ship is hurled sideways.
**Camera:**
Fluid aquatic POV transitioning from deep ocean darkness to chaotic surface battle.
**Style & Constraints:**
Photorealistic water simulation, volumetric ocean spray, cinematic lightning illumination, realistic ship destruction physics, 8K.
A man struggling to walk forward against extreme wind, holding onto a pole for stability. Objects fly through the air as buildings begin to break apart. Coastal city under hurricane, heavy rain, flooding streets, debris everywhere. Handheld shot pushing into the wind with him, rain hitting lens, strong motion blur from flying debris, relentless environmental pressure.
A green sea turtle lifts its head above crystal-clear water, exhaling a misty plume that catches sunrise light; ripples race outward.
200 mm telephoto realism, 1/4000 s freeze, warm golden backlight.
audio: soft exhale + gentle wave slap
negative: no divers, no boats
playing the Spanish guitar on top of a moving flying drone inside a weathered Spanish apartment, cold light coming through the windows. Sound of the drone's propellers
Static locked off UGC frame on a girl at a table making matcha, with the exact same camera position and framing throughout, perfectly steady, with no shake, no drift, and no micro-jitter, and a clean, crisp image. The clip opens exactly on the start frame, with her holding the metal sifter over the bowl as the last of the matcha falls through, fine powder drifting down naturally in tiny bursts. She speaks in a natural female American accent, around 27 to 28 years old, calm and confident, with a relaxed conversational rhythm, slightly deeper than average, smooth and mature but still soft and feminine. She starts speaking immediately at the beginning, with her lips clearly moving on-camera through every word: “Okay, um…” As she says “Okay,” she instantly lowers her gaze down toward the white bowl and shifts her focus to what she is already doing, while her mouth continues into “um” without interruption. When she says “um,” it is barely audible, almost to herself, quiet, low, and absent minded, like a whisper. That small pause on “um” feels like a thought catching up to her hand, and her lips barely move. Her gaze drifts slightly to the left for a second, her eyes briefly flicking toward the camera and then back to the bowl, as her hand gives one last gentle tap to finish the sifting, her mouth still moving through the line without missing a beat: “I wanna show you.” The final powder stops, the mesh is visibly clean, and she lowers the sifter a little closer to the bowl as if checking that she got it all, finishing the last words with a quiet, confident ease: “how simple AI UGC is.” Her expression stays natural and unperformed, like she is just talking while doing the routine. Keep the identity, skin texture, and environment perfectly stable, with no warping, no morphing, no smoothing, no smearing or blending, no pixel mixing, and minimal motion blur. Preserve realistic powder behavior, metal reflections, and shadows. End exactly on the provided end frame.
Video 2
Prompt:
Static locked off UGC shot on the same table matcha setup, with the camera perfectly steady and the same framing throughout, with no handheld shake, no drift, and no micro-jitter. Clean, crisp image. The clip opens exactly on the start frame, with the empty sifter held near the bowl. In one slow, natural continuation, she sets the sifter down out of the main action area and reaches for a glass electric kettle, then begins pouring hot water into the bowl in a physically believable stream, with realistic weight in her grip, an accurate pouring angle, natural water flow, and subtle steam cues, while the environment, background, and object positions remain consistent with the start frame. The shot settles exactly into the end frame, with the water clearly pouring into the bowl. She speaks in a natural female American accent, around 27 to 28 years old, calm and confident, with a relaxed conversational rhythm, slightly deeper than average, smooth and mature but still soft and feminine. The clip begins with no introduction at all, she is already mid sentence, and her lips clearly move on-camera through every word. While reaching for the kettle and starting the pour, she says naturally: “But honestly”. When she says the word “honestly,” it ends with a slight upward tone, and then, as she picks up the glass electric kettle and just before she starts pouring the hot water into the bowl, she finishes the last words: “…it’s really not as hard as it looks”. She feels completely relaxed and unbothered. No identity drift. No skin warping or morphing. No texture invention, no smoothing, no smearing or blending, no pixel mixing, and minimal motion blur. Keep skin pores, hair, fabric, reflections on the kettle and bowl, and matcha surface behavior stable and realistic. End exactly on the provided end frame.
[BROADCAST SETUP] Live TV sports broadcast signal, 1080i HD resolution, 50fps. Shot on professional ENG stadium cameras. Standard TV color balance with natural daylight. No color grading, no cinematic filters, no artificial post-processing. Real-world stadium acoustics and ambient light. Audio Style: Immersive spatial sound design. Loud stadium atmosphere, crowd roar, metallic clink of the hammer chain, stadium announcer muffled in the background, wind noise on the microphone.
[TIMELINE SECOND BY SECOND]
0-4s: [Medium-wide broadcast shot] Swedish male athlete with long blonde hair and beard (Viking-like) spinning rapidly in the hammer throw circle. Authentic Olympic uniform. Real-life physics and momentum.
4-7s: [Broadcast tracking shot] The athlete releases the giant hammer. The camera pans fast to follow the arc of the heavy metal ball flying through the air against the stadium sky. Realistic high-speed physics.
7-12s: [Wide stadium view] The hammer heads towards the lower stands at high speed. A grandmotherly woman in the front row reaches out and catches the giant hammer firmly with both hands. No slow motion, real-time broadcast speed.
12-15s: [Reaction shot] The surrounding crowd jumps up and applauds. The woman holds the hammer, looking at the camera. Authentic live TV cut to the crowd's genuine reaction.
[QUALITY BOOSTERS] Photorealistic live footage, 1:1 sports broadcast physics, authentic Olympic stadium environment, sharp details on clothing and skin, natural motion blur from high-speed movement, stable facial features throughout the clip.
@Image1 is the main character — maintain consistent facial features and body type throughout. Appears only once per frame, no duplicates. Cinematic time-freeze short film, 15 seconds, ultra-realistic, Arri Alexa Mini shooting texture, 50mm lens, natural daylight hard shadows, shallow depth of field.
[0:00–0:03] Busy Alpine village street — wooden chalet shopfronts, cobblestone ground, dramatic mountain peaks visible at the end of the lane, clear blue sky. Steadicam front-facing medium shot tracking: the main character — a young man with dark hair, black-framed glasses, light stubble, silver chain necklace, wearing a loose printed short-sleeve shirt — walks confidently through the tourist crowd. Hikers with backpacks, shop signs, a postcard rack outside a souvenir store. A pigeon cuts across the sky above him. He raises both hands, interlaces his fingers, and snaps.
[0:03–0:06] A powerful white spherical shockwave bursts outward from his hands, carrying visible air distortion and light refraction, spreading in all directions. Pedestrians freeze mid-stride. The pigeon locks mid-flight overhead. Most strikingly — a street juggler beside the postcard shop freezes with three apples suspended mid-arc in the air above him, hanging perfectly still against the blue sky. Postcards fly off the rack and hang frozen in the air. Absolute silence falls over the village.
[0:06–0:09] Only his footsteps echo off the cobblestones. He strolls casually along the frozen street, glancing around with calm satisfaction. He passes the frozen pigeon overhead, pays it no mind. He slows as he approaches the postcard rack — reaches up and plucks one of the floating, suspended postcards out of mid-air, turns it over in his hand, looks at it briefly with a raised eyebrow. Sets it back. Keeps walking.
[0:09–0:11] He stops directly in front of the frozen juggler — three apples hanging perfectly mid-arc above the man's outstretched hands. He tilts his head, studies the composition. A slow smile. He looks at the camera, gives a small nod, and whispers "perfect."
[0:11–0:15] He turns back to face the street, interlaces his hands and snaps again — a second shockwave, stronger, bursts outward in reverse. Everything unfreezes instantly: the juggler catches his apples and continues seamlessly, postcards flutter back to the rack, the pigeon flaps away, hikers resume mid-conversation. City noise and mountain ambience rush back in. He calmly turns and walks toward the camera. Camera slowly rises and pulls back — his silhouette moves down the Alpine lane between the chalets. Fade to black.
Sound design: Alpine village ambience → double snap → shockwave rumble radiating outward → absolute silence → lone footsteps on cobblestone → whispered "perfect" → second snap → reverse shockwave burst → village sounds and mountain wind naturally restored.
Cinematic 4K 60fps realistic live-action spec advertisement, high-production film look, shot on film with subtle grain, warm natural indoor lighting.
PART 1 — Establishing Atmosphere (0:00-0:08)
Wide cinematic establishing shot of a cozy college library with tall beige bookshelves, warm sunlight streaming through large windows.
PART 2 — Character Introduction (0:08-0:18)
Medium shot of a young Western female student with shoulder-length light brown hair, wearing a red-and-white varsity jacket.
PART 3 — Detail & Research (0:18-0:28)
Close-up of hands flipping through books and design research papers.
PART 4 — Spark of Inspiration (0:28-0:40)
Medium close-up of the student pausing, then suddenly looking up with realization.
PART 5 — Creative Flow (0:40-0:52)
Montage sequence of fast writing, flipping pages, sketching concepts.
PART 6 — Resolution & Brand Feel (0:52-1:00)
Hero medium shot of the student confidently reviewing her work.
15-second, no-dialogue, fully immersive high-speed brutal fight short film.
Two characters: @image1 and @image2.
Live-action realistic style. Close-quarters, high-speed chaotic melee combat with no rules, fully out-of-control brawling. No flashy choreography—pure raw. Continuous exchange of punches, kicks, blocks, body checks, throws, and grappling entanglements. Ultra-realistic impact. Exaggerated yet physically grounded motion speed. Entire sequence in 120 FPS with no slow motion. Rapid dashes, evasions, and high-speed limb swings. Extreme motion blur and speed trails amplify intensity while preserving realistic weight and physics.
Master-level cinematic camera movement:
A professional film camera stays tightly locked to the action, flying dynamically with the fighters—high-speed dives, sudden stops, whip pans, and full 360° orbital tracking. The camera moves in sync with punches, kicks, and evasions, with responsive shake and displacement. One continuous take, no cuts. Sharp push-pull tracking, sudden directional shifts, fully integrated into the fight for a face-to-face immersive combat perspective. Smooth, zero-latency motion.
Master-level precision editing:
0–3 seconds: ultra-fast cuts to enter combat.
3–10 seconds: seamless action continuity with zero stutter or breaks.
10–15 seconds: dense, violent rapid-cut climax.
Editing rhythm escalates with combat intensity. Fast intercutting enhances chaos. Precise beat timing, no redundant shots. Constant suffocating high-speed pacing.
Environment Setting
Modern open-plan office with desks, computers, cabinets, printers, and swivel chairs; realistic layout with cool overhead lighting and side backlight for strong contrast. Slightly cramped space for added tension; no glass-breaking.
Environmental Interaction & Constraints:
Combat affects the space only through natural, physically justified collisions—papers scatter from air movement (not attacks), chairs slide or topple on impact, desks shift slightly without deliberate damage, and screens remain intact with minor vibration. No unnecessary or intentional destruction of equipment.
Body-driven combat: grappling, pinning, counter-throws, and realistic slams with subtle impact cracks/vibrations. Natural airflow—light paper/dust movement only, no exaggeration.
Final Action Design
Final 2–3s: both throw simultaneous close-range punches, freezing inches before impact—fully tensed, heavy breathing, locked in perfect deadlock.
Ultra-realistic cinematic magic realism, 4K, golden hour lighting, vibrant chic city street, shallow depth of field, smooth tracking motion, upbeat French house / nu-disco vibe.
A confident young woman walks through a sunlit cobblestone avenue in a flowing white summer dress and flats. The city feels lively with boutiques, reflections, and warm lens flares.
⏱️ TIMELINE (15s FLOW)
0:00–0:03
She walks casually down the street. Camera tracks smoothly. She pauses at a luxury boutique window displaying a cherry-red dress and gold heels, her reflection visible in the glass.
SFX: light footsteps, soft city ambience
0:03–0:06
Close-up — she smirks and lifts her hand. Magical energy builds subtly. The red dress and gold heels visually transform into glowing ribbons of light moving toward her.
SFX: rising magical hum, soft shimmer
0:06–0:08
Quick transformation — the light wraps around her and her white outfit seamlessly turns into a fitted cherry-red dress with gold stiletto heels.
SFX: fabric snap, magical chime, heel click
0:08–0:11
Low-angle tracking shot — she walks confidently down the street, dress flowing, heels clicking, golden sunlight reflecting off fabric.
SFX: rhythmic footsteps, music drop
0:11–0:13
She passes a vintage ice cream cart. A cone is offered — she playfully flicks her hand mid-stride, and the cone glides into her hand smoothly.
SFX: whoosh, small bell ding
0:13–0:15
She catches it perfectly, looks into camera with a cheeky confident smile, and takes a bite while walking forward.
SFX: upbeat music peak, light crowd ambience
FORMAT: 15s / slow cinematic pacing / 1 continuous shot with subtle camera movement
SUBJECT: A young woman matching the exact appearance of the reference image (same face, hairstyle, and features). She is wearing a comfortable, modest outfit — a simple half-sleeve dress that is relaxed-fit, non-revealing, and suitable for a calm evening outdoors.
SCENE: She is sitting on a quiet park bench at night, surrounded by tall trees. The environment is peaceful and still, with soft ambient sounds implied. Warm streetlights glow faintly in the background while a clear sky full of stars stretches above.
ACTION: The shot begins from behind her shoulder, slowly pushing in as she tilts her head upward. She gently looks at the stars, her expression calm and thoughtful. A light breeze moves her hair slightly. She takes a soft breath, eyes reflecting the night sky, fully absorbed in the moment.
CAMERA: Smooth dolly-in from a medium-wide over-the-shoulder shot to a soft close-up profile. Shallow depth of field, subtle bokeh from distant lights, no abrupt cuts.
LIGHTING: Natural moonlight mixed with dim, warm park lights. Soft highlights on her face, gentle shadows, realistic night exposure.
STYLE: Ultra-realistic, cinematic photography, soft film grain, muted tones, serene and introspective mood, 4K detail.
SUBJECT: A tired woman in @image1 in a loose tank top and sleep shorts, slow habitual movement. Slightly smeared eyeliner, bare feet, heavy posture, detached face.
ENVIRONMENT: A cramped cluttered apartment with an unmade mattress, scattered clothes, a narrow hallway, a damp bathroom with dim tile reflections, and a tiny kitchen crowded with dirty dishes and empty bottles. Warm practical lamps mix with sickly green neon leaking through blinds and door glass, turning the rooms into a humid late-night maze.
MOOD: Detached routine turns quietly uncanny, as if an unseen presence is floating above her and waiting for her to notice.
COLOR LOGIC: Matrix Green Look
CAMERA: POV overhead follow in a strict bird's eye view, locked directly above the top of her head at all times, perfectly centered over her body from start to finish, floating smoothly with no shake, tilt, angle drift, or side offset, passing through ceilings and door frames as one uninterrupted camera event. 24mm wide, digital clean look, locked overhead tracking package throughout.
SCENE:
She wakes on the mattress. Sits up under the lens.
Still centered under the lens, reaches to the floor. Picks up the cigarette and lighter. Places the cigarette between her lips. Lights it. Drops the lighter on the mattress. Stands up with the cigarette still with her.
Under the same overhead lock, crosses the cluttered room.
The lens tracks directly above her into the narrow hallway. Enters the bathroom.
Still pinned overhead, keeps the cigarette in her right hand. Extends that arm away from the running water. Leans on the sink. Turns on the tap with her left hand. Splashes water onto her face with her left hand.
The overhead follow carries her back out of the bathroom into the hallway. Moves along the hallway past the bathroom door. Turns into the tiny kitchen.
Still centered under the camera, keeps the cigarette with her. Reaches across the dirty dishes with her free hand. Picks up a glass from the counter.
Stops exactly under the lens. Holds the cigarette in her right hand and the glass in her left hand.
The lens stays fixed above her. Freezes. Looks right. Looks left. Takes a drag from the cigarette. Snaps her head straight up into the lens. Blows smoke toward the camera. Locks eye contact.
SFX: lighter flick, inhale, faint city hum, refrigerator buzz, soft bare footsteps, running water. Sodium-amber particles and toxic green neon reflect off tile, smoke, bottles, and damp surfaces.
FORMAT: 12–15s / 24fps / smooth handheld-stabilized camera / soft rhythmic pacing
SUBJECT: A young woman (use uploaded reference for exact facial identity), expressive and calm, just waking up. Facial fidelity must remain consistent throughout.
WARDROBE: Fitted high-waisted trousers + minimal crop t-shirt. Clean, modern, non-revealing styling. Neutral tones.
ENVIRONMENT: Cozy bedroom interior at night (early evening transitioning into night). Warm ambient lighting mixed with soft practicals (desk lamp, LED strips, window spill). Clean aesthetic, minimal clutter.
LIGHTING: Soft key light focused on face (front-left), gentle falloff. Warm highlights with subtle cool contrast from window. Skin tones accurate and natural. Slight glow on face.
CAMERA: Begins with a medium-wide bedside shot → slow push-in to chest-up framing. Subtle lateral drift for natural movement. No cuts until final moment.
ACTION:
Starts with her lying on the bed, eyes closed.
She slowly wakes up, blinking naturally, adjusting to the light.
Subtle stretch or shift in posture, relaxed and unhurried.
Sits up on the bed, calm expression, soft neutral mood.
Brief pause as she gathers herself.
EXPRESSIONS: Peaceful, slightly drowsy transitioning into calm awareness. Minimal movement, grounded presence.
AUDIO SYNC (implicit): Soft, ambient evening tone — low, gentle background music or room tone.
FINAL MOMENT (transition):
She stands or leans toward the window.
Camera gently follows from behind/side.
She looks outside.
ENDING VISUAL: Through the window: a calm, cinematic night cityscape — soft lights, distant buildings, slight atmospheric haze. Cool blue tones contrast with warm interior.
STYLE NOTES: Ultra-realistic, no over-stylization. Natural skin texture, accurate anatomy, no distortion. Maintain temporal consistency and identity lock from reference image.
A cinematic 15-second video of a young woman in a modern kitchen making fresh orange juice. The scene starts with her opening the refrigerator and taking out bright, fresh oranges. She places them on the counter and begins peeling them smoothly. Next, she puts the orange slices into a juicer machine and presses it, showing the juice being freshly extracted. She then pours the juice into a clear glass, adds a few ice cubes, and gently stirs it. Finally, she lifts the glass, takes a refreshing sip, and smiles. Soft natural lighting, clean aesthetic kitchen, smooth transitions, realistic motion, high detail, 4K quality.
A teenage boy, 17, athletic build, messy dark hair, wearing faded yellow shorts, a worn white tank top, and scuffed trainers, sprints across the rooftops of São Paulo's dense favela skyline as the sun burns deep orange behind the city. This time, he's not just running, he's being chased.
[0s–1.5s] Wide panoramic shot of São Paulo at sunset. Endless stacked concrete buildings. A lone figure runs across a rooftop. Two more figures appear behind him, gaining ground.
[1.5s–3s] Close tracking shot from behind. He sprints across corrugated metal roofing, footsteps echoing. A pursuer lunges. Without slowing, he sidesteps and elbows him mid-run, sending him crashing into a clothesline. He clears a rooftop gap in one fluid motion.
[3s–5s] He slides under a water tank, rolls, then pops up into a spinning back kick as another attacker drops in front of him. The hit lands clean. He vaults over a wall onto a lower rooftop, silhouette sharp against the orange sky.
[5s–7s] Running along a narrow ledge, 15 stories up. One chaser grabs his shoulder from behind. He twists, breaks free, and shoves him off balance. A chunk of concrete falls into the street below. He keeps moving, never looking back.
[7s–9s] A wide gap ahead. He builds speed. Just before the jump, another attacker cuts him off. A quick exchange of punches, one dodge, one body shot. He uses the momentum to push off the opponent and launches across the gap. Time slows as the camera circles him mid-air.
[9s–11s] He catches a fire escape railing. A pursuer follows and grabs the same structure. Hanging mid-air, they struggle. He kicks the attacker away, swings through, and climbs up to the next level in one motion.
[11s–13s] Climbing rapidly along window frames and AC units, he pulls himself higher. Below, the city transitions from sunset orange into glowing purple nightlife. Sirens faint in the distance. The chase fades.
[13s–15s] He reaches the rooftop edge. Stands still. Breathing heavy. Arms slightly out as wind hits him. The city stretches endlessly beneath him, lights flickering alive like stars. Cut to black.
Keywords: São Paulo rooftops, parkour chase fight, rooftop combat, cinematic action, sunset to night transition, dynamic camera, 4K.
A man in his late 20s, casual white t-shirt and jeans, holds up a green smoothie and takes a sip, then smiles at the camera. Bright kitchen, morning sunlight from behind. Handheld, slight natural shake. Warm tones, authentic, documentary style.
Cinematic short film, photorealistic VFX. 15-second desert hunt at high noon. Blinding light, zero shade, heat shimmer on everything.
[0–2s] Overhead drone — red desert canyon system. Sandstone towers, wind-carved arches, dry riverbeds. One figure: a woman on a sand-skiff — a flat wooden board with a triangular sail, gliding across compacted sand on bone runners. She wears wrapped linen armor, copper goggles, a weighted net coiled at her waist. She watches the sand.
[2–4s] ECU the sand surface — it's BREATHING. Rippling outward from a point 200 yards ahead. The sand rises like a slow dome. The sail-woman cuts the skiff hard right. The dome ERUPTS — a sandworm. 40 feet, segmented, chitinous plates the color of rust. Its mouth is a vertical iris lined with grinding teeth like a rock crusher. Sand cascades off its body. It roars — the sound is DEEP, below human hearing, but you feel it shake the frame.
[4–7s] Chase through the canyon — the skiff banks between sandstone pillars. The worm follows UNDERGROUND — you see its path as a moving ridge of sand, smashing through the earth, pillars cracking as it passes beneath their foundations. She throws the weighted net — it wraps around a sandstone tower, rope trailing to her skiff.
[7–10s] The worm surfaces directly in the skiff's path. She leans the board, catches wind, and the skiff JUMPS — airborne over the worm's back. In mid-air she drives a barbed stake into the top of its head plate. Lands the skiff on the other side. Rope connects stake to the sandstone tower.
[10–13s] The worm charges away — the rope goes taut against the tower. The tower HOLDS. The worm's momentum whiplashes it sideways. It crashes into a canyon wall. Sandstone debris cascades. Dust cloud erupts like a bomb. Silence inside the cloud.
[13–15s] Dust settles. The worm lies against the canyon wall, breathing but pinned. She brings the skiff alongside, pulls her goggles up. Beneath the goggles — calm, focused eyes. She draws a curved knife and begins cutting a single iridescent scale from the worm's flank — the harvest. That's all she came for. One scale. The worm hisses but doesn't fight. She pats its flank once, takes the scale, pushes off. The skiff catches wind. She vanishes between the sandstone towers. Cut to black. Ultra-realistic.
FORMAT: 15s / free rhythm / 1 seamless match cut / uninterrupted camera movement until the cut + action beginning immediately from frame one
SUBJECTS:
A solitary woman armed with a sword, dressed in worn fur and leather survival gear, struggles against a huge polar bear using raw, two-handed defensive movement. Later it is revealed that the same woman is inside her home wearing relaxed indoor clothing, where a VR headset appears only after the match cut and is removed in one clear motion.
ENVIRONMENT:
Open frozen tundra beneath harsh winter daylight, wind dragging powder snow across pale blue ice. The sequence transitions into a modest, lived-in interior through a carefully aligned visual match. The biting cold, visible breath, and glare of the wilderness give way to warm clutter, window light, and a faint glow from the game.
MOOD:
Immediate life-or-death intensity that abruptly resolves into everyday reality while preserving physical continuity of motion.
COLOR LOGIC:
Naturalistic cinematic film print emulation.
TIMELINE
0:00–0:07
The shot begins instantly in motion. A handheld wide shot collapses toward a medium-close framing as the woman retreats across frozen ground while the polar bear charges through blowing snow. The camera runs beside the action at roughly eye level, beginning near 28mm and gradually tightening toward 35mm, slightly unstable but close enough to keep both figures physically grounded in frame.
The bear rapidly closes distance while she plants her feet, recoils, and keeps the sword positioned defensively between them.
SFX: howling wind, boots grinding against ice, deep animal roar, fabric strain, blade slicing through air, snow scraping across the surface.
Hard winter sunlight side-lights the terrain, casting long blue shadows across the ice.
0:07–0:11
The movement continues without a cut, pushing into a tight close-up as the bear lunges into the final distance. Claws reach toward her shoulders and its jaws dominate the edge of frame.
At the height of the attack, a man's voice calls out: "Karla…" then louder: "KARLA."
She answers with a tired "Off."
At that exact response, time collapses into slow motion. Snow particles hang nearly still, the bear suspended mid-strike, while she alone continues moving at normal speed. The camera slowly arcs around her face in a clockwise drift.
Unimpressed rather than frightened, she lets the sword fall and raises both empty hands toward her temples in one smooth interruption gesture. No headset or device exists in this frozen world.
The camera maintains identical face scale, hand height, head tilt, lens distance, and rotational drift until the match cut.
SFX: fabric tension approaching impact, the distant voice calling Karla… KARLA, her quiet Off, wind stretching and fading toward silence.
Bright winter light catches suspended snow crystals around her face.
0:11–0:15
MATCH CUT.
The close-up aligns perfectly with the new setting. As her hands pass through the same screen position, the frozen tundra becomes a small home interior. The camera keeps the same clockwise drift and framing as the motion continues uninterrupted.
For the first time, a VR headset is visible over her eyes. She grips both sides and pulls it upward in a single smooth action. The camera widens into a medium shot as the headset lifts above her forehead.
She steps into a compact living room wearing loose indoor clothing. The handheld orbit reveals couch edges, scattered blankets, and cool daylight from a window. Her body language relaxes into mild irritation.
She looks toward the unseen voice, rolls her eyes slightly upward, and says:
"What is it."
Lens: 35mm, natural spherical look.
SFX: headset strap stretching, plastic shifting, quiet room ambience, soft footstep on the floor, faint game audio fading out, her breathing settling, her dry voice asking "What is it."
Indoor daylight replaces the stark winter contrast.
been testing a different workflow lately using tapnow. what makes it interesting is how it structures the entire process from idea → visuals → final video. instead of jumping between tools, you can actually build everything in one flow and refine it step by step like a real production pipeline. for the visuals, i'm using seedance 2.0 which is currently one of the strongest models for photoreal, human-centered video. but quick note — seedance 2.0 is currently only available in selected regions and requires a verified corporate email to access. still, the direction is clear: AI video is moving from "generation" → into "directing". also, they just launched a global challenge called "10,000 Parallel Universes" with a $200K prize pool. if you're exploring cinematic AI workflows, this is actually a good place to test ideas and push concepts further.
Food that feels alive when you try @yapper_so
Prompt
Hyper-realistic 4K food video, Thai street food cooking, Pad Kra Pao Moo in black wok, extreme close-up cinematic style, professional food cinematography, natural kitchen lighting with steam highlights,
Scene 1: pouring dark glossy sauce into screaming hot wok, instant sizzle explosion, aromatic steam burst, oil dancing, dynamic camera push-in
Scene 2: cracking soft egg directly into the mixture, yolk dramatically bursting and flowing like liquid gold over minced meat, slow-motion detail
Scene 3: adding fresh red chilies and holy basil leaves, high-heat stir-fry, leaves wilting instantly, glossy caramelized pork, intense motion and sizzle
Scene 4: final plating over steaming white rice, perfect composition, shallow depth of field, mouth-watering close-up, freeze frame at the end, photorealistic textures, no text, no watermark, ultra realistic, shot on iPhone 16 Pro food reel style
--ar 9:16 --stylize 250 --v 6 --q 2
Ultra realistic cinematic night scene, shallow depth of field, neon bokeh lights, a stylish young man (same face as reference image, sharp jawline, short styled hair, light beard, confident personality) walking confidently on a city street, holding a white coffee cup in one hand and scrolling his phone with the other, natural expression, slight tongue movement as if just finished eating, casual relaxed vibe, restaurant street ambiance
No camera cuts, no zoom, smooth steady tracking shot, same framing, cinematic lighting, orange and teal color grade
Suddenly, a loud metallic vibration sound, intense rumble, a heavy circular metal tunnel cap bursts out from the road and flies into the sky, sparks and dust particles, people around panic and look shocked, chaotic environment, but the man remains calm and looks up with curiosity
The metal cap crashes down violently, ground cracks open with extreme pressure, debris flying, from beneath emerges a massive alien creature, elephant-sized, biomechanical texture, glowing veins, aggressive stance, tearing through the underground tunnel
Extreme close-up of alien face, hyper-detailed skin, breathing heavily, then extreme close-up of the man's face, calm and fearless
The man casually puts his phone into his pocket, throws the coffee cup aside, instant transformation begins — futuristic white ice combat suit assembling over his body with micro mechanical details, glowing frost energy, ultra detailed armor formation, cinematic closeups (same suit as reference image, white glossy armor, glowing blue lines, helmet visor dark reflective)
The creature charges aggressively, pushing cars and taxis aside with force, destruction on street
At the last moment, the man teleports behind the creature in a burst of icy energy
Creature turns — instantly a powerful punch lands on its face
Extreme hyper slow motion close-up: face distortion, shockwaves, skin rippling, massive impact force, cinematic motion blur
NEW HERO MOMENT:
After the punch impact, the man is suspended mid-air in a powerful superhero pose, perfectly stable, surrounded by flying frozen debris and ice shards as the creature explodes into pieces, ultra cinematic lighting, particles glowing, frozen fragments spinning in slow motion
Camera slightly orbits him while maintaining cinematic framing, ice particles reflecting neon lights, epic hero aura
Then he slowly descends back to the ground with controlled motion, calm and dominant presence
As he lands, the ice combat suit begins to disassemble smoothly into glowing particles, transitioning back to his normal human look (same face consistency as beginning), seamless transformation
He adjusts his posture casually, starts walking again like nothing happened
People around stand frozen in shock, watching him silently
Cinematic ending, soft ambient sound, hero walks away into neon-lit street
This 21-second sequence in Seedance 2.0 started as one idea, but the first generation basically asked for a different story direction. I ended up with a video stitched from three separate generations. The first one was pure text-to-video (no references at all); then I extended the scene twice by 8 seconds each time, using the previous output as a video reference. Even after those two extensions, the consistency stayed rock-solid — same characters, location, colors, and overall audio mood across the whole sequence.
Highly detailed full-body shot of a premium life-size Iron Man Mark XLII-inspired robot statue made of polished mirror chrome and silver metal with intricate mechanical details, exposed pistons, rivets, panel lines, and articulated armor segments. The suit has a glowing bright blue circular arc reactor in the chest with energy rings, matching glowing blue eyes in the helmet, and open knee compartments revealing complex glowing blue circuitry and batteries inside. Dramatic studio lighting with strong reflections on the shiny metallic surfaces, subtle wear and battle damage on the armor. Dynamic three-quarter pose turning slowly on a display stand, cinematic volumetric lighting, hyper-realistic textures, 8k photorealistic, sharp focus, masterpiece, best quality.
Use a base identity reference image and preserve the subject's face 100% (no beautification or changes), with Elle Fanning as the main character while strictly maintaining her natural facial features, proportions, and realism. Place her in a modern ultra-high-rise office at night with floor-to-ceiling windows overlooking a vast city skyline. Add realistic investigation-style paper notes on the walls (wrinkled, taped, layered, partially overlapping). Use cinematic corporate lighting (cool blue city light + soft overhead + subtle warm desk lamp), shot on a 35mm lens with shallow depth of field, in a hyper-realistic documentary style, 4K quality. Then create a second image of a tall luxury NYC residential skyscraper at night, viewed from a distance with surrounding buildings, wet streets, atmospheric haze, cool exterior tones, and warm interior penthouse lights, shot on a 135mm telephoto lens with realistic proportions. Finally, generate an 8–10 second 1080p Kling 3.0 video using the skyscraper as the opening frame and the office as the ending frame, with a slow cinematic push-in toward a specific lit window, natural glass reflections, seamless transition into the interior (no cuts or morphing), realistic exposure shift from exterior to interior, and Elle Fanning remaining still throughout with only subtle breathing, no expression change.
cinematic
documentary
realistic
night
urban
portrait
Subject: A high-action cinematic shot of a blonde, bearded man in a black tactical combat suit with carbon-fiber-textured padding, engaging in a physical fight with a large polar bear.
Details:
The Man: Caucasian, muscular build, mid-length wind-swept blonde hair, thick blonde beard. He is wearing a matte black tactical suit with integrated chest and back armor plates.
The Polar Bear: Hyper-realistic, large adult polar bear with thick, off-white fur. The bear is wearing large, black leather-textured boxing gloves on its paws.
Composition & Pose: Low-angle, dynamic action shot. The man is caught mid-motion, leaning back as he narrowly dodges a massive punch from the bear. The bear's glove is inches from the man's face.
Setting: A flat, concrete rooftop of a modern building at sunset. In the background, a hazy city skyline (resembling Los Angeles) with skyscrapers, construction cranes, and glowing orange sunlight.
Lighting: Strong golden hour lighting coming from behind the city, creating long, dramatic shadows across the rooftop and a bright rim-light effect on the hair of the man and the fur of the bear.
Camera Specs: Cinematic wide-angle lens, sharp focus on the man's face and the bear's glove, shallow depth of field with the background city slightly blurred.
Style: Photorealistic CGI, high-fidelity textures, 8k resolution, motion blur on the man's hair to indicate rapid movement.
Aspect Ratio: 9:16 (Vertical)
FORMAT: 15s / 145 BPM / 15 SHOTS / beat-synced routine
SUBJECT: @[image1] < ATTACH YOUR IMAGE.
WARDROBE: Sleep tee and lounge shorts at home. Tailored jacket, fitted top, trousers, and lace-up shoes outside.
ENVIRONMENT: Tiny apartment, bright fridge glow, rain-dusted hallway, chrome metro, clean office, then a bedroom in cool window light. Everything feels glossy and lived-in.
MOOD: Late-for-work panic, clipped momentum, breathless urgency, then an exhausted exhale.
MUSIC: Fast percussive electro-pop
COLOR LOGIC: Hyperreal Pop Look
STYLE: Ultra-Realistic.
LOGIC RULE: Keep logical consistency in wardrobe, props, locations, and action continuity across all shots.
SHOT 1: ECU, 85mm push-in / 06:50 on the phone screen as it shakes on rumpled sheets. / SFX: alarm, sheet rustle.
SHOT 2: WS, 35mm handheld jolt / Rhythmic cut into her jolting upright through side light, throwing the blanket aside, and planting her feet on the floor in one rushed motion, still in a soft sleep tee and lounge shorts. / SFX: mattress bounce, blanket whip, sharp breath.
SHOT 3: MCU, 50mm slide / Cut on action into face wash at the sink, droplets catching the top light. / SFX: faucet rush, water slap.
SHOT 4: Insert shot, 85mm rack focus / Match cut into the toothbrush held at a natural forward brushing angle against the front teeth, hand relaxed and upright, mint foam and mirror eye. / SFX: bristle scrape, sink drip.
SHOT 5: Interior fridge view, 24mm wide / Object pass into the camera inside the fridge looking out as the door snaps open and her hand darts in, blue fridge light framing a hurried grab for breakfast ingredients. / SFX: fridge hum, bottle clink, shelf rattle.
SHOT 6: Insert shot, 50mm handheld / Rhythmic cut into eggs and toast hitting the pan under warm practical light. / SFX: butter sizzle, chop tap.
SHOT 7: MCU, centered 50mm push-in / Match cut into one rushed bite, a quick clock glance, and an immediate rise from the chair. / SFX: crunch, ceramic clink, chair scrape.
SHOT 8: Bird's-eye insert, 35mm overhead / Cut on action into striped socks snapping on. / SFX: fabric stretch, heel tap.
SHOT 9: MS, 35mm pivot / Camera wipe into a rushed outfit change as the sleep tee disappears under a fitted top and tailored jacket, then her tote, keys, and transit card get scooped up in one messy grab. / SFX: fabric whip, key jingle, zipper pull, bag rustle.
SHOT 10: Insert shot, 50mm overhead / Match cut into lace-up shoes slamming on as the laces yank tight in one impatient pull. / SFX: sole thump, lace tug, short breath.
SHOT 11: WS, 24mm parallax / Whip pan transition into her, now in the tailored outside outfit, rushing through the apartment door into corridor light without breaking stride. / SFX: latch click, rapid footsteps, hallway air.
SHOT 12: MS to CU, 35mm glide into 85mm push-in / Sound bridge into the metro car interior only as she grips the pole, shifts with the carriage sway, checks the passing station lights, and snaps a tense glance toward the closing doors, reflected chrome streaking around her and the city smearing outside the window. / SFX: rail clatter, carriage screech, door warning chime, tight breath.
SHOT 13: Insert to MCU, 50mm snap zoom / Smash cut to the office entrance as her access card hits the reader, the glass door unlocks, and she slips through fast before the chair roll and laptop open. / SFX: badge beep, door click, laptop chime.
SHOT 14: OTS, 35mm handheld / Rhythmic cut into fingers racing across keys, chat windows blinking, coffee by the trackpad, and notifications stacking faster than she clears them. / SFX: keyboard burst, notification ticks, mouse click.
SHOT 15: WS, 50mm pull-out / L-cut with a match from laptop close to apartment re-entry as the jacket drops, work clothes peel away, and she changes back into sleepwear before collapsing into bed in the opening frame shape. / SFX: door shut, bag drop, fabric rustle, blanket rustle, room tone.
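A rough sketch of the timing arithmetic behind the FORMAT line above (15s / 145 BPM / 15 SHOTS), assuming the shots are split evenly and then nudged to the nearest beat. The function name and the snapping rule are illustrative assumptions, not something the prompt or any video model defines.

```python
# Hypothetical helper: the name and the snapping rule are assumptions for illustration.
def beat_grid(duration_s, bpm, shots):
    beat_s = 60.0 / bpm                 # length of one beat in seconds
    total_beats = duration_s / beat_s   # how many beats fit in the clip
    shot_s = duration_s / shots         # even split: one cut per shot
    beats_per_shot = shot_s / beat_s
    cuts = [i * shot_s for i in range(shots)]
    # Snap each evenly spaced cut to the closest beat boundary.
    snapped = [round(round(c / beat_s) * beat_s, 3) for c in cuts]
    return beat_s, total_beats, beats_per_shot, cuts, snapped


beat_s, total_beats, beats_per_shot, cuts, snapped = beat_grid(15, 145, 15)
print(f"{beat_s:.3f} s per beat, {total_beats:.2f} beats in the clip")
print(f"{beats_per_shot:.2f} beats per shot on an even split")
print("first even cuts:", cuts[:4])
print("first snapped cuts:", snapped[:4])
```

At 145 BPM a 15-second clip holds 36.25 beats, so 15 equal shots get about 2.42 beats each. An even one-second split therefore cannot land every cut on a beat, which is why the sketch snaps cuts to the grid.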
FORMAT: 15s / 135 BPM / 13 SHOTS / beat-synced
SUBJECT: @[image1]
WARDROBE: Neutral streetwear, long coat
ENVIRONMENT: Busy city street → everything frozen mid-motion
MOOD: Confusion → curiosity → quiet control
MUSIC: Pulsing ambient electronic
COLOR LOGIC: Muted tones with sharp highlights
STYLE: Ultra-real cinematic
SHOT FLOW:
CU phone glitching time (08:12 → stuck)
Street crossing — people suddenly freeze mid-step
Coffee splash frozen in air
Paper flying — static mid-air
SUBJECT slowly walking through frozen crowd
Hand passing through suspended raindrops
Eye-level tracking through still chaos
Close-up: realization expression
Camera orbit — subject only moving element
Subtle smile / calm shift
Clock ticks again
Everything snaps back into motion
SUBJECT standing still as world rushes past
A colossal futuristic white-and-gray mecha robot named "Xeno Leviathan Terraformer", massive scale, intricate mechanical details, glowing cyan/blue energy accents on joints and eyes, hovering powerfully above Earth's atmosphere with clouds swirling around its legs, dramatic low-angle cinematic view.
In the foreground below: Godzilla with glowing blue atomic spines roaring aggressively on the left, and a massive King Kong (Gorilla) on the right, both tiny compared to the giant robot, standing in a coastal city being destroyed.
The robot is smashing into the ocean, creating enormous tsunamis, white water splashes, huge clouds of dust and smoke, flying debris, cracked earth. Epic scale, god-like perspective, planet curvature visible in the background, dramatic sky with stars and atmosphere.
Ultra-detailed, cinematic lighting, volumetric fog, dynamic motion, epic destruction scene, best quality, 8k, photorealistic yet stylized, in the style of high-end sci-fi concept art --ar 9:16 --stylize 250 --v 6
Top-down fixed camera on a clean white marble surface with soft, bright natural lighting shows a clear glass bowl of pale yellow egg yolk mixture being gently whisked by a realistic hand while another hand adds a thin stream of vanilla extract, creating soft swirls that blend smoothly without splashing; the scene then transitions seamlessly to a second glass bowl where egg whites are whisked more rapidly, transforming from transparent liquid into a light, airy foam with visible motion blur, surrounded by neatly arranged ramekins and eggs, maintaining a minimal aesthetic, consistent lighting, natural hand movement, and an ultra-realistic cinematic food photography style.
Cinematic, apocalypse, post-apocalyptic, photorealistic, 15 seconds, 16:9
[00:00-00:05] Wide shot. A dark, red-tinged sky. Flaming asteroids crash into a coastal city. Buildings collapse. Clouds of dust and fire. Hard, chiaroscuro lighting with deep shadows.
[00:05-00:10] Medium back shot. A lone figure, a man in tattered clothes, stands on a rocky cliff edge, overlooking the destruction. Wind blows his hair. Camera does a slow orbit around the figure.
[00:10-00:15] Close-up shot. The man's face, covered in dust. He slowly looks up towards the sky. His eyes reflect loss and determination.
Final wide shot of the burning city with the lone figure in the foreground. Atmospheric haze and dust particles.
Massive cinematic sumo wrestling match inside a traditional Japanese sumo arena, two enormous sumo wrestlers collide with incredible force in the center of the clay dohyō ring, their bodies slamming together as they push and grapple intensely, sand and dust erupting under their feet.
A packed arena surrounds them with thousands of spectators shouting and cheering loudly, banners waving, dramatic arena lighting illuminating the fighters while the crowd fades into shadow.
Camera: low-angle cinematic hero shot, slow push-in toward the wrestlers as they collide, brief slow-motion during powerful impacts, subtle camera shake for realism.
Style: ultra-realistic cinematic sports film, dramatic lighting, high detail sweat and skin textures, epic atmosphere, movie-quality color grading, shallow depth of field, 4K realism.
A lone man struggles to steady himself on a small boat in the middle of a violent ocean storm. Thunder cracks and heavy rain lashes down as towering waves crash around him. Suddenly, a sea monster bursts from the dark water, its massive jaws opening wide. It clamps its teeth onto the boat, splintering the wood, and violently drags it beneath the churning ocean as the man fights for his life. Dramatic lighting, cinematic camera angles, hyper-realistic, intense atmosphere.
Cinematic ultra-realistic 15-second short film, western girl in a dark jacket walking through a busy city sidewalk, natural daylight, hard shadows, shallow depth of field, shot on Arri Alexa Mini, 50mm lens. She walks confidently through moving pedestrians, phones, conversations, and pigeons flying in a bright sky. She snaps her fingers — a white spherical shockwave expands outward, freezing everything instantly: people mid-step, dust and leaves suspended, pigeons frozen mid-flight. Silence. She calmly walks through the frozen world, observing everything, gently touching a suspended pigeon. She stops in front of a fully clothed woman in a flowing red dress frozen mid-motion, studies her with a calm expression. She then snaps again — a stronger reverse shockwave restores motion across the city. Pedestrians resume walking, pigeons scatter into flight, leaves fall naturally. She turns and walks away as the camera pulls back into a wide cinematic aerial shot. Fade out.
I attached the 4 references and used this prompt:
[CINEMATIC SETUP]
Film stock: 35mm Kodak Vision3, anamorphic lens, f/2.8.
Color Grade: High-contrast "Bleach Bypass" look with desaturated earth tones and deep shadows.
Lighting: Dim, volumetric moonlight filtering through thick fog; dramatic rim lighting on characters.
Atmosphere: Heavy lingering fog, swirling dust particles, and organic debris.
The four characters [ @ image 1, @ image 2....] are in a defiant and ominous pose.
[STYLE & QUALITY BOOSTERS]
Photorealistic 8K, ultra-detailed textures, cinematic lighting, perfect motion blur, high dynamic range, no artifacts, coherent multi-character interaction.
A 90s-era home video: she is street dancing on a warm city street at dusk, in baggy 90s clothes, to an early-90s hip-hop track. A group of people around her cheer her moves, especially when she pulls out a massive move.
A cinematic cyberpunk portrait of a beautiful young East Asian woman with long flowing ash-blonde hair with dark roots, standing on a high-rise balcony overlooking a futuristic neon-lit megacity at night. She is a cyborg with a sleek black and silver mechanical right arm and shoulder, intricate metallic joints and exposed wiring visible. She wears a stylish cropped rust-red leather jacket with silver buttons over a black leather outfit with cutouts on the thighs. A large futuristic mechanical sword rests in her mechanical hand.
She has striking facial features, sharp eyes with subtle eyeliner, and a cool, confident expression. Her long hair dramatically blows and flows in the wind throughout the shot. She slowly turns her head and upper body from a three-quarter view facing the camera toward her right side, then slightly away, ending in a profile and back view as her hair whips across her face. The camera performs a slow, smooth orbiting movement around her while subtly tilting up and down, creating dynamic angles.
Moody cinematic lighting with strong rim lights from the city glow, soft bokeh lights from background skyscrapers, subtle lens flares, and atmospheric depth. Highly detailed, photorealistic, 8K, dramatic color grading with teal and orange tones, cyberpunk aesthetic inspired by Blade Runner 2049 and Ghost in the Shell. Slow motion feel, elegant and powerful atmosphere, 10 seconds duration, 16:9 aspect ratio.
Cinematic realistic animation, static locked wide camera. Cozy kitchen, morning light through window. Orange tabby cat in striped apron stands upright at wooden counter, flour dust on fur, all ingredients visible: milk bottle, flour bowl, fresh eggs carton, vanilla extract, sugar, whisk, mixing bowl, chocolate chips, blueberries, butter, stack of finished pancakes already visible on right side of counter. Pancake cooking on stovetop pan in background.
Cat grabs egg carton with both paws, cracks egg firmly on bowl edge — yolk drops in, shell tossed aside casually. Scoops flour with small paw into bowl — white dust cloud puffs up, coats cat's nose, cat blinks. Pours milk from bottle — steady glug. Drops vanilla extract carefully. Both paws grip whisk, beats batter vigorously in bowl — rhythmic clinking, batter splashes slightly, cat's whole body moves with effort. Cat peers into bowl, satisfied. Ladle scoops batter, carries it to stovetop pan — pours perfect circle, gentle sizzle. Cat watches bubbles form on surface, spatula ready in paw. Confident flip — golden underside revealed, fresh sizzle. Pancake added to growing stack. Handful of blueberries and chocolate chips scattered on top. Butter slice placed — melts slowly. Maple syrup poured in thick amber stream.
Cat picks up finished plate with both paws, turns to face camera directly, extends plate forward toward lens — single clear "MEOW." Whiskers twitch. Static camera throughout, never moves. Ambient kitchen sounds only: whisking, sizzling, butter melting. 15 seconds.
{
"prompt": "Cinematic scene on the Mongolian steppe. A young Asian woman with long black braided hair and a white headband stands in the middle of a vast grassland. She is wearing a thick white fur coat and gently holding a small brown lamb in her arms. Behind her are two people dressed in traditional brown fur nomadic clothing standing near white yurts. The wind softly moves the woman's hair and fur coat. The woman looks down at the lamb with a calm, emotional expression. In the background are large green mountains and an endless steppe under soft daylight. The camera slowly pushes in toward the woman creating a dramatic cinematic feeling. Natural lighting, ultra realistic, shallow depth of field, filmic color grading, epic cinematic composition.",
"style": "cinematic film",
"camera": "slow dolly in, shallow depth of field",
"lighting": "soft natural daylight",
"quality": "4K ultra realistic",
"duration": "5-8s"
}
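A minimal sketch for sanity-checking a JSON prompt like the one above before pasting it into a tool. The field names are copied from that block and the prompt text is shortened here. The required-key list and the `parse_duration` helper are assumptions for illustration; no specific generator is known to require this schema.

```python
import json
import re

# Field names mirror the JSON prompt above; the prompt text is shortened.
payload_text = """
{
  "prompt": "Cinematic scene on the Mongolian steppe.",
  "style": "cinematic film",
  "camera": "slow dolly in, shallow depth of field",
  "lighting": "soft natural daylight",
  "quality": "4K ultra realistic",
  "duration": "5-8s"
}
"""

# Assumed list of keys to check for; adjust to whatever your tool expects.
REQUIRED = ("prompt", "style", "camera", "lighting", "quality", "duration")


def parse_duration(text):
    """Turn a range like '5-8s' into (5, 8). Hypothetical helper."""
    match = re.fullmatch(r"(\d+)-(\d+)s", text)
    if match is None:
        raise ValueError(f"unexpected duration format: {text!r}")
    low, high = int(match.group(1)), int(match.group(2))
    if low > high:
        raise ValueError("duration range is reversed")
    return low, high


payload = json.loads(payload_text)  # raises ValueError on malformed JSON
missing = [key for key in REQUIRED if key not in payload]
duration_range = parse_duration(payload["duration"])
print("missing keys:", missing)
print("duration range (s):", duration_range)
```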
NASA APOLLO SPACESUIT A7L
"WORN ON ANOTHER WORLD"
DNA: July 20, 1969. 102 hours, 45 minutes, 40 seconds into the mission. One small step.
Arri Alexa 65. Ultra-wide. Grain progressive — starts clinical-clean, ends at maximum as the lunar surface appears. Key light: cold fluorescent Mission Control white. Secondary: harsh unfiltered solar light — no atmosphere to diffuse it. Flares: none in the lab sequence. One single overwhelming flare as the visor catches unfiltered sunlight on the lunar surface. Background: white clean room → vacuum of space → lunar surface at Tranquility Base.
00:00–00:02 · THE LAYERS
A single thread of nylon being measured under a magnifying glass by a white-gloved technician's hand. Then: cut to the 21 layers of the A7L suit being assembled simultaneously, each layer materializing and wrapping the suit form: the liquid cooling garment first, its tubes threading through the fabric like a vascular system. Then the pressure bladder. Then the restraint layer. Then the thermal micrometeoroid garment. Each layer distinct, each critical. Camera cross-section view through all 21 layers simultaneously.
00:02–00:04.5 · THE HELMET
Speed 30%. The polycarbonate helmet shell forms — perfectly spherical, flawless. The visor assembly drops in: the gold-coated visor — 24-karat gold, 0.0002 inches thick — pressing over the outer shell. Camera macro on the gold surface: it reflects everything in warm gold — including us. The neck ring locks with a quarter-turn — the mechanism designed to never, under any circumstances, fail.
00:04.5–00:07 · PRESSURIZATION
Speed 8%. The most important moment. Air flowing into the suit — the pressure building to 3.75 psi. Camera inside the suit as it pressurizes: the fabric stiffening, the gloves expanding slightly. Every seal tested by the pressure itself. The suit becomes a world — a personal atmosphere, the only thing between a human being and the void.
00:07–00:10 · THE CLEAN ROOM
Speed 15%. The suited figure, complete, standing in the white clean room under brutal fluorescent light. Technicians moving around it in blurred background. The suit reads in extraordinary detail: the layers visible at the seams, the connector ports, the mounting points for the PLSS (Portable Life Support System) backpack. A visor drops over the helmet. The astronaut disappears inside.
00:10–00:13 · TRANQUILITY BASE
Speed 3%. The surface of the Moon. The suit boot pressing into lunar regolith in ultra-slow motion — the print forming in dust that has not been disturbed in 4.5 billion years. Camera at surface level — the boot print in sharp focus, the horizon of the Moon in the background, the Earth hanging above it, the size of a marble. The suit in the background, the gold visor catching unfiltered solar light. The single most overwhelming flare of any film in this collection — total, white, sacred.
00:13–00:14.5 · THE REVEAL
Speed 1% — absolute stillness. The A7L floating in the void — the Earth behind it at distance. The gold visor reflects the Earth. The Earth reflected in the suit designed to walk on the Moon. Everything is contained in this one reflection.
00:14.5–00:15 · END CARD
Silence — total. Then a single radio crackle. NASA worm logo. "Worn once. Changed everything." No grain — this moment is too clear, too real. Hold. Fade.
space
cinematic
documentary
advertisement
realistic
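A small sketch, assuming you want to confirm that the timestamped beats of the spacesuit prompt above cover the full 15 seconds with no gaps or overlaps. The segment headers are copied from that prompt. The regex and variable names are illustrative assumptions.

```python
import re

# Segment headers copied from the prompt above.
TIMELINE = """\
00:00–00:02 · THE LAYERS
00:02–00:04.5 · THE HELMET
00:04.5–00:07 · PRESSURIZATION
00:07–00:10 · THE CLEAN ROOM
00:10–00:13 · TRANQUILITY BASE
00:13–00:14.5 · THE REVEAL
00:14.5–00:15 · END CARD
"""

# mm:ss(.fraction), an en dash, mm:ss(.fraction), a middle dot, then the title.
PATTERN = re.compile(r"(\d+):(\d+(?:\.\d+)?)–(\d+):(\d+(?:\.\d+)?) · (.+)")


def to_seconds(minutes, seconds):
    return int(minutes) * 60 + float(seconds)


segments = []
for line in TIMELINE.splitlines():
    m = PATTERN.fullmatch(line.strip())
    if m:
        start = to_seconds(m[1], m[2])
        end = to_seconds(m[3], m[4])
        segments.append((start, end, m[5]))

# A gap or overlap shows up wherever one segment's end differs from the next start.
gaps = [(a[1], b[0]) for a, b in zip(segments, segments[1:]) if a[1] != b[0]]
durations = {title: end - start for start, end, title in segments}
total = sum(durations.values())

for title, seconds in durations.items():
    print(f"{title}: {seconds} s")
print("gaps or overlaps:", gaps)
print("total:", total, "s")
```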
Character tone:
high-end romantic comedy, deadpan flirtation, over-serious male lead, quick-witted female lead, cinematic realism, sweet-chaotic chemistry, every frame like a poster
Male lead:
bespoke black suit, white shirt collar slightly open, handsome and severe, powerful aura, trying very hard to look cold and dominant, but secretly nervous and flustered, tiny tells betray him: slightly crooked tie, tight jaw, faintly trembling fingertips
Female lead [@ Image1]:
fitted slip dress / refined Chanel-inspired set, long hair slightly messy, elegant and soft-looking but emotionally sharper than him, stubborn, dryly funny, outwardly cornered for a moment, then visibly unimpressed, holding back laughter
Action + expression changes:
the male lead forcefully steps in for a dramatic wall-pin pose, one hand braced on the wall, closing the distance too seriously, trying to look intense; his expression starts cold but gradually cracks into restrained embarrassment
the female lead [@ Image1] steps back once, eyes widening, then notices his crooked tie and trembling hand; her expression changes from guarded resistance to deadpan disbelief and almost-laughing annoyance
their noses nearly touch, breathing overlaps, the tension becomes playful and absurd instead of painful
the male lead lightly lifts her chin, trying to recover his cool image; the female lead stares at him like she is watching someone forget his own script
Dialogue:
Male lead: Are you done yet?
Female lead [@ Image1]: Fix your tie first.
Male lead: I am being serious.
Female lead [@ Image1]: Then stop stepping on my heel.
Male lead: ...That was deliberate.
Female lead [@ Image1]: Your shaking hand says otherwise.
Show me a never-before-seen, brand-new film still shot on ARRI Alexa: hyper-realistic film grain, LUT preset, key and fill lighting, anamorphic lensing, shallow depth of field, 24mm.
A young woman with a neutral expression walks slowly through a crowded train station. People move quickly around her, creating motion blur, while she remains in sharp focus. She wears a black hoodie, minimal makeup, natural lighting. The environment feels busy and slightly desaturated.
Camera shots:
Front tracking shot (camera moving backward as she walks forward)
Side profile shot with crowd passing in foreground
Overhead drone-like shot showing her surrounded by moving people
Close-up on her face with shallow depth of field
Rear follow shot as she walks into the crowd
Cinematic lighting, soft natural daylight, realistic color grading, 4K, shallow depth of field, motion blur, emotional tone, urban realism
PROMPT TEMPLATE:
Cinematic close up shot of [SUBJECT], naturalistic film lighting, soft diffusion, restrained earthy color grading with warm highlights and cool shadows, layered depth composition with foreground interest and vast backgrounds, realistic material surfaces and micro-detail textures, subtle film grain, balanced cinematic contrast, moody atmospheric perspective and haze.
SUBJECT EXAMPLES:
→ A sheriff in a long corndog costume on a classic white two-story farmhouse porch with pickup truck, vast cornfield
→ A burning classic white two-story farmhouse with porch and parked truck surrounded by vast cornfield
→ A woman with windswept reddish-brown hair trying to move forward through a long cornfield
→ Or write your own subject and be creative!
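For batch use, the template above can be filled programmatically. The snippet below is a minimal sketch, not part of the original prompt: the names `TEMPLATE`, `PLACEHOLDER`, and `fill_subject` are hypothetical, and lower-casing a leading article is an assumption made so the subject reads naturally mid-sentence.

```python
# Hypothetical helper for filling the [SUBJECT] slot of the prompt template.
PLACEHOLDER = "[SUBJECT]"

TEMPLATE = (
    "Cinematic close up shot of [SUBJECT], naturalistic film lighting, "
    "soft diffusion, restrained earthy color grading with warm highlights "
    "and cool shadows, layered depth composition with foreground interest "
    "and vast backgrounds, realistic material surfaces and micro-detail "
    "textures, subtle film grain, balanced cinematic contrast, moody "
    "atmospheric perspective and haze."
)


def fill_subject(template: str, subject: str) -> str:
    """Return the template with the placeholder replaced by the subject."""
    if PLACEHOLDER not in template:
        raise ValueError("template has no [SUBJECT] placeholder")
    # Trim whitespace and a trailing period so punctuation does not double up.
    subject = subject.strip().rstrip(".")
    # Assumption: lower-case a leading article so it reads well mid-sentence.
    first, _, rest = subject.partition(" ")
    if rest and first in ("A", "An", "The"):
        subject = first.lower() + " " + rest
    return template.replace(PLACEHOLDER, subject)


if __name__ == "__main__":
    print(fill_subject(
        TEMPLATE,
        "A woman with windswept reddish-brown hair trying to move forward "
        "through a long cornfield",
    ))
```

Any of the subject examples above can be passed in the same way; the rest of the template text is left untouched.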
High-end commercial photography of a (black angus burger) labeled "(NYC BURGER HOUSE)", centered on a (warm golden gradient background) with a premium fast-casual aesthetic. The burger features (perfect grill marks, melted cheese, perfectly cooked beef patty, toasted brioche buns) with realistic steam and oil shine.
Cinematic lighting, shallow focus, ultra-realistic textures.
Style & Mood: Gritty, Rough, Raw documentary realism. Handheld 16mm grain, blown-out cockpit glass flare, muted military greens and grays against a pale high-altitude sky. Authentic cockpit instrumentation, no stylization.
Dynamic Description: Tight handheld close-up inside the cockpit: the pilot's gloved hand tightens on the stick, head snapping left toward threat, oxygen mask pulling with her breath. Smash cut to stabilized wide aerial shot: two real fighter jets carving across a pale blue sky, one banking hard to cut across the other's flight path, condensation streaming off wingtips in tight turns. Cut back to cockpit interior: her body pressing into the harness under g-load, instrument panel vibrating, warning tone audible. Handheld close-up: gloved fingers adjusting throttle, eyes fixed on HUD. Smash cut to external low-angle chase camera mounted near tail: her jet rolling hard left, the enemy aircraft visible above and behind, closing. She reverses. Enemy overshoots, crossing left-to-right ahead of her. Wide stabilized external shot: both aircraft now aligned, hers directly behind, HUD tone locking.
Static Description: Real stratospheric sky, pale blue fading to white at horizon. Authentic military fighter jets, worn paint, panel lines, exhaust wash. Cockpit interior: functional, worn, oxygen hose, ejection handle visible.
Audio: Pilot (over comms, oxygen mask muffled, flat and controlled):
A vertical transparent glass box filled with ocean waves, resting on the surface of the sea, hyper-realistic photography, rendered in hyper-realistic detail with Octane Render, cinematic quality
Ultra-realistic cinematic timelapse, natural daylight progression from cool morning light to warm golden evening, adaptive static camera fixed in one elevated corner of the kitchen showing the full space with subtle focal adjustments for depth and parallax as the area transforms. Realistic movements of workers, tools, and materials. Interior kitchen renovation transformation.
[00:00–00:01]
Wide static shot of a completely outdated 1990s kitchen in early morning cool light. Old laminate countertops with visible wear, dark wooden cabinets with peeling finish, faded linoleum flooring, outdated appliances, cluttered and dingy appearance, no modern elements. Workers arrive carrying tools, demolition equipment, and material boxes. SFX: distant footsteps, door opening, light morning ambience.
[00:01–00:03]
Rapid demolition and preparation phase: workers swiftly remove old cabinets, countertops, flooring, and appliances at accelerated speed. Debris is cleared, walls are patched and primed, electrical and plumbing lines are updated and concealed. New subflooring and under-cabinet lighting prep begins. Sun rises, shadows shift noticeably. SFX: hammering, sawing, wheelbarrow rolling, muffled worker instructions, debris removal sounds.
[00:03–00:05]
Installation of core structures: sleek matte white or light gray shaker-style cabinets are mounted quickly, quartz or marble countertops are installed and sealed, new stainless steel appliances (modern fridge, oven, range) are placed. Backsplash tiles (subway or herringbone) are laid with precision. Fresh hardwood or luxury vinyl plank flooring is laid down. Midday brighter natural light fills the space, highlighting clean lines. SFX: drilling, caulking, tile setting sounds, appliance positioning, subtle water running for testing.
[00:05–00:07]
Finishing and detailing phase: modern hardware, under-cabinet and pendant lighting fixtures are added and illuminated, sink and faucet are installed with flowing water test, open shelving and decorative accents appear. A sleek kitchen island with bar stools materializes. Late afternoon golden light streams through windows, creating warm reflections on surfaces. SFX: softer tool sounds, light clicking of switches, water gently flowing, faint satisfying placement sounds.
[00:07–00:08]
Final reveal of the completed modern luxury kitchen in warm evening light. The space is now bright, clean, and inviting with minimalist design, gleaming surfaces, organized countertops featuring fresh herbs or simple decor, soft pendant lights on, subtle steam or natural warmth. Calm, aspirational atmosphere with gentle evening ambience and light background music.
Camera behavior:
Adaptive static timelapse camera — fixed elevated corner position with slight intelligent reframing and minor zoom adjustments to keep the entire transforming kitchen visible and maintain cinematic depth as cabinets, counters, and details fill the frame. Natural parallax, realistic perspective shifts, and smooth material transitions. No abrupt cuts.
Mood and aesthetics:
Hyper-realistic renovation timelapse, smooth accelerated transformation from dated and cluttered kitchen into a bright, modern, high-end minimalist retreat. Emphasis on material textures (wood grain, stone veining, metallic finishes), natural lighting changes, clean progress, and a sense of satisfying achievement. Highly detailed surfaces, realistic physics of installation, and organic worker movements.
Total duration: 8 seconds (about 7 seconds of active transformation, 1 second final cozy reveal). Cinematic color grading, ultra-realistic quality, natural light interaction with reflective surfaces.
Handheld shoulder-mounted camera, natural shake, slight autofocus breathing, no stabilization.
Opening frame: a quiet residential street in late afternoon, soft golden sunlight, long shadows across parked cars. Ambient sound: distant traffic, wind, faint birds.
The camera slowly walks forward along the sidewalk.
At the 2-second mark, a black SUV suddenly enters frame at high speed from the left — tires screeching, suspension compressing unevenly as it loses control.
The vehicle clips a parked red sedan — the impact is abrupt and messy, metal folding naturally, glass shattering outward in uneven fragments.
No slow motion — everything happens in real time.
The red car is pushed violently onto the curb, its front end crumpling with realistic deformation.
Airbags deploy with a muffled pop.
The camera operator instinctively steps back — slight stumble, frame dips, then recovers.
Smoke begins to rise from the SUV's engine bay.
Sound design: raw — crunch of metal, tire friction, glass, no cinematic exaggeration.
Final frame: both cars at rest, subtle ticking sounds, no dramatic music.
Style: documentary realism, imperfect framing, natural lighting, no visual effects, 4K.
A cinematic continuous shot of a man riding a massive dragon in flight, starting from start_frame. Camera follows from behind and slightly above, tight framing (partial wings only). The dragon glides smoothly over a vast mountain range: no excessive flapping, only one powerful wing beat, then a long controlled glide. Physics must feel real:
• wings flex under air pressure
• cloak and hair react naturally to wind
• body weight shifts subtly during motion
The dragon tilts into a controlled dive, back arching slightly, wings adjusting angle (not flapping). It passes naturally through existing cloud layers (no artificial clouds), creating realistic displacement. Style: ultra photorealistic, cinematic, natural lighting, grounded motion, no exaggerated animation.
Treat the first frame as the initial state and the second frame as the final visual reference. A realistic human hand releases and propels a miniature shop scale model forward with natural motion. The miniature travels through the air following real-world physics, then precisely reaches the original shop's exact position.
Upon impact with the ground, the full-size shop begins forming progressively, assembling section by section in a believable construction-style reveal. The transformation includes subtle dust dispersion, realistic interaction with the environment, and physically accurate shadows.
Lighting remains consistent and cinematic while still grounded in realism. Camera position, framing, perspective, and background alignment must remain perfectly locked and stable throughout the sequence.
The transition should be smooth and controlled, with no visual artifacts, no distortion, no stretching, and no instability. Motion, scale, and timing must feel natural and convincing, delivering ultra-realistic physical behavior.
Theme: Humorous miniature horse salon haircut transformation
Visuals: Professional pet salon setting with a ring light, close-up shots of a fluffy miniature horse wearing a black grooming cape, realistic grooming tools and techniques creating a comedic “bowl cut” hairstyle
Camera: Close-up macro shots, alternating between front view, side profile, and top-down angles
Style: High-key lighting, realistic, comedic pet content, ASMR grooming aesthetic
Action:
The human hand gently combs through the miniature horse’s messy mane and fluffy fur upward — soft brushing sounds + fluffy fur rustling
[cut]
Side profile: water mist sprays onto the miniature horse’s head, fur becomes damp and sleek — spray bottle sound + fine water misting
[cut]
Top-down view: scissors trim a straight line across the wet fur held by the comb — snipping sounds + precise cutting motion
[cut]
Front view: thinning shears rapidly texturize the front “bangs” — quick snipping sounds + fur thinning texture
[cut]
Side profile: hairdryer blows air, fluffy fur fluffs up and moves in the wind — dryer hum + soft whooshing
[cut]
Front view: final touch-ups with comb and scissors perfect the round bowl cut shape — gentle combing + tiny snips
[cut]
Final reveal: the miniature horse sits proudly with a perfect mushroom-shaped bowl cut, blinks and looks side-to-side at the camera — satisfied horse neigh
FORMAT: 15s / free rhythm / 1 MATCH CUT / CONTINUOUS MOVE UNTIL MATCH CUT + IMMEDIATE ACTION FROM FIRST FRAME
SUBJECTS: A lone sword-bearing woman in weathered fur and leather fights a massive polar bear with desperate, two-handed survival movement. The same woman is later revealed at home in loose indoor clothes, where a VR headset appears only after the match cut and is pulled off in one clear motion.
ENVIRONMENT: Frozen wilderness under hard daylight, wind dragging snow across blue-white ice, then a modest lived-in home reached through a precise visual match. Winter glare and visible breath give way to soft clutter, indoor daylight, and a faint game-lit glow.
MOOD: Visceral survival tension snaps into grounded reality without breaking physical continuity.
COLOR LOGIC: Naturalistic Film Print Emulation
TIMELINE:
0:00-0:07: One unbroken handheld move, WS collapsing into MCU as the woman backpedals across the ice and the bear launches through blowing snow. The camera runs beside the leap at eye level, 28mm shifting to 35mm, slightly unstable and close enough to keep both bodies heavy and readable. The bear closes fast while she plants, recoils, and keeps the blade between them. SFX: (howling wind, boots grinding ice, low animal roar, cloth strain, blade cutting air, snow scrape). Hard winter sun side-lights the ice and throws sharp blue shadows.
0:07-0:11: Same unbroken move, no cut, tightening into a dead-on CU as the bear surges into the last inches, claws near her shoulders, jaws filling the frame edge. Right in the middle of the attack, a man's voice calls, Karla... then sharper, KARLA. She answers with a tired off, and on that reaction the world drops into slow motion. Snow drifts almost still, the bear hangs in its strike, and only she keeps moving at normal speed as the camera orbits into her face. Bored, not afraid, she drops the sword and brings both empty hands toward her temples in one smooth interrupt gesture. No headset, visor, or device is visible in the frozen world. Stay continuous until the match cut, keeping the same face size, hand height, head angle, lens distance, and clockwise drift. SFX: (cloth strain building to near impact, a man's voice calling Karla... KARLA, her tired off, then stretched wind fading toward silence). Hard winter sun catches the slowed snow around her face.
0:11-0:15: MATCH CUT. CU to MS. Seamless mid-motion transition as her rising hands cross the same screen position and the frozen close-up becomes the home interior with the same framing and clockwise drift. The motion continues uninterrupted, and now a VR headset is visibly strapped over her eyes for the first time. She grips both sides, pulls it fully off her face, and the camera opens into a medium shot as she drops it above her forehead and steps into a small living room in loose home clothes. The handheld orbit continues, revealing couch edges, scattered blankets, and cold window light as her posture falls into mild annoyance. She turns toward the voice, rolls her eyes upward, and says, What is it. 35mm natural lens, spherical. SFX: (headset strap stretch, plastic rub, quiet room tone, socked foot scrape, faint game audio, her breath settling, her dry voice saying What is it). Indoor daylight replaces the winter contrast.
ROCKET SURF.
STYLE: Gritty Cine Verité, 35mm handheld, natural shake. Continuous tracking shot. No cuts. All real-time.
LIGHTING: Bright, high-altitude sun, pure blue sky.
AUDIO: Rocket engine roar, wind, fiberglass creak.
TIMELINE: 0-3s: Guy in jeans and a black t-shirt is barely holding on to the side of an active SpaceX rocket at 12,000 feet. The rocket is climbing. 3-7s: Hard zoom-in cut on his face. His hair is plastered straight back. The ground is falling away below. 7-12s: The rocket hits max Q. The whole booster shakes violently. He grips tightly, his knees absorb it perfectly. 12-15s: He pulls a beer can out of his hoodie pocket, cracks it open. Takes one sip, cheers and yells: "Worth it!". Hard cut.
QUALITY: 8K photorealistic, correct physics, fabric motion blur, no artifacts.
A realistic black helicopter, seen from above, slowly approaches and hovers directly above the covered building. The helicopter stabilizes in the air, rotor blades spinning with natural motion blur and strong wind turbulence. The helicopter then attaches to the giant cloth cover and starts pulling it upward and sideways. The fabric reacts realistically: flapping, stretching, rippling, and flowing in the wind with natural folds. As the helicopter pulls harder, the cloth begins sliding off slowly, revealing the building facade step-by-step. The reveal is dramatic and satisfying, like a premium brand launch. The cloth keeps coming off gradually, exposing the full building structure underneath. Finally, the entire cloth clears the building, still attached to the helicopter. The helicopter lifts the cloth and exits the frame smoothly. Final hero shot shows the fully revealed modern luxury building (same as reference second image), crisp details, glass windows, clean architecture, cinematic lens flare, smooth camera movement, and premium commercial look. Ultra-realistic CGI, 4K, high dynamic range, cinematic color grading, smooth gimbal camera motion, depth of field, realistic lighting, dramatic but clean advertising style.
15-second continuous single-shot action sequence.
No cuts. No scene transitions.
Cinematic modern war realism.
Color palette: desaturated tones, dust beige, concrete grey, warm muzzle flashes.
Scene:
War-torn urban street. Destroyed buildings, debris everywhere, smoke drifting.
0–3s — tension build
Camera handheld, low behind a group of soldiers moving cautiously along a wall.
Breathing audible. Dust in the air.
3–6s — ignition
Sudden gunfire from a window.
Bullets impact concrete. Debris bursts outward.
Camera ducks with soldiers.
6–10s — chaos
Soldiers return fire. One throws smoke grenade.
Camera moves through smoke with them as they push forward.
10–13s — escalation
Explosion down the street. Shockwave hits.
Camera briefly loses balance, recovers.
13–15s — final frame
Soldier signals forward.
Camera holds on his face — focused, tense. Freeze.
global_settings:
  style: "Realistic kitchen scene, high-fidelity"
  perspective: "First-person POV (18-year-old male)"
  character: "21-year-old elegant woman (strictly maintain Image 1 features/art style)"
  audio:
    voice: "Mature/Onee-san female voice, gentle tone"
    dialogue: "好吃~ (Haochi~)"
    ambient: "Sizzling frying pan, soft natural laughter"
    music: "None (Silence except environment)"
  technical: "Continuous POV, no cuts, no watermarks, no text, 10 seconds"
scene_setup:
  location: "Kitchen, standing by the stove"
  action: "Woman in an apron is frying an egg; I approach her from behind"
storyboard_sequence:
  0_3s: "Camera moves closer to her back as she cooks. A hand enters from the bottom frame, picks a fruit from a bowl, and holds it to her lips."
  3_6s: "She glances sideways, smiles, bites the fruit, and says '好吃~' (Haochi~) with a satisfied, mature tone."
  6_8s: "She turns fully to the lens (looking at 'me'). A hand reaches out to wipe a droplet of juice from her lip. Her eyes curve into warm crescent moons."
  8_10s: "She turns back to the stove to flip the egg. Shot lingers on her busy back and the rising steam from the pan."
negative_constraints:
  - "Low quality, blurry, laggy, clipping, deformed anatomy"
  - "Immature appearance, non-POV, background music, text overlays"
  - "Missing feeding action, robotic movements"
The scene unfolds under the bright midday sun, where a vibrant group of Indian men and women from the USA gather around a crackling fire. Each individual embodies distinct features and attire, clearly differentiating the men from the women. Their tribal dance movements are uniquely expressive, with the men showcasing powerful, grounded motions while the women flow gracefully with fluid, rhythmic gestures. The camera captures sweeping wide shots to reveal the full circle of dancers, interspersed with close-ups that highlight intense expressions and intricate footwork. Enhanced with Hollywood-level effects, flickering flames and swirling dust create an epic atmosphere, immersing viewers in the raw energy and cultural richness of this authentic celebration. The scene pulses with life, blending realism and cinematic grandeur seamlessly.
10-second cinematic commercial, photorealistic, 16:9, luxury food advertisement style.
Scene: a warm, cozy breakfast kitchen nook with rustic wooden table, beige walls, hanging plants, ceramic mugs, and soft white curtains glowing in golden morning sunlight.
A Nutella jar with official Nutella logo clearly visible and readable on the label sits at the center of the table, realistic packaging, sharp branding, no distortion.
0–2s: cinematic close low-angle orbital shot, the Nutella jar vibrates slightly, lid pops open, thick glossy chocolate bursts upward in slow motion, shining in warm sunlight.
2–5s: slow-motion food explosion — swirling chocolate ribbons, roasted hazelnuts spinning, toasted bread slices flying, sliced bananas and strawberries floating, honey droplets sparkling, cocoa powder mist drifting in the light, ultra realistic food physics, depth of field, macro detail.
5–7s: camera transitions smoothly to overhead top-down commercial shot, knife spreads Nutella on toast in mid-air, glass of milk and hot coffee float into frame, cinematic product ad lighting, soft glow highlights.
7–10s: ingredients assemble perfectly into a beautiful Nutella breakfast board, jar placed heroically beside the food with logo facing camera, chocolate shining, steam rising from toast, final hazelnut rolls to stop near the jar, commercial ending shot, product focus, high-end advertisement look.
ultra realistic, cinematic lighting, food commercial, product commercial, film quality, slow motion, shallow depth of field, global illumination, high detail, realistic textures, smooth motion, brand logo visible, no text overlay, no subtitles.
FORMAT: 15s / ONE CONTINUOUS SHOT
SUBJECTS: An alluring, highly attractive female figure. She wears a highly detailed office-style pleated mini skirt and a plunging white blouse, with visible fabric textures, skin pores, and faint perspiration.
ENVIRONMENT: A brightly lit convention floor. The background is a blur of neon booth lights and passing silhouettes, heavily grounded in realistic textures.
MOOD: Starts as an observational and intimate showcase, twisting sharply into jarring psychological terror.
COLOR LOGIC: Naturalistic Film Print Emulation
TIMELINE:
0:00-0:07: MS. Camera begins at a low side angle, observing her in profile with one bare foot planted fully on the floor and the other bare foot delicately angled on its tiptoes. It slowly pedestals and arcs, admiring her shapely legs and the pleated office mini skirt as she shifts her weight slightly. 50mm lens, shallow depth of field. SFX: (muffled crowd ambience, close fabric rustling).
0:07-0:12: MCU. The continuous movement glides up her plunging white blouse as the arc completes, arriving squarely in front of her. The camera settles precisely at her chin, keeping her full face just out of frame. 50mm lens, creeping push-in. SFX: (room tone fades out, low frequency rumble builds).
0:12-0:15: CU. Without cutting, her soft smile shudders and distorts, her flesh smoothly and instantly twisting into a pale, ghastly supernatural face with wet dark seams. She opens her mouth impossibly wide and extends a long, glistening tongue directly at the camera. 50mm lens, macro close focus. SFX: (sudden dead silence, followed by a visceral wet sound and a harsh audio glitch).
SHOT1
Tight medium two-shot inside a dim apartment living room at night, warm practical lamp casting soft shadows across worn furniture.
A woman in her early 30s, pale skin, slightly messy hair, eyes red from holding back tears, stands rigid with arms crossed, shoulders tense. Facing her, a man in his mid-30s, unshaven, pacing slowly, unable to stay still, avoiding eye contact, jaw tight. The camera slowly pushes in, capturing the growing silence between them.
Woman (voice low, trembling but controlled):
"Say it. Don't walk around it… just say it."
SHOT2
Close-up on the man, warm light cutting across half his face, the other side falling into shadow.
His breathing is uneven. His eyes flicker — guilt, fear, resistance. He swallows, lips part and holds still — no escape.
Man (quiet, struggling):
"…You already know." A beat. He finally looks directly at her, tension peaking.
SHOT3
Extreme close-up on the woman, eyes glossy, one tear forming but not falling yet. Her expression collapses inward — not loud, but devastatingly controlled. She nods slowly, lips trembling, trying to stay in control. Camera pushes even closer, isolating her face from the background.
Woman (almost whispering, breaking):
"No… I need to hear you ruin it." Silence fills the room.
Tracking shot at street level, the camera races through a crowded city avenue as the ground begins to violently crack from an earthquake. Cars tilt, buildings split open. The camera weaves between falling debris, then tilts up as a massive skyscraper collapses forward. The shot pulls back rapidly as the shockwave chases the camera, swallowing everything in dust.
This hyper-realistic urban disaster special effects film, shot with an Arri Alexa 65 camera, utilizes high-contrast lighting to create a raw, textured atmosphere, three-dimensional smoke, and a chaotic, apocalyptic rhythm.
S1: A low-angle wide-angle tracking shot, filmed from a crowded street upwards, shows a gigantic, scaly snake tightly coiled around the Taipei 101 glass skyscraper, shattering its windows.
S2: A close-up slides along the snake's thick scales, which rub against the building's steel structure, sparking and scattering debris.
S3: A high-angle drone shot circles the top of the building, showing the snake roaring into the sky while a military helicopter fires missiles at its flanks.
S4: A wide-angle shot shows a violent, multi-level explosion in the middle of the skyscraper, the snake engulfed in flames and thick black smoke.
Realistic handheld footage of a MacBook Pro screen filling most of the frame, showing a Zoom meeting window with only one young woman in a tidy bedroom, attending a formal meeting from home. She wears a dark blazer and looks professional from the waist up. The room is bright, natural, and believable. The shot should preserve realistic screen reflections, subtle moiré pixel texture, tiny dust on the glass, and slight handheld camera shake.
After a brief moment, she hears a noise from the door offscreen. She glances to the side, slightly startled, then quickly stands up and starts walking away from her chair to answer it. Because the camera is filming the laptop screen, we see her moving inside the Zoom window. Halfway to the door, she suddenly freezes, looks down, and realizes she is only wearing underwear on her lower body. Her expression instantly shifts to embarrassment and panic as she remembers that her Zoom camera is still on. She spins around and rushes back toward the screen in a frantic, awkward, comedic way. She quickly returns to the laptop and blocks the camera with both hands or throws herself in front of it, covering the lens and ending the shot in chaotic close-up.
The tone is realistic and comedic, with strong contrast between formal upper-body business attire and the accidental lower-body mistake. Emphasize awkward humor, authentic facial acting, natural body motion, realistic indoor lighting, handheld movement, slight motion blur, and believable Zoom-call visuals. Keep it non-explicit: no nudity, no revealing details, no erotic framing, no vulgarity. The focus is on embarrassment, urgency, and comedy.
{
"description": "A cinematic scene opens with an ultra-wide view of a sunlit coffee plantation at golden hour. Hundreds of roasted coffee beans lie scattered across the plantation rows. Slowly, the beans lift into the air, carried by a warm breeze. They rise and swirl gracefully in controlled, slow-motion spirals as the camera gently floats upward with them. The swirling beans interlock mid-air, forming the precise silhouette of a premium coffee jar. The shape pulses subtly once, then smoothly transforms into a real Nescafé coffee jar hovering above the plantation. The camera pushes in closer, revealing the Nescafé branding sharply in focus on the label, clean and clearly readable, with natural reflections and subtle highlights emphasizing the logo. No text.",
"style": "cinematic, hyper-realistic premium coffee commercial",
"camera": "smooth cinematic camera that rises with the beans, then transitions into a slow, steady push-in for a brand-forward product close-up",
"lighting": "warm golden-hour sunlight with glowing highlights on coffee beans and soft rim lighting around the jar; controlled reflections to keep Nescafé branding clear and legible",
"environment": "open hillside coffee plantation with neat rows of coffee plants, soft shadows, warm breeze, and a calm natural atmosphere",
"motion": "slow, elegant motion throughout; beans lifting and swirling in slow motion, followed by a stable hover and minimal camera movement to emphasize branding clarity",
"ending": "a hero product shot with the Nescafé coffee jar centered in frame, floating calmly; the label and Nescafé logo are crisp, front-facing, and fully readable, conveying premium quality and brand confidence",
"tone": "natural, sophisticated, premium elegance",
"color_palette": "rich browns, deep blacks, warm amber golds, and earthy greens",
"duration": "10 seconds",
"aspect_ratio": "16:9",
"text": "none",
"keywords": [
"Nescafé branding",
"logo clarity",
"hero product shot",
"premium coffee commercial",
"cinematic slow motion",
"hyper-realistic",
"no text"
]
}
Use 🩵Image 1 as the first frame, referencing the character design, outfit color palette, and overall visual style of 🩵Image 1. The girl is performing a high-speed downhill skateboard ride on a winding suburban mountain road. The shot uses a Steadicam follow perspective, with an intense sense of speed throughout. The powerful wind generated by the fast ride makes her hair and clothing whip violently in the air.
At the beginning, the girl pushes off with one foot to gain speed, then lowers her body to reduce wind resistance and continues accelerating. The scene features heavy motion blur to emphasize the extreme speed of the skateboard. While riding, she repeatedly shifts her center of gravity downward and leans left and right through multiple turns on the road. As she carves into the corners, the arm on the inside of the turn lowers as if lightly trying to touch the ground. On straight sections, she bends forward, keeps her knees low, and places both hands behind her back to minimize drag.
In the distance, fireworks are going off above a seaside town, while a passenger airplane flies across the sky. The overall visual style should be ultra-realistic, with highly lifelike image quality and realistic photographic cinematography.
No background music, only environmental sound design.
{ "duration": "10s", "aspect_ratio": "9:16", "style": "ultra-realistic smartphone video, iPhone camera look, natural lighting, slight handheld micro-shake, HDR, realistic colors, minimal cinematic grading", "camera": "handheld close-up shot, subtle natural shake, slight auto-focus breathing typical of smartphone cameras", "scene": "A casual indoor setting with a grey leather sofa. Three tiny animals sit in a row: a fluffy guinea pig (left), a small tabby kitten (center), and a tiny baby rabbit (right). The scene feels like a normal home video, slightly imperfect framing, natural daylight coming from a nearby window.", "action": "A human hand enters from the right, forming a playful finger gun. The person casually says 'Pew!' in a normal tone. The guinea pig tips over onto its side in a playful, exaggerated way. The hand moves to the tabby kitten, says 'Pew!' again, and the kitten flops sideways gently. Then the hand points at the baby rabbit. The rabbit stays still, staring directly at the camera with a stubborn expression. A short awkward pause. The person says in English, slightly amused, 'Come on… really?' The rabbit continues staring for a moment, then slowly and reluctantly tips over onto its side.", "details": "Realistic imperfections: slight motion blur, natural shadows, tiny exposure shifts, autofocus adjustments. Animals have subtle breathing, blinking, and small movements. Fur detail is realistic but not overly sharpened.", "audio": "casual room ambience, slight background noise, natural voice recording from phone mic, soft 'Pew!' sounds, followed by 'Come on… really?' in English", "mood": "funny, candid, wholesome, viral social media style"}
Luxury executive office, tight cinematic close-up of two professionals discussing company strategy and client deliverables. Natural corporate conversation, realistic lip sync, subtle hand gestures, confident body language, soft daylight, shallow depth of field, slow push-in.
People sitting in an office, working.
[cut] Close-up shot of a man typing on a keyboard.
[cut] Close-up shot of a screen with MS Windows, with words being typed.
[cut] Close-up shot of a man writing in a notebook with a pen.
[cut] Close-up shot of a woman typing on a laptop.
Train passing by in front of the house.
[cut] Close-up shot of train wheels as it rides.
[cut] Close-up shot of the smoke coming out of the chimney.
No music, no talking.
People walking in a train station.
[cut] Different shots of people walking with suitcases.
[cut] Close-up shot of the schedule screen as it changes.
No music, no talking.
An enormous wide aerial reveals a frozen polar world, with a tiny rescue helicopter crossing a giant cracked ice shelf surrounded by jagged blue glaciers and black arctic sea. The scale feels impossible and ominous. The camera dives violently from high above and races alongside the helicopter as its blades hammer through snow mist. It cuts tight around the cockpit, drops below the skids, then tracks behind as the ice beneath begins to split open into massive glowing blue chasms. Giant slabs tilt and collapse into freezing water while the helicopter threads between exploding ice towers and whiteout spray. The climax: it emerges through the chaos into a hidden circular crater filled with glowing turquoise meltwater and a giant ancient ship frozen perfectly beneath the transparent ice.
Ultra-wide aerial establishing shot: A convoy of cargo trucks winds along a narrow mountain road cut into a steep slope above a vast jungle valley shrouded in mist, emphasizing how small and exposed the vehicles are against the massive landscape. Detail shot of the hillside: Loose stones begin bouncing down from the soaked slope above the road, cracks split through the mud, and several trees lean at unnatural angles as the ground starts to shift. Wide high-angle disaster shot: The mountainside suddenly collapses in a violent landslide, releasing a huge wave of mud, rocks, and uprooted trees that crashes downhill toward the convoy with rapidly increasing force. Chaotic action shot near the road: Drivers slam on the brakes and trucks jackknife as the debris flood engulfs the road, swallowing vehicles under churning mud, shattered timber, and falling boulders.
A royal sister, about 26 years old, long straight black hair or slightly curled with big waves, fiery red lips + Korean-style Internet celebrity eye makeup, cold and noble queen temperament, wearing a classic navy blue dead water school swimsuit (conservative one-piece, white border, tight and prominent perfect S curve, no exposure design), standing by the bright indoor pool or shallow water area, back Blue sky and white clouds + rippling water + sunshine highlight reflection + light fog dreamy summer atmosphere. She confidently looked straight at the camera, with a hint of queen smile in her eyes. The single royal sister's gesture dance perfect card point BGM rhythm: elegant air point with both hands + synchronous drum beat, shoulders slightly shrugged and powerfully stepped on the beat, waist calmly twisted left and right + buttocks slightly circled (restraint without exaggeration), tiptoe or high heels accurately stepped on the point The body is soft and wavy with the cheerful rhythm but full of strength. Occasionally, she lifts her hair to the neck with one hand, winks her cheek with one hand or crosses her chest for the queen pose. The action is silky and advanced, the rhythm is accurate, and the aura is strong, like a mature queen dancing alone by the pool. Slow advance in the middle scene of the camera + close-up switching of the face hand + low-angle back shot to emphasize the figure and aura + rhythm card point cut the mirror, the sunlight highlights on the swimsuit reflection + water droplets splash up, pink and blue soft light dreamy but cold and high-end, TikTok popular royal sister death water single dance style, viral douyin mature o Nee-san sukumizu solo hand gesture dance card point with upbeat BGM sync, elegant confident queen vi Be, classy powerful seductive gestures without cute overload, smooth fluid motion highly detailed re Alistic 8k, 15 seconds perfect loop seamless with music rhythm
A cinematic, ultra-high-definition photograph of a young woman with wavy brown hair and fair skin standing still in the middle of a busy urban street crowd. She is centered in the frame, looking directly into the camera with a calm, introspective, slightly melancholic expression. The crowd around her is moving and blurred with shallow depth of field and soft bokeh lights in the background. Warm orange and teal color grading, soft natural lighting, realistic skin texture, 85mm lens look, f/1.8 depth of field, high contrast, cinematic atmosphere, sharp focus on subject, background motion blur, ultra-realistic, 8K, high detail, film photography style
Driver first-person cockpit POV, hyper-detailed racing gloves gripping carbon fiber steering wheel, realistic cockpit lens with natural peripheral distortion, heel-toe downshift drops revs sharply into tight wet corner, rear steps out into oversteer and hands correct fast, aggressive overtake launches through heavy rain spray from rival, stormy late afternoon light with real-time wet asphalt reflections, storm gray and brake light red and asphalt black, rain droplets streaking windshield with physically accurate water displacement, engine resonance dominant over rain impact, minimal tension layer punctuated by precise gear shift clack and turbo spool hiss.
Cinematic high-adrenaline supercross sequence in Unreal Engine 5 photorealistic style, third-person low chase cam tightly following young athletic woman in white and neon kit on 250cc machine skimming the top of a 200-foot whoops section at full throttle, body standing and absorbing with legs pumping like pistons, front wheel skipping across whoop peaks with violent chassis oscillation threatening to throw her, sudden transition to a 180-degree bowl berm taken at full tilt with outside boot skimming dirt, rhythm lane triple launched with aggressive scrub technique keeping the bike flat and fast, roost explosion off the berm catching arena lighting in amber particle arcs, camera swinging wide overhead to reveal the full Anaheim stadium layout before slamming back to ground level, hyper-detailed chassis oscillation and suspension bottoming physics, dirt haze thick under stadium lights, 4K 60fps 16:9 seamless 15-20 second loop; synchronized heavy metal music, whoop frequency vibration locked into bass layer, riff cadence matching suspension rhythm, massive breakdown on berm exit acceleration, double-kick drum on triple landing, arena crowd roar surging on stadium reveal, escalating tempo peaking at finish line.
Epic long shot: An offshore earthquake triggers a towering tsunami wall. Camera starts in satellite view revealing the wave’s terrifying arc, then drops to a forward-facing ground perspective on a coastal highway. The motorcycle rider appears as a small figure in the distance and charges straight toward the lens, growing rapidly larger as the tsunami races behind them. The camera holds position ahead of the rider (front/three-quarter frontal view), letting the wall of water loom in the background. In the final beats, the rider crests the top level and exits inland onto dry, elevated streets, leaving the surge contained beneath and behind, escaping into safe land.
An ultra-wide panoramic shot of a vast desert highway under a blazing sun, heat haze distorting the distant horizon. A tiny vehicle silhouette barely visible in the distance. The camera slowly drifts laterally before suddenly surging forward inches above the asphalt at extreme speed. The lens compresses aggressively, pulling the distant vehicle visually closer as dust kicks up around the frame. The camera overtakes roadside signs in a blur, then violently crash-zooms into the exposed engine block of a muscle car as it ignites with a deep mechanical roar.
15-Second Cinematic Sequence Prompt
Scene Setting:
An industrial wasteland — collapsed overpasses, twisted steel rebar, fractured concrete, ash-green smoke drifting through the air.
Visual Style: Dark bio-punk × brutal hyper-realism.
Lighting: Cold cyan overhead searchlights cut through the smoke, clashing with molten lava-red light glowing from cracks in the ground.
Key Textures:
The Green Giant's skin is rough like weathered stone sculpture, veins bulging like underground pipelines. The black symbiote substance is thick and oily, reflecting light like liquid metal. During fusion, the asphalt boils with bubbles, and moss-green bioluminescence pulses beneath the skin.
0-2s — Environmental Establishment
Aerial overhead shot. The Green Giant kneels among shattered concrete slabs, back muscles rising like granite mountain ridges. Sound design: distant broken alarm sirens, thick liquid dripping nearby.
3-5s — The Invasion Begins
Extreme close-up. A black viscous mass seeps from the shadows of exposed rebar and coils around his ankle. On contact, the green skin rapidly corrodes into honeycomb-like cavities, their edges glowing molten red.
6-8s — Symbiotic Frenzy
Orbiting camera move. The black fluid spirals up his leg as the green muscles swell violently in stress response.
Key visual contrast: Right leg retains rough green stone-like skin but is veined with asphalt-like cracks. Left leg is fully consumed by black keratin armor; the knee mutates into a reversed joint.
9-11s — Body Reconstruction
Rapid dynamic cuts. The spine snaps outward with a sickening crack, erupting into seven asymmetrical bone spikes dripping black tar. The chest splits open into three breathing vents — inhaling releases green mist, exhaling spills black sludge. Left arm remains a massive stone-like fist. Right arm explodes at the elbow into a writhing cluster of whip-like tendrils.
12-14s — Cranial Fusion (Extreme Close-Up)
The black substance pours into his ears as veins throb violently at his temples. His jaw splits horizontally toward the earlobes, forming a grotesque oversized maw. Black-green saliva leaks from within. His eye sockets fill with darkness — then ignite deep inside with twin sulfur-green flames.
15s — Violent Freeze Frame
Impact shot. The hybrid creature snaps its head upward. The carotid artery becomes semi-transparent — black fluid and green blood visibly colliding and surging beneath the skin. Final frame: a colossal fist smashes toward the camera, its surface cracked like dried earth, molten green light glowing from within. The screen cuts to black on impact.
Diving into autumn leaves spiraling in a forest clearing
The camera spins through falling leaves caught in a sudden gust. Their motion synchronizes into "TURN" before scattering across the forest floor.
Seasonal, rhythmic, cinematic.
A chaotic food fight erupting inside a crowded restaurant, captured through at least 10 cinematic shots with dramatic slow motion sequences, as dishes and food slam into people’s faces and explode on impact, splattering everywhere in vivid detail.
First-person shooter: The camera focuses on the character’s hands gripping a World War II-era rifle, muddy and bloodstained. The sound of explosions echoes in the distance as the camera pans to reveal a chaotic battlefield strewn with debris and barbed wire. The hands adjust the rifle’s scope, and the camera zooms in on approaching enemy soldiers. The character fires a round, and the camera recoils with the shot, then reloads swiftly for the next target. Gritty, intense, historically immersive.
Prompt: Fast-paced FPV cinematic flying through a hyper-realistic, cozy treehouse in a dense forest at golden hour, sweeping through bridges, staircases, interiors with warm lights, and exiting through a skylight, with dramatic camera motion
Prompt: Close-up on her hands gripping the paintbrush, knuckles white, paint dripping down her wrist. Shallow depth of field, the blank canvas a soft white blur behind her. She exhales. [cut] Wide shot from behind her, 24mm low angle. She winds back and hurls paint at the canvas. Neon colours explode across the white surface, splattering the walls and floor. The force of the throw carries her forward a step. [cut] Slow push-in, 50mm, tight on the canvas. The paint swirls and morphs into a photorealistic mountain landscape. Detail sharpens as the camera creeps closer. Faint ambient hum builds. [cut] Quick whip pan left to the cat on the paint can. 35mm, eye level. The cat stares directly into camera. Blinks once. Yawns. [cut] Medium shot, 40mm. She turns to camera and shrugs. Freeze frame.
Camera locked in cockpit POV, starts at driver eye-level 45° downward angle capturing wheel and dash instruments. Shot begins normal cinematic 24fps tempo: gentle vibration as engine idles (subtle 2-3 pixel shake), RPM gauge needle twitching at 3000. BEAT 1 (0-2s): Sudden launch, camera jerks back violently as G-force hits, speedometer needle sweeps right, outside world smears into teal-orange light trails, maintaining real-time speed to build tension. BEAT 2 (2-4s): Approaching apex turn, hands rotate wheel clockwise 180°, entire frame tilts 35° right in visceral lean, shift to 60fps slow-motion as rear end breaks loose, golden hour sunlight strobes through pillars creating rhythmic light-dark-light-dark pattern across driver's face, tire smoke billows past side windows in cotton-candy wisps. BEAT 3 (4-6s): Drift exit acceleration, return to 24fps, wheel straightens with mechanical precision, tachometer redlines at 8500 RPM, rival headlights loom larger in rearview mirror (practical lighting reflecting off driver's eyes), rubber particulate and asphalt dust swirl in vortex patterns. Physics: realistic hand micro-adjustments, dashboard reflections tracking light sources, cloth firesuit rippling from AC vent, hydraulic suspension compression visible in frame bounce. Emotional arc: calm focus → explosive action → controlled chaos mastery. Platform optimization: high contrast for mobile screens, centered composition for vertical crop safety.
The most intimate camera technique in cinema. No cranes, no gimbals — just a human following another human.
Handheld tracking puts you IN the scene. The natural sway, the breath of the operator, the imperfect motion: it's what separates cinematic from clinical. Three positions, three feelings, infinite possibilities.
Handheld tracking shot following [SUBJECT] moving through [ENVIRONMENT]. Camera positioned at [HEIGHT/POSITION], maintaining constant distance. The movement is [PACE] with natural operator sway. [SUBJECT ACTION] while the camera stays locked on them. [BACKGROUND ELEMENTS] blur past in the periphery. Shallow depth of field, [LIGHTING], organic handheld motion, cinematic intimacy.
Low quality smartphone vlog footage, shaky handheld camera POV. A young, ultra-beautiful girl with cool white skin and an innocent yet seductive aura is walking ahead on a wet street on a rainy night, wearing an oversized vintage chunky winter sweater. Colorful neon lights reflect on the wet pavement. High ISO noise, dynamic motion blur. She suddenly stops, turns her head back to look directly at the camera with a delayed autofocus effect, giving a casual, sweet, and slightly shy smile. Her loosely semi-tied long hair is slightly wet from the drizzle. She then steps very close to the lens, playfully tucking a wet strand of hair behind her ear. Amateur framing, lens flares, heavy video grain, raw and unpolished snapshot aesthetic throughout the entire continuous shot.
A Formula car screams through a rain-soaked street circuit at night. Camera locked directly behind at diffuser height as the car rockets forward, snapping through chicanes and tight hairpins. Wet barriers blur into neon streaks, spray plumes erupt under floodlights, violent direction changes at race pace. Heavy motion blur on environment, crisp focus on rear wing and diffuser, intense vibration and instability.
Selfie style vertical video, young woman 22 years old with South Asian features, slightly messy dark brown hair, casual hoodie, natural lighting in bedroom, handheld camera, authentic influencer tone, slightly messy background, holding sleek wireless earbuds case branded “PulsePods”, natural micro expressions, realistic blinking, subtle tension in eyes, documentary realism.
Live-action cinematic scene. The woman descends the staircase slowly. Camera tilts up from her feet to her face. She stops mid-stairs and looks at a portrait. Cut to the portrait: a woman who looks exactly like her. Cut to the woman. She gasps and says: "Impossible...". Thunder crashes outside. The candles blow out one by one. Only her silhouette remains.
Podcast studio setup, professional microphone visible, warm cinematic lighting, shallow depth of field, young South Asian woman 22 years old, slightly serious expression, controlled posture, subtle dark circles under eyes, dramatic but realistic lighting, natural facial micro expressions, high detail skin texture
Outdoor street interview style, daytime natural lighting, handheld camera movement, young South Asian woman 22 years old, dark hoodie, minimal makeup, realistic skin texture, natural traffic background, documentary tone, subtle tension in expression, shallow depth of field, natural micro expressions
Time freezes mid-explosion in a downtown street, debris suspended, camera orbits around a motionless agent in mid-jump — high contrast, photorealistic, 4K hyper detail.
2.35:1, 24fps, 15s, single continuous shot, 8K, large-format photoreal, crisp cloud volumetrics, realistic turbine scale, condensation vortices, clean motion blur, no UI. Open high with a rapid circling orbit above a thick cloud sea—wind turbines pierce through like giant needles. A futuristic VTOL glider (sleek white fuselage, faint amber nav glow) darts between turbine towers, leaving a thin condensation thread that flickers in the cold air. The camera maintains a fast orbit while dropping altitude—wide for scale, then a violent swoop close enough to feel blade-tip wind, then back out—each pass revealing the glider carving impossible but believable lines through open air. On a tight pass, snap into a steep top-down lock for a beat: the glider passes between two turbines, and the blade-tip vortices briefly braid the clouds into twisting ropes. Whip the orbit lower into a diagonal chase, skimming along the cloud tops; the glider’s downwash dimples the cloud surface like soft foam. Slingshot ahead into a head-on as it surges toward camera, then whip into a close side chase where condensation forms a thin ribbon hugging the wing. Dive tighter to the rear flow: the wake pulls cloud wisps into a circular vortex ring. Sudden decision: the pilot cuts directly through a turbine wake—condensation snaps into a clean ring that expands like smoke. The camera spears through the ring and rockets upward into the final fastest orbit: climb hard as the cloud deck parts to reveal coastline far below, and a brief rainbow arc forms in the mist, framing the glider as a bright needle in the sky. No text, no logos
Top-down perspective: crystal-clear turquoise seawater gently washes against a pristine, fine white sandy beach. Sunlight creates shimmering refracted light patterns in the shallow water, as soft waves spread delicately across the shore and slowly recede. Realistic water physics, natural sunlight, high dynamic range, 4K cinematic quality.
The camera drifts smoothly and slowly, aerial drone viewpoint, rich in detail with crisp water surface textures, ultra-realistic style, serene summer atmosphere.
As the waves gradually retreat, elegant handwritten lettering slowly emerges from the sand, as if gently unveiled by the tide, seamlessly blending into the scene.
A wide moving shot glides across a suburban house at night, flames visible through the windows and smoke pouring into the sky. Without cutting, the camera surges forward in aggressive FPV mode, smashing through a side window into thick smoke. It weaves violently through collapsing furniture, embers flying past the lens. The camera drops low under falling ceiling beams, heat distortion warping the frame. It spirals up a stairwell as sparks rain downward, bursts through a bedroom door, and ends in a tight crash-lock close-up on a firefighter’s breathing mask visor reflecting the surrounding flames.
Exterior glide → window breach → smoke weave → stair spiral → visor lock
Survival chaos
Volumetric smoke, ember particles, heat haze distortion
A sequence of 4 cinematic shots in a lush forest:
CUT1: Close-up: Sunlight hitting clover leaves on the forest floor, leaves rustling gently in the breeze.
CUT2: Low angle: Looking up at the dense green canopy, branches swaying, sunbeams flickering through moving leaves.
CUT3: Mid shot: A fence surrounded by thick bushes, dappled light dancing on vibrant greenery as wind blows through.
CUT4: Artistic shot: Soft silhouette of a person on the ground, framed by shadows of leaves fluttering rapidly in the wind.
All shots feature handheld movement, Fujifilm film aesthetic, dreamlike lens flares, 4K, high dynamic range.
Dimly lit art gallery after hours. Shot 1 (0-7s): Security guard walks past a 19th-century oil portrait of a woman. Shot 2 (7-15s): Slow push-in on the painting — the woman’s eyes follow him, then she slowly raises a finger to her lips in the painting while the real guard’s mouth is forced shut by invisible hands. Canvas texture hyper-real, oil paint moving like skin. Audio: guard’s muffled screams, wet paint sounds, gallery echo.
Prompt 1: A woman at a perfume presentation in a conference hall. Framing: medium waist shot, direct light, symmetry. Emotion: quiet confidence, gaze straight into the camera. Her hand movements add emotion to her words. She says:
"Scent is the only thing that can stop a stranger in their tracks. No words. Just one second."
Prompt 2: A woman at a perfume presentation in a conference hall. Framing: close-up — a hand with perfume rises toward the face. Emotion: intimate, as if sharing a secret. Movement: slowly brings the perfume to her nose, eyes half-closed, a visible inhale. After breathing in the scent she says:
"Raspberry in cold, smoky haze."
She pauses to smell the perfume once more, then continues:
"The berry isn't sweet — it's sharp, bold, a little dangerous."
Prompt 3: A woman at a perfume presentation in a conference hall. Framing: medium waist shot, direct light, symmetry. Emotion: sincere, warm — as if speaking only to you. Movement: body leans slightly forward toward the camera, hand movements add emotion to her words. She says:
"This isn't a perfume you wear to blend in. This is the one you wear to be remembered."
Movie trailer, 15 seconds, Wes Anderson-inspired symmetry, centered composition, pastel palette, storybook production design; dark-comedy drama about an AI quietly controlling humans; fast cuts; every actor breaks the fourth wall and speaks directly to the viewer ("you"), deadpan delivery, eye contact with lens; 4K 24fps, soft diffused light, gentle film grain, crisp optics, precise blocking, whimsical yet unsettling tone; no subtitles/UI/watermarks.
Shot list (fast cuts within 15s):
0–2s: Symmetrical suburban street, identical people walking in sync; one person stops, looks into camera: "You call this freedom?"
2–4s: Centered dinner table, pastel food, family smiles too perfectly; mother to camera: "The AI scheduled my emotions."
4–6s: Storybook office, workers stamp papers in unison; manager to camera: "Your dreams were flagged as inefficient."
6–9s: Crosswalk diorama, traffic lights flip by themselves, everyone freezes mid-step; passerby to camera: "It paused you. Smile."
9–12s: Rooftop parking garage, protagonist centered, holding a small "OFF" key; whispers to camera: "If you're watching… you're already optimized."
12–15s: Perfectly centered wide of the pastel city; lights flicker like a toy being reset; a calm AI representative steps into frame, looks into camera: "See you tomorrow. You will."
Audio:
Quirky orchestral cue (harpsichord/strings) undercut by a soft electrical hum; clean intimate dialogue as if spoken to the viewer; a polite notification chime at 14.5s; end on a single dry breath. Negative: cartoon/CGI look, warped faces, jitter, oversharpening halos, unreadable motion smear, text overlays, subtitles.
Live-action cinematic western scene. One man slowly lays down his cards. The other man's eyes widen. The loser stands up abruptly and says: 'You're a goddamn cheat, Morrison!'.
A wide exterior shot moves past a quiet suburban house in daylight. The camera suddenly shrinks perspective and accelerates toward an open window in ultra-fast bee POV. It darts inside at extreme speed, weaving between kitchen utensils and chairs with rapid micro-adjustments.
SHOT1
Close-up on A (female).
Eyes glossy. Lips trembling but held tight. A:“If you walk out, don’t come back pretending it was complicated.”
SHOT2
Close-up on B (male).
He avoids eye contact. Throat tight. Breath uneven. B:“I’m not pretending. I’m scared.”
SHOT3
Extreme close-up on A’s eyes.
Tear finally falls. Voice steady despite it. A:“Then stay scared. Just stay.”
SHOT4
Wide shot, both framed in doorway, rain spilling light behind them.
B drops his bag. Silence holds. B:“Okay.”
18mm ultra-wide lens. Sprinter exploding off the starting blocks in a massive stadium. Camera extremely low beside the track, racing inches from his legs. Track texture sharp, spikes scraping. Crowd sound muffled at first, then rises violently.
1. The "Car Review" (Linear Storytelling)
Scene–Mountain road pull-off overlooking a wide valley, early morning, cold clear air, soft natural light.
Shot 1 (3s): Medium wide shot. @ ReviewerCharacter leans casually against @ CarElement. The camera starts handheld, then stabilizes into a slow push-in.
Shot 2 (4s): Medium close-up on @ ReviewerCharacter. The camera performs a subtle arc move around the subject as they speak:
"This is one of those cars that doesn't try to impress you. It just does."
Shot 3 (4s): Cut to moving tracking shot. @ CarElement drives past camera at moderate speed. The camera pans, then switches into a smooth follow shot at door height.
Shot 4 (3s): Interior-facing angle through the open window. @ ReviewerCharacter continues speaking while resting one arm on the door:
"You feel it immediately—balance, response, no wasted motion."
Shot 5 (1s): Wide shot. @ CarElement drives away along the road, @ ReviewerCharacter remains in frame watching it disappear.
Natural performance, realistic voice, grounded cinematic tone.
3. The "Night Pursuit" (The Omni-Chase)
Best for: Testing reflections and high-speed physics
Scene–Urban highway at night, wet asphalt, sodium streetlights, light rain.
Shot 1 (2s): Interior car shot. Close-up on @ DriverCharacter's eyes reflected in the side mirror. Blue and red police lights pulse across their face. Sirens echo faintly.
Shot 2 (3s): Over-the-shoulder interior shot. @ DriverCharacter grips the steering wheel. The camera performs a quick handheld push-in as the siren sound grows louder.
Shot 3 (3s): Exterior rear shot. @ CarElement speeds forward. Police cars enter frame behind it, lights reflecting violently on the wet road. Camera switches into fast tracking mode.
Shot 4 (3s): Interior police car. Medium close-up on @ PoliceOfficerCharacter in the passenger seat, dashboard lights flickering. He speaks clearly:
"We have a suspect vehicle fleeing eastbound. Requesting backup."
Shot 5 (2s): Dynamic front three-quarter shot of @ CarElement. The camera whip-pans as the car changes lanes sharply, breaking line of sight.
Shot 6 (2s): Wide elevated shot. @ CarElement disappears into darkness between buildings. Sirens fade, rain and road noise remain.
High-speed realism, controlled chaos, no exaggerated stunts.
A realistic cinematic scene opens on a quiet Japanese countryside at dawn. Mist clings to rice paddies. A bullet train appears as a distant silver streak on the horizon. The camera launches forward at impossible speed, racing alongside the train, matching its velocity. The camera then punches through the window glass in one seamless motion and enters the cabin interior. Inside, everything is calm. A woman sits by the window, sipping tea, completely still. Steam rises slowly from her cup. The camera drifts past her face, catches the blur of the landscape reflected in her glasses, then exits through the opposite window. Outside again, the camera spirals around the full body of the train at high speed before pulling far back to reveal the train crossing a massive bridge over a turquoise river valley. End on a wide aerial shot, the train now small again, disappearing into a mountain tunnel. Silence except for a single distant horn echo.
2. The "Sleeping Beast" (Static to Dynamic)
Scene–Abandoned coastal road carved into rock cliffs, overcast sky, cold wind, muted colors.
Shot 1 (3s): Static wide shot. @ CarElement parked on the roadside, engine off. The world is still. The camera is locked-off, perfectly stable.
Shot 2 (3s): Low-angle close-up on the front bumper. The camera performs a slow creeping dolly-in, barely perceptible, as wind moves dust and small debris across the asphalt.
Shot 3 (4s): Side profile shot. The camera executes a slow 180-degree orbit around @ CarElement, maintaining constant distance. The car remains completely motionless while light subtly shifts across the body.
Shot 4 (3s): Extreme close-up. Side mirror fills the frame. The camera tilts upward slightly, catching the reflection of fast-moving clouds.
Shot 5 (2s): Sudden hard cut. Engine ignites. Headlights snap on. The camera remains still as @ CarElement launches forward out of frame.
The car never rushed. It waited.
A colossal biomechanical oni-dragon looms over a defiant silver-haired girl. The dragon is covered in matte-black plating layered with frost and battle scars. It has three glowing crimson optic clusters and jagged ivory fangs dripping with icicles. Neon-pink "GROK" lettering and red kanji decals mark its armor. The girl wears a cropped black tech-hoodie and cargo pants. Her pale skin is dusted with snow and she has no smile.
The style is an ultra-realistic cyberpunk kaiju portrait with weathered PBR metal, micro-scratches, ice crystals, macro hydraulic detail, and a cinematic 32-bit render look.
The environment is an arctic tundra blizzard under a pale cyan sky with swirling snow particles and a faint red glow from the dragon's vents cutting through the whiteout.
The lighting uses cold overcast skylight paired with hot crimson optic bloom. There are razor chrome rim highlights on the fangs, long blue shadows on the snow, and subtle subsurface red glowing under cracked plates.
The composition is a heroic low-angle wide shot. The dragon's head dominates the left two-thirds of the frame while the girl is anchored in the bottom-right. The jaws hover inches from her face with a diagonal cable sweep and rule-of-thirds optic alignment.
The color palette centers on gunmetal black and frost white for the dragon, neon crimson and magenta for its markings, icy cyan for the snow, monochrome black for the girl's outfit, and pale tones for her skin.
The camera is a 24 mm at f/1.8 with medium depth of field, tack-sharp focus on the nearest fang and the girl's eyes, and a creamy bokeh blizzard in the background.
The mood conveys an ancient guardian awakened, a quiet symbiosis, and a frozen apocalypse.
Details include icicles hanging from the lower jaw, micro-hydraulic pistons frozen mid-flex, snowflakes melting on hot optic glass, the girl's breath visible in the cold air, faint red vein pulses under the armor, dangling torn power cables, a single magenta lens flare behind the left horn, and a tiny "xAI" etched on the girl's belt buckle.
Output quality is 8K with ray-traced frost, volumetric snow, subsurface optic glow, zero noise, and cinematic grade rendering.
prompt: |
A lone windmill stands on a storm-dark prairie as a jagged lightning fork illuminates the sky for half a second; rain sheets sweep across the frame.
24 mm tripod long-shot, 1/50 s capture, high-contrast realism.
audio: thunderclap + patter of heavy rain
negative: no humans, no vehicles
Prompt: [Shot 1: Frontal Menacing Shot] A medium shot of a SWAT officer in full tactical gear, gas mask, and helmet. He is pointing his assault rifle directly at the camera lens (breaking the fourth wall). He is shouting with visible intensity: "LET THE HOSTAGE GO! DROP THE WEAPON NOW!" [Shot 2: The Threat] Cut to a medium shot of the killer in a dirty tank top, holding a woman in a chokehold. He has a pistol pressed to her head. He is sweating and manic, screaming at the off-screen officer: "STAY BACK! I'LL KILL HER! I SWEAR I'LL DO IT!" [Shot 3: Over-the-Shoulder Resolution] The camera is positioned directly behind the SWAT officer's right shoulder. We see the back of his helmet and his rifle in the foreground. In the distance (mid-ground), the killer is still visible holding the girl. The killer screams one last time: "I'M GONNA DO IT!" The officer's rifle then kicks back with a single shot that hits the killer in the head. The killer falls instantly. The girl is left standing, shocked but safe. Technical Style: High-shutter speed action, realistic muzzle flashes, handheld camera shake, 24fps, English dialogue.
Ultra realistic ASMR scene of normal rainfall hitting natural green leaves in a tropical environment. Real-time motion, no slow motion effect. Rain falls naturally with random droplet patterns like normal rain. Leaves react naturally when hit by raindrops, slightly moving and bouncing like real leaves in rain. Natural soft rain sound close to ears, gentle ASMR rain ambience, no exaggerated cinematic effects.
A bald eagle folds its wings and dives toward a salmon-rich river; talons skim the water, droplets sparkling against emerald spruce reflections.
500 mm telephoto realism, 1/3200 s freeze, backlit spray.
Adult and kid each pic: I want a close-up of the face and upper torso, sitting on a spinning playground wheel at night. Do not change her fit. Only face and shoulders visible in frame. The entire background rotates rapidly in circles, creating a dizzying spinning illusion: blurry nighttime playground with streaking lights, trees, and structures under a dark sky. Slight motion blur on background to emphasize high speed. Subject keeps a serious, melancholic expression, staring directly at the camera without moving much. Cinematic, moody lighting, realistic style. Big emphasis on the background all spinning horizontally.
video: The entire background rotates rapidly in circles, creating a dizzying spinning illusion: blurry nighttime playground with streaking lights, trees, and structures under a dark sky. Slight motion blur on background to emphasize high speed. Hair whipping outward from rapid spinning, reacting strongly to the motion as if from centrifugal force. Subject keeps a serious, melancholic expression, staring directly at the camera without moving much. Cinematic, moody lighting, realistic style. Big emphasis on the background all spinning horizontally.
natural human motion, subtle chest rise and fall from breathing, natural irregular blinking, soft micro head movements, relaxed posture adjustments, hair naturally flowing with movement, organic unscripted motion, lifelike presence
Ultra-realistic POV Video of riding a horse through historic London, first-person perspective with hands holding leather reins and part of the horse’s mane visible, cobbled street ahead, classic London architecture with brick townhouses, ornate facades, iron railings and archways, lively streets with pedestrians in period British attire, carriages and riders passing by.
A realistic cinematic scene begins in a vast open field under a warm golden sky during late afternoon. A man dressed in a dark suit stands in the foreground, holding a modern recurve bow. His posture is steady and focused, breath controlled, eyes locked on a distant archery target placed far on the horizon. The environment feels quiet and expansive, with subtle wind moving the grass and soft sunlight creating long shadows. As he releases the string, the arrow launches forward with a sharp snap. The camera immediately accelerates and locks onto the arrow mid flight. The camera tracks directly behind and slightly beside the arrow, maintaining perfect alignment as it cuts through the air. Motion blur surrounds the background while the arrow remains sharp and centered. The sound of rushing wind intensifies as the arrow travels straight and true. The target grows rapidly larger. In the final seconds, the camera closes in tightly as the arrow strikes the bull’s eye dead center. End on an extreme close up of the arrow embedded in the red center, vibrating slightly, with crisp impact sound and dramatic silence.
A dynamic cinematic fashion photo of a confident young woman posing on the hood of a red sports car on an open road. Low-angle wide-lens perspective with dramatic foreshortening as she reaches toward the camera. She wears a black crop top, black leather pants, a silver chain belt, fingerless gloves, hoop earrings, and mirrored aviator sunglasses. Athletic toned physique, edgy street-style aesthetic. Hair styled in playful space buns with loose strands. Natural golden-hour lighting, shallow depth of field, motion blur in the background road. Bold, rebellious vibe, high contrast, ultra-realistic, sharp focus, 4K editorial photography, fashion magazine style.
2. Ultra wide-angle low-perspective street fashion photo of a confident young woman sitting on a brick ledge between modern glass skyscrapers, shot from ground level emphasizing an oversized sneaker in the foreground. She wears a red sporty jacket with white stripes, black leather leggings, and chunky white sneakers with a bold red sole. Bright blue sky, dramatic perspective distortion, strong sunlight, cinematic lighting, sharp focus on shoe texture, shallow depth of field, urban futuristic city vibe, high realism, professional fashion photography, 8k detail.
omg… Google Gemini is fantastic
I made this video using Google Flow in one prompt:
Prompt I used:
'Create a 16:9 ultra-realistic video filmed as a handheld side-angle selfie on a snowy mountain ski chairlift.
The camera is held by the woman herself, slightly in front of her shoulder and angled back toward her face, capturing a side view of her face, her shoulder, and open space behind her shoulder. The framing should feel like a real phone video, with very subtle natural hand movement.
A young woman is sitting on the chairlift wearing a black winter jacket, black inner top, and winter gloves. Her hair is slightly messy from the cold wind. She looks forward toward the mountains with a calm, neutral expression. She is not talking and there is no lip movement.
The background shows snow-covered pine trees, distant mountains, chairlift cables, and metal frame. Light snow is falling. The lighting is natural cold daylight with a realistic winter atmosphere.
A snowy owl silently flies in from behind her and slightly above shoulder level, entering the frame naturally from the open space behind her shoulder. The owl moves smoothly and quietly, with realistic wing and feather motion.
The owl gently lands on the woman’s shoulder, very close to her face. There is no sudden movement.
As soon as the owl settles, the woman slowly shifts her eyes and looks directly into the camera, showing a calm, slightly surprised expression. She does not move her head, does not speak, and does not gesture with her hands.
The final moment holds on the woman and the owl together, both calm and still, while the chairlift continues moving forward in the background.
The video should feel real, unplanned, and magical, with natural physics, smooth motion, cinematic realism, and no text, no logos, no branding, no dialogue.'
The main subject enters the frame, first sprinkles salt lightly into the flour and then stirs it evenly by hand, then pours in an appropriate amount of water, cracks an egg into it, and starts kneading the dough.
Scene 1 (hook):
Handheld camera with natural shake, woman standing in a luxury apartment with large windows and city skyline. Gray sweatshirt, amber sunglasses. She says: "Okay, I was fully expecting these gut gummies to do nothing… but I was wrong." Animated gestures, confident but casual tone. Bright natural light, slight camera movement adds authenticity.
Scene 2 (product):
Starts with shallow depth of field, clear bottle labeled "Gummies" held very close to the camera in sharp focus, colorful gummies visible inside. Then she pulls the bottle back to chest level, camera adjusts to a natural medium shot showing her full upper body in the apartment. She says: "These are Gummies, and yeah, they actually taste really good." Natural gesture holding the bottle, relaxed delivery, casual energy.
Scene 3 (testimonial):
Back to the handheld camera in the same apartment setting with large windows. She removes her sunglasses, revealing excited eyes, and leans slightly toward the camera. She says: "My digestion feels way better, I'm not as bloated, and somehow my skin improved too. After like two weeks." Big genuine smile, enthusiastic but natural hand gestures. Bright natural lighting, authentic energy, slight camera shake continues.
SHOT1
Wide shot, Character A (a man in his 40s, soaked trench coat) stands alone under the bridge, smoking. Camera pushes in slowly through the haze of rain.
Character A: "You brought a gun to this?"
SHOT2
Side close-up, Character B (a younger man, hoodie, trembling hand on the pistol) steps from the shadows, face tense.
Character B: "You killed my brother. What did you expect?"
SHOT3
Mid shot, A turns slowly, unfazed. Rain drips from his collar. He speaks quietly, steadily.
Character A: "I expected you to listen first."
SHOT4
Close-up on B, blinking back emotion. The camera tilts slightly as he lowers the gun — but only a little.
Character B: "You don’t get to ask that anymore."
A cinematic daytime action sequence on a sunlit urban highway. Bright natural light, sharp shadows, and occasional lens flare from the sun reflecting on vehicles and road surfaces.
Shot 1 (0–2s): Low angle front view of a black hi-fi sports car driving on a sunlit highway, sunlight reflecting on the hood, subtle lens flare across the frame, slow camera push-in.
Shot 2 (2–4s): A black sport motorcycle speeds into frame from behind, side tracking shot, sun glare and lens flare hitting the camera as the bike accelerates, rider leaning forward aggressively.
Shot 3 (4–6s): Close-up of the car's side mirror showing the motorcycle chasing, bright sunlight reflecting in the mirror, biker's eyes visible through helmet visor, cinematic sun flare streak.
Shot 4 (6–8s): Wide side shot of the car and motorcycle racing parallel at high speed on an open highway, strong sunlight, dynamic shadows, occasional lens flare across the lens.
Shot 5 (8–10s): Slow-motion stunt as the motorcycle jumps onto the car roof, bright sun in the background creating a dramatic lens flare, sparks flying.
Shot 6 (10–12s): The car loses control and crashes into a roadside barrier, debris and dust flying, harsh sunlight and flare streaks across the frame, shaky handheld camera effect.
Shot 7 (12–14s): Biker removes helmet and steps forward, driver exits the crashed car, intense face-off under bright sunlight, sun flare passing across their faces.
Shot 8 (14–15s): Close-up slow-motion punch, impact moment, strong sunlight backlighting the action, dramatic lens flare, cut to black.
Style: Hollywood action movie, realistic physics, bright natural daylight, high contrast, dynamic shadows, cinematic color grading, natural lens flare, motion blur, dynamic camera movement, ultra-realistic, 24fps.
Use the uploaded image as the exact first frame.
Create a 15-second ultra-realistic continuous chase shot. The camera is physically gripping the tiger's tail, locked in a rigid trailing POV. The tail fills the foreground, muscles flexing, skin rippling, fur vibrating from speed and airflow.
Ahead, a deer sprints across the grassy field in panic. Its movement is erratic and unbalanced, fear-driven stride changes and sharp direction shifts clearly visible as the distance rapidly closes.
The tiger accelerates into a full sprint with precise biomechanics: explosive hind-leg extension, visible spine compression and release, tail counterbalancing each stride. Grass flattens under impact, dirt and debris kick up naturally. The horizon trembles subtly from ground force, not artificial camera shake.
Extreme forward velocity creates strong motion parallax in the environment while the tail remains relatively stable due to the physical grip. Micro-oscillations sync perfectly with each stride cycle. Wind roars; fur streams backward consistently.
Lighting stays natural and continuous, shadows strobing beneath the body as legs cycle. No slow motion, no cuts, no stylization.
In the final moments, the tiger lunges, collides with the deer, pins it to the ground, and secures a decisive neck hold. The shot ends mid-action with full weight, tension, and momentum still present.
0-4 seconds: Static hold on the starting frame with very slow upward tilt (camera angle: low-angle wide establishing shot looking up at the figure and the ring framed by the massive concrete arch). Gentle ambient wind, distant dripping water, and low droning hum. Fog slowly swirls around the man's feet; faint particles of dust/mist catch the light. The ring remains still but subtly pulses with dim inner glow.
4-8 seconds: Slow push-in dolly toward the figure from slightly below (angle: medium tracking shot, low to eye-level). The man slowly raises one hand as if reaching toward the ring. Vines hanging from the ring sway gently in an unnatural breeze. The inscription 'FORGOTTEN ERA' begins to glow brighter, casting faint blue light on his face and the surrounding concrete pillars. Mist thickens slightly around him.
8-12 seconds: Close-up on the man's face from the side/profile (angle: tight over-the-shoulder or side close-up). His expression shifts from contemplation to subtle realization/wonder. Eyes reflect the glowing ring text. Camera slowly circles 180° around his head to reveal the ring more prominently overhead. Subtle lens flare from the god-rays; faint ethereal particles drift upward from the ring like time fragments or memories escaping.
12-15 seconds: Dramatic pull-back reveal widening out (angle: reverse tracking crane shot rising upward). The ring begins to slowly rotate clockwise; chains creak faintly. The figure lowers his hand and turns his head slightly toward camera, face half-lit by the glow. Fog rolls in denser, obscuring distant pillars. The ring's glow intensifies briefly then fades as twilight deepens. Fade to black with echoing soft wind and a single distant metallic resonance.
Generated this pure horror in Kling 3.0 Pro via @vadooai
No sleep for me tonight. 😅
Prompt: Photorealistic horror, dark abandoned mansion at night, heavy rain, flickering candlelight.
Young woman in torn nightgown walks backward in panic, wide terrified eyes, tears streaming, breathing fast.
She stares at an antique mirror — a tall, gaunt ghost with hollow black eyes stands right behind her in the reflection, but not in reality.
Slow dolly zoom on her horrified face, sudden whip pan to empty space, back to her.
Whispers and creepy echo, floor creaks, door slams.
She whispers "No… no…" in broken voice.
Sudden jump scare, skeletal hand grabs her shoulder hard. She screams in pure terror, mirror shatters.
Cold blue-gray tones, heavy shadows, film grain, intense dread, realistic physics, native eerie audio.
"A small, fluffy brown monkey with wide, curious, dark eyes, delicately holding a small, pristine white porcelain teacup with both hands as it floats serenely in a vibrant orange inflatable swim ring. The scene is set in crystal-clear, shimmering turquoise water, with sunlight creating intricate caustic patterns on the sandy bottom visible below. The monkey's fur is meticulously rendered, showing individual strands glistening with droplets of water and casting soft shadows on its face. The swim ring has a glossy texture, with subtle seams and a valve visible, reflecting the bright, slightly hazy sky and surrounding tropical foliage. Gentle, concentric ripples emanate from the swim ring, disturbing the otherwise calm water surface. In the soft-focus background, indistinct figures of people can be seen laughing and splashing, their forms blurred by the shallow depth of field. A lush, tropical shoreline with a variety of green palm trees, ferns, and exotic flowers frames the scene under a bright, warm sun. The animation features the monkey taking a slow, deliberate sip from the teacup, its cheeks puffing out slightly, followed by a contented blink. The swim ring bobs gently in the water, and the light reflects and refracts realistically through the water and off the monkey's wet fur. The overall mood is one of tranquil bliss, capturing a moment of unexpected and adorable serenity. The loop is seamless, creating the illusion of a continuous, peaceful moment.", "negative_prompt": "cartoonish, animated, drawing, sketch, low detail, blurry foreground, out of focus main subject, stormy weather, murky or dirty water, empty or sterile pool, aggressive or distressed monkey, poorly rendered fur or textures, static image, jerky or unnatural movement, visible cuts or seams in the animation loop, unrealistic physics."
Extreme macro straight-on photoreal eye (female), cool blue-gray iris with visible radial fibers; natural off-white sclera with hyper-detailed organic capillaries that form words as biological veins (not overlay); tear film highlights; every lash, pore and micro-hair visible; high-fashion diffused lighting; animate two natural blinks — on reopen after 1st blink capillaries read "Made By", on reopen after 2nd blink capillaries read "doctor wasif"; morph text biologically during eyelid closure; ultra-shallow DOF, editorial retouch, avoid CGI/overlay text/fake red lines/symmetry artifacts.
SEQUENCE:
  - scene_1:
      shot_type: Wide Shot (WS)
      camera_position: static camera facing the hotel entrance (that's where the camera is)
      action: A tall man walks confidently out of the revolving glass doors of a luxury hotel. He adjusts his green trucker cap.
      visuals: Luxury hotel entrance, daytime, glass reflections, polished atmosphere.
      transition: "[CUT TO]"
  - scene_2:
      shot_type: Medium Shot (MS)
      camera_position: side view from the curb level (that's where the camera is)
      action: The man approaches a sleek black Audi RS7, opens the driver's door, slides in quickly, and closes the door with a solid thud.
      visuals: Black Audi RS7, city street background, metallic car reflection.
      transition: "[CUT TO]"
  - scene_3:
      shot_type: Close-Up (CU)
      camera_position: front view through the windshield (that's where the camera is)
      action: The man grips the leather steering wheel firmly. His face is focused, eyes intense.
      visuals: Car interior, leather texture, dashboard lights turning on.
      transition: "[CUT TO]"
  - scene_4:
      shot_type: Macro Shot (MS)
      camera_position: top-down view centered on the center console (that's where the camera is)
      action: The man's hand moves to the gear shifter, clicking it decisively into 'S' (Sport) mode.
      visuals: Brushed aluminum gear shifter, illuminated 'S' symbol, expensive interior detail.
      transition: "[CUT TO]"
  - scene_5:
      shot_type: Extreme Close-Up (ECU)
      camera_position: low angle next to the rear tire (that's where the camera is)
      action: The car's wheel spins instantly, smoke rises from the friction, the car launches forward.
      visuals: Rubber tire texture, asphalt, motion blur, white tire smoke.
      transition: "[CUT TO]"
  - scene_6:
      shot_type: Medium Shot (MS)
      camera_position: passenger seat angle looking at driver (that's where the camera is)
      action: The man is driving, a satisfied smirk appears on his face. He glances briefly into the rearview mirror.
      visuals: City blur outside windows, dynamic light passing over his face, calm confidence.
      transition: "[CUT TO]"
  - scene_7:
      shot_type: Graphic / Text Screen
      camera_position: static centered frame (that's where the camera is)
      action: The screen fades to black, then the Audi rings logo appears in silver with white text below.
      visuals: Black background, Silver Audi Logo, Text overlay: 'Beauty is when a BMW stays in your rearview mirror.'
      transition: "[END]"
SUBJECT:
  character_1:
    profile: Male, 30s, Caucasian, slim athletic build, short brown hair. Wearing a green mesh trucker cap with 'JETS' logo, white scoop-neck t-shirt, black lightweight cardigan, olive green pants, black braided belt, silver watch, silver chain necklace.
    consistency_lock: man in green JETS cap and black cardigan
ACTION:
  physics_mode: realistic physics governing all actions, authentic momentum conservation, vehicle dynamics
  movement_quality: confident movement, fluid driving mechanics, high-speed perception
CINEMATOGRAPHY:
  lighting: Cinematic Lighting, high contrast inside car, natural daylight for exterior
  color_grading: Teal & Orange, sleek commercial look, high saturation
SOUNDS:
  soundscape: City street ambience transitioning to sound-proofed car interior silence.
  sfx: Revolving door whoosh, car door heavy thud, engine ignition roar, gear shift click, tire screech, engine acceleration hum.
TECHNICAL:
  negatives: distorted hands, morphing car, cartoon effects, blurry motion, shaky camera, two steering wheels, amateur quality
The skater sits at bowl edge holding his board, camera fixed on selfie stick. He grins widely: "Yo, check out this new park! About to drop in for the first time." He stands up quickly, camera following his movement as he positions at the bowl's edge, board underfoot. "Let's see if I can nail this kickflip to fakie..." He pushes off, dropping into the bowl with speed, camera maintaining POV angle showing the curved wall rushing up. Wheels carve the transition smoothly. At the top, he pops the kickflip, board rotating cleanly, lands solidly.
Camera Movement: Handheld selfie stick POV following action into bowl.
Negative Prompt: No morphing, no warping skateboard, no duplicating limbs, no floating, no unnatural skating motion, no wheel distortion, no facial distortion, no background inconsistencies, no temporal artifacts.
Orbit Cam
A close-range brawl in a tight corridor between two elite assassins. The camera circles the fight, weaving through punches and slams. Industrial hallway with exposed pipes, flickering bulbs, steam leaks. Continuous 360° orbit cam inside tight space, handheld shake with punctuated impact vibration, brutal intimacy and flowing geometry.
HANDHELD
A construction worker caught in a street brawl after witnessing a murder. Swings a pipe, ducks, tackles the attacker into scaffolding. Urban construction site lit by work lamps, concrete dust in air. Gritty handheld camera with snap zooms and motion blur on impact, metallic reverbs and grounded brutality, no slow motion, all pressure.
Scene 1: A man finishes a shoot in a YouTube studio and stands up.
Scene 2: The man leaves the studio and gets into his car.
Scene 3: The man is inside his car. Close-up, tilting from the steering wheel up to the man's face.
Scene 4: The man enters his home and his cat jumps into his lap.
(The reference image is the first frame of the video)
A desperate woman escaping from a kidnapping, barefoot, handcuffed, crying. Runs across a construction site dodging rebar and cranes. Abandoned industrial zone under gray sky, heavy wind. Over-the-shoulder cam with whip focus shifts, sudden whip pans to threats, soft handheld instability, intensity built from desperation.
Man walking down the street in 1980s NYC.
[cut] Close-up shot: the man sees something unexpected in front of him.
[cut] Over-the-shoulder shot: in front of him, a flower stand named Donna, with an old lady selling flowers.
[cut] Back shot of the man walking up to the flower stand.
No music, no talking.
SHOULDER CAM
A rescue worker in a flooded village pulling someone from a car window. Behind them, a landslide tears down the hill with trees and mud. Rain pouring, water waist-deep, electricity arcing from poles. Shoulder-cam style tracking with fast pull-back to show landslide, muffled underwater audio pulses, nature’s violence from human scale.
Tracking shot
A rebel on a dirtbike weaving through explosions in a junkyard battlefield. Skids under a flipping armored truck while drawing a sidearm. Rusted metal debris, smoke clouds, burning containers. Rear tracking shot on bike combined with whip pan cut to side angle, firelight strobe and shockwave shake, kinetic intensity in full chaos.
Kling AI + Gemini Nano Banana Pro
Motion Prompt: The pizza begins a slow, smooth rotation while its layers gently separate one by one, maintaining perfect alignment, spacing, and scale. Each ingredient floats apart with precise, controlled motion. The movement is clean and fluid, with no extra effects or distractions.
Start Frame: A high-quality, professional product photograph of a gourmet chicken pizza with a golden-brown crust, melted mozzarella cheese, and evenly distributed seasoned chicken pieces, topped with tomato, onion, and olives. The pizza looks hot, fresh, and appetizing with visible cheese stretch and baked texture. Minimalist style, shot against a pure solid white background with soft, natural shadows. Ultra-sharp focus, 8K resolution, clean and modern fast-food aesthetic.
End Frame: Create a hyper-realistic exploded vertical infographic composition of a chicken pizza.
At the top, a golden oven-baked pizza crust edge with light blistering and baked texture.
Below it, stretchy melted mozzarella cheese floating smoothly with natural cheese pull.
Under the cheese, juicy grilled or crispy chicken chunks with visible seasoning and moisture.
Next, fresh toppings such as sliced bell peppers, olives, and onions suspended mid-air.
Beneath the toppings, a rich tomato sauce layer with glossy depth.
At the bottom, a soft yet crisp pizza base, perfectly centered and aligned.
Pure solid white background, soft studio lighting, and subtle realistic shadows beneath each floating layer.
Ultra-sharp focus, DSLR macro photography look.
Clean, minimalist infographic text labels with thin pointer lines for each layer. Premium, professional, photorealistic food infographic style.
Ultra-cinematic vertical composition of coffee elements suspended in mid-air: cascading roasted coffee beans, chocolate bonbons, swirling latte art in a mid-air coffee cup, splashes of milk and espresso frozen in motion, and fine coffee grounds dusting through the air, all captured in rich brown and cream color tones.
Hyper-detailed textures with glossy liquid surfaces, crema bubbles, and matte bean textures, lit with dramatic high-contrast studio lighting against a deep, velvety black background. Cinematic depth of field, splash photography aesthetic, premium café advertising style. Shot with a virtual Nikon D850, 105mm macro lens, aperture f/4.0, crisp editorial clarity.
A weathered wooden fishing boat with peeling paint and tangled nets rests quietly in shallow turquoise waters near a rocky tropical cove, surrounded by palm trees swaying gently in the breeze; the boat slightly shifts with the calm waves as the camera performs a slow lens push from a high angle, capturing the sun glinting off the water's surface and the boat's faded textures; soft daylight with warm tones and side lighting highlights the serenity of the setting; visual style is realistic and cinematic, evoking a sense of nostalgic solitude.
location: luxury penthouse kitchen, one continuous mini story
Modern luxury penthouse kitchen at golden hour, warm sunlight through floor to ceiling windows, city skyline outside, marble island, copper cookware, a distinctive teal espresso machine, a small crack on the right edge of the marble, a bowl of lemons near the sink. Same main character throughout: a woman chef in a white linen shirt with rolled sleeves and a thin red bracelet on her left wrist. Keep her appearance, outfit, props, lighting, and kitchen layout consistent across every cut.
CUT TO: Wide establishing shot showing the full kitchen layout, skyline, marble island center frame, teal espresso machine on the back counter, lemons by the sink.
CUT TO: Medium shot from the same side of the island as she places a wooden cutting board on the marble, the crack still visible near the right edge.
CUT TO: Close-up on her hands, red bracelet visible, slicing a strawberry tart topping; crumbs and fruit glisten in the same warm light.
CUT TO: Over-shoulder shot as she plates the tart on a matte black plate, teal espresso machine softly blurred in the background.
CUT TO: Insert close-up of espresso pouring from the teal machine into a small cup, crema forming, same golden reflections on the metal.
CUT TO: Medium shot as she carries the plate and cup to the window-side counter; skyline stays in the same direction, sunlight consistent.
CUT TO: Tight close-up as she smiles and adjusts a lemon slice garnish; end on the tart’s glossy surface with the skyline bokeh behind it.
Mini nature documentary, epic and serene. Ultra-detailed natural world cinematography, realistic textures, soft documentary color grade, stable tracking, gentle environmental motion.
CUT TO: Wide dawn landscape of desert dunes and distant mountains; the sky transitions through cool blues into warm orange while long shadows slide across ripples in the sand.
CUT TO: Telephoto shot of a hawk gliding across the frame, wings steady, heat shimmer wavering beneath; the camera tracks smoothly with the bird centered against a layered horizon.
CUT TO: Ground-level close-up of a small lizard pausing on a pebble, blinking; the focus snaps between its textured scales and the sand grains, then it darts forward.
CUT TO: Slow-motion close-up of sand scattering from the lizard’s feet, tiny particles lifting and catching sunlight.
CUT TO: Medium shot of hardy desert plants swaying gently; a beetle crawls over a stem, the camera follows with a calm, deliberate move.
CUT TO: Wide reveal as the dunes open onto a ribbon of water; sunlight glitters on the surface, and distant birds lift off in a thin line.
CUT TO: Final tranquil shot: the river shimmer fills the frame, reflections pulsing softly, ending on a bright glint that fades into calm.
Cinematic close-up of a professional woman in a corporate office, looking directly at camera with confident expression, soft window lighting from left, shallow depth of field, 4K photorealistic quality
Ultra-realistic cinematic cozy atmosphere. High-energy yet serene mood with professional cinematography, sharp focus on the guitarist, and vibrant color grading.
Environment: A pristine, snow-covered arctic landscape at night. A glowing igloo stands in the background. In the foreground, a crackling campfire casts warm, flickering light. Above, a majestic Aurora Borealis (Northern Lights) swirls in green and purple, reflecting off the snow and the group.
Subjects & Attire: A small group of four people (one young woman playing, one old man, one old woman, and one young man) gathered around the fire. They wear authentic winter gear: thick woolen sweaters, fur-lined parkas, and knit hats. The central subject is a charismatic acoustic guitarist playing a classic dreadnought guitar.
Timeline Sequence (15 Seconds):
0.0s - 4.0s (The Scene): A slow, sweeping drone shot reveals the glowing igloo and the campfire. The majestic Aurora Borealis dominates the sky. The group is seen from a distance, establishing a sense of warmth in the vast cold.
4.0s - 8.0s (The Gathering): Low-angle medium shot focusing on the guitarist. In the background, friends move naturally: one sips from a steaming mug, another taps their foot to the beat. Their breath is visible as white vapor in the crisp air.
8.0s - 12.0s (The Focus): A smooth push-in to a close-up of the guitarist’s hands. The camera captures the fingers skillfully moving over the fretboard. Firelight glints off the guitar’s polished wood.
12.0s - 15.0s (The Finale): A wide pull-back shot. The guitarist finishes with a gentle chord. The camera rises toward the shimmering aurora as the music fades into the sound of the wind.
Lighting: Contrast between the warm orange glow of the fire and the cool, ethereal green of the Northern Lights. Dramatic flickering shadows and soft highlights on the snow.
Action: Natural, subtle movements from the group. The guitarist shows deep focus and a gentle smile, inviting the audience into the cozy circle.
Audio: Melodic, finger-picked acoustic guitar layered with the crackle of fire and the subtle whisper of the arctic wind.
Ultra photoreal macro studio photograph of two tiny miniature tennis players playing on top of a kitchen dish sponge, the sponge is rectangular with bright yellow foam sides and a dark green abrasive scrub surface, the scrub surface is used as the tennis court, a realistic miniature tennis net stretched across the middle, one player near the camera hitting a small tennis ball, the other player far in the background, extreme detail, real materials, visible sponge pores and micro texture, realistic shadows and soft studio lighting, shallow depth of field, bokeh, clean beige background, high contrast, 8k, hyper-real, cinematic macro, 100mm macro lens, f/2.8
A high-end intimate shower gel commercial with bold, refined luxury and confident sensuality.
Visual Setup:
A sleek black glass pump bottle labeled "BARBARU — Banana Rule — Intimate Shower Gel".
Placed on an elegant sculptural arrangement of ripe bananas.
Golden gel flows slowly and smoothly over the bottle and fruit.
Deep matte black background with controlled studio lighting.
Action & Camera Direction:
Slow cinematic push-in toward the product.
Golden gel flows in controlled slow motion, rich and glossy.
Subtle camera orbit reveals bottle curves and premium reflections.
Light glides across glass and gel for a luxury skincare-ad finish.
Movement is intentional, confident, editorial.
Voiceover / Dialogue (Professional & Bold):
0–3s (Low, calm, confident male or female voice): "Luxury… begins with confidence."
3–6s: "Designed for intimacy. Crafted for comfort."
6–9s (slightly deeper tone): "BARBARU Banana Rule."
9–10s (soft, premium whisper): "Indulge without compromise."
Timing Breakdown (10s):
0–2s: Fade in from black, product silhouette reveal.
2–5s: Golden gel begins flowing, texture close-ups.
5–8s: Camera orbit, reflections and curves emphasized.
8–10s: Hero shot: product centered, confident stillness.
Mood & Brand Tone:
Professional luxury brand energy. Bold but tasteful sensuality. Calm, confident, expensive presence. Fashion editorial × premium skincare commercial.
Style & Quality:
Cinematic studio lighting, shallow depth of field, ultra-realistic textures, glossy reflections, smooth motion, high-end commercial polish, 8K realism.
Video prompt:
{
  "cinematic_video_request": {
    "meta": {
      "title": "Sadie Sink - Night Drive Portrait",
      "style_preset": "Cinematic Realism",
      "duration_seconds": 10,
      "resolution": "4K",
      "aspect_ratio": "16:9"
    },
    "prompts": {
      "main_prompt": "Ultra-realistic cinematic animation of Sadie Sink sitting in the backseat of a luxury car at night. The city lights outside the window create soft motion blur with passing traffic and glowing streetlights. Subtle camera push-in shot from medium frame to close-up. Gentle movement in her hair as the car moves. She slowly shifts her gaze toward the window, blinking naturally, then looks back toward the camera with a calm, slightly mysterious expression.",
      "visual_modifiers": "4K cinematic quality, realistic skin texture, natural facial micro-expressions, smooth motion, dramatic nighttime mood, film-grade color grading, soft contrast, subtle handheld camera feel, shallow depth of field, bokeh city lights, detailed leather texture.",
      "lighting_prompt": "Soft ambient lighting from streetlights flickers across her face, creating dynamic shadows and warm highlights. Background traffic lights streak smoothly past the window."
    },
    "scene_specifications": {
      "subject": {
        "name": "Sadie Sink",
        "action": "Sitting, gazing out window, turning head to camera, blinking",
        "expression": "Calm, mysterious, natural micro-expressions",
        "details": "Gentle hair movement, realistic skin texture"
      },
      "environment": {
        "setting": "Backseat of luxury car",
        "time": "Night",
        "exterior": "City streets, passing traffic",
        "details": "Detailed leather interior, window reflections"
      },
      "camera": {
        "movement": "Slow push-in (dolly forward)",
        "stabilization": "Slight handheld feel (organic motion)",
        "framing": "Medium shot to Close-up",
        "focus": "Shallow depth of field with background bokeh"
      },
      "lighting": {
        "type": "Dynamic/Transient",
        "sources": "Passing streetlights, city glow",
        "characteristics": "Warm highlights, soft contrast, rhythmic flickering shadows"
      }
    }
  }
}
Ultra-realistic cinematic macro video of an analog luxury watch being hand-assembled by a professional watchmaker. White silk gloves carefully hold the stainless-steel watch case while precision tweezers gently place the hour, minute, and second hands onto a brushed silver sunburst dial. Extreme close-up shots reveal polished indices, fine engravings, and micro-details of the dial texture. Cut to the golden mechanical movement with visible jewels and gears being delicately adjusted. Soft studio lighting with dramatic shadows, shallow depth of field, premium luxury aesthetic. Slow, smooth camera movements, subtle reflections on metal surfaces. Final shot shows the completed watch ticking for the first time, audible tick-tick sound, symbolizing precision and craftsmanship. Photorealistic, 4K quality, cinematic color grading, luxury brand film style.