Key Features of Kling O3 Edit
All-in-One Reference 3.0: Enhanced consistency and stronger multimodal response using image/video/element/text prompts.
Elements 3.0: Video-character reference with visual and audio capture, plus voice binding on character elements.
Storyboard Narration 3.0: Flexible duration with custom shots and precise multi-shot control, up to 15s.
All-in-One Reference 3.0
Kling VIDEO 3.0 Omni treats uploaded images, videos, elements, and text as prompts in one unified model. Compared to Kling VIDEO O1, it improves element consistency, text responsiveness, and dynamic quality while reducing visual distortions.
Example 1
Element/Reference Image
@Kling Lipstick




@Image

Text Description
Pure black background. In the darkness, a river of color, matching the @Kling Lipstick shade, streaks across, leaving a rich, flawless trail. The trail then "comes alive," flowing like liquid and elegantly spreading and blending on the surface to form patterned designs @Image. The color river then gathers into the lipstick bullet of @Kling Lipstick resting on water. Soft water surrounds it with budding flowers that slowly bloom, gentle ripples forming across the surface.
Outputs
Example 2
Element/Reference Image
@Boxer A

@Boxer B

Scene-Rooftop

Text Description
Shot 1 (2s): Wide shot, @Boxer A and @Boxer B face off in the center of the rooftop, feet apart in a boxing stance. Shot 2 (2s): Both move in, testing each other up close: @Boxer A throws a quick punch, @Boxer B sidesteps and blocks. Shot 3 (3s): @Boxer A continues the attack, landing a punch on @Boxer B's head, and @Boxer B retaliates. Shot 4 (4s): Wide shot, the two boxers continue their intense fight. Shot 5 (2s): A bird's-eye view of the scene shows the two separated and having stopped fighting.
Outputs
Example 3
Element/Reference Image
@Male Protagonist



@Female Protagonist




Text Description
Long take. On a windy day in an Icelandic mountain range, @Male Protagonist says with a barely contained smile, "Do you think our wedding is too simple, like there's no one here to bless us?" The camera circles the subjects to reveal @Female Protagonist standing opposite, smiling and replying, "The wind, the wind is their blessing to us." Cinematic, handheld feel.
Outputs
From Kling AI Creative Partner @FOS
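The examples above address each uploaded asset by an @ name inside the text. Before submitting a prompt, it can help to confirm that every @ mention matches an asset you actually uploaded. The sketch below is a hypothetical local helper, not part of any Kling API: the function name `check_references` and its return shape are assumptions made for illustration.

```python
def check_references(prompt: str, element_names: list[str]) -> tuple[list[str], list[str]]:
    """Return (unused_elements, unknown_mentions) for a prompt.

    element_names are the asset names without the leading "@".
    Names may contain spaces, so each "@" is matched against the
    known names, longest first, instead of being split on whitespace.
    """
    names = sorted(element_names, key=len, reverse=True)
    used: set[str] = set()
    unknown: list[str] = []

    pos = prompt.find("@")
    while pos != -1:
        rest = prompt[pos + 1:]
        match = next((n for n in names if rest.startswith(n)), None)
        if match is None:
            # Record the first word after "@" so the mention is easy to locate.
            words = rest.split()
            unknown.append(words[0] if words else "")
        else:
            used.add(match)
        pos = prompt.find("@", pos + 1)

    unused = [n for n in element_names if n not in used]
    return unused, unknown


prompt = (
    "Wide shot, @Boxer A and @Boxer B face off. "
    "@Boxer A lands a punch on @Boxer B's head."
)
unused, unknown = check_references(prompt, ["Boxer A", "Boxer B", "Scene-Rooftop"])
print(unused)   # elements uploaded but never mentioned
print(unknown)  # @ mentions with no matching element
```

An unused element is not necessarily an error (a scene image may be intended as background only), so the helper reports it rather than rejecting the prompt.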
Elements 3.0
Upload or record a 3-8s single-character video to build a reusable character asset that captures both likeness and voice. Multi-image character elements can also be bound to an uploaded voice recording for stronger lip-sync and expression.
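As a rough pre-upload check, the 3-8s clip length can be verified locally before building an element. This is a minimal sketch under stated assumptions: `CharacterElement` and `clip_problems` are hypothetical names, the clip length is supplied by the caller, and treating both bounds as inclusive is an assumption this page does not confirm.

```python
from dataclasses import dataclass

# Bounds quoted on this page; inclusive comparison is an assumption.
MIN_CLIP_SECONDS = 3.0
MAX_CLIP_SECONDS = 8.0


@dataclass
class CharacterElement:
    name: str            # the @ name used in prompts, e.g. "Grace"
    clip_seconds: float  # length of the uploaded or recorded source video


def clip_problems(element: CharacterElement) -> list[str]:
    """Return human-readable problems with the element's source clip."""
    problems = []
    if element.clip_seconds < MIN_CLIP_SECONDS:
        problems.append(f"@{element.name}: clip is shorter than {MIN_CLIP_SECONDS:g}s")
    elif element.clip_seconds > MAX_CLIP_SECONDS:
        problems.append(f"@{element.name}: clip is longer than {MAX_CLIP_SECONDS:g}s")
    return problems


for element in [CharacterElement("Grace", 5.0), CharacterElement("Alan", 2.5)]:
    print(element.name, clip_problems(element))
```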
Example 1
Element/Reference Image
@Grace

@Alan

@Samoyed



@Image

Prompt
Shot 1 (3s): Mid-shot, background @Image. @Grace sits on the sofa eating cookies as @Alan walks in holding @Samoyed. @Samoyed lunges for the cookie in @Grace's hand. @Grace says, "Hey! Watch your dog!" Shot 2 (2s): @Alan sits beside her, pulling the leash and lifting @Samoyed. Close-up, @Alan says, "He just likes cookies more than me." Shot 3 (3s): Close-up, @Grace smiles and says, "Well, he has good taste at least."
Outputs
Example 2
Element/Reference Image
@Little Scholar




@Reference Image

Prompt
Shot 1 (3s): Close-up on the comedy open-mic stage @Reference Image, with a large retro neon "KLING" sign in the background. Warm golden backlight outlines the scene. The camera follows the performer as they walk to the microphone, lightly adjusting its height. Shot 2 (4s): Mid-close shot of @Little Scholar, who says in Chinese, "I actually lost to Kid. How many days has he even held a job? And he's teaching everyone how to be happy at work." Shot 3 (4s): @Little Scholar with a restrained, slight smile, naturally pausing, saying in Chinese, "Just listen to that: five minutes spent proving such a false proposition." Shot 4 (2s): Switch to the audience laughing loudly.
Outputs
Example 3
Element/Reference Image
@Explorer



Prompt
@Explorer is live, welcoming everyone to her world. She says, "Do you know what the most interesting thing in the world is? It's going on an adventure with me! The next stop is the Atlantic Ocean!" Cut to a panoramic view of the Atlantic, where @Explorer is steering through a storm.
Outputs
Example 4
Element/Reference Image
@Sculpture



@Image

Prompt
Top-down wide shot: @Sculpture stands at the center of @Image. Mid-shot, side view: The camera circles around @Sculpture once. Close-up: @Sculpture's hand moves slightly. Close-up, face: @Sculpture says, "I'm back."
Outputs
Storyboard Narration 3.0
Kling VIDEO 3.0 Omni keeps flexible duration and adds native custom multi-shot control. Users can specify shot duration, framing, angle, narrative content, and camera movement for coherent transitions.
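The "Shot N (Ds): description" pattern used throughout this page can be assembled from a plain list of shots. The sketch below is a hypothetical helper, not a Kling API: `Shot`, `build_storyboard_prompt`, and the choice to reject plans longer than 15s (the maximum duration stated on this page) are assumptions for illustration.

```python
from dataclasses import dataclass

MAX_TOTAL_SECONDS = 15  # maximum video duration stated on this page


@dataclass
class Shot:
    seconds: int      # duration of this shot
    description: str  # framing, angle, action, dialogue, camera movement


def build_storyboard_prompt(shots: list[Shot]) -> str:
    """Join shots into one 'Shot N (Ds): ...' prompt string."""
    if not shots:
        raise ValueError("at least one shot is required")
    if any(shot.seconds <= 0 for shot in shots):
        raise ValueError("every shot needs a positive duration")
    total = sum(shot.seconds for shot in shots)
    if total > MAX_TOTAL_SECONDS:
        raise ValueError(f"total duration {total}s exceeds {MAX_TOTAL_SECONDS}s")
    return " ".join(
        f"Shot {index} ({shot.seconds}s): {shot.description}"
        for index, shot in enumerate(shots, start=1)
    )


print(build_storyboard_prompt([
    Shot(1, "Wide shot of an old green train moving forward."),
    Shot(2, "Cut to a close-up of @Cindy looking out the window."),
]))
```

Keeping durations as data rather than typing them into the text makes it easy to rebalance shots without exceeding the total.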
Example 1
Element/Reference Image
@Mike
@Cindy
@Image

Prompt
Shot 1 (1s): Mike and Cindy sit face to face on the seats of an old green train, the train moving forward. Shot 2 (2s): Cut to a close-up of Cindy's profile. She rests her chin on her hand, looking out the window, asking, "Where are we about to go?" Shot 3 (3s): Cut to a close-up of Mike's face. He looks at Cindy and says, "We are about to go to a place where it is summer all year round." Shot 4 (2s): Cut to Cindy turning around, looking at Mike, smiling and nodding, saying, "I love summer." Shot 5 (2s): Cut to a wide shot of the two facing each other, smiling at one another.
Outputs
Example 2
Element/Reference Image
@Element1

@Element2

Prompt
Shot 1 (3s): Wide shot. A neon-lit street corner late at night, wet pavement reflecting lights. @Element1 leans against a red phone booth, smoking, with strong motion blur. Shot 2 (2s): Cut to close-up. @Element1's profile is half-hidden in shadow. He looks down and asks, "You still haven't decided which road to take?" Shot 3 (4s): Cut to close-up of @Element2: lips and swaying earrings. She flips a coin and says, "I heard there's a place where people never ask for directions." Shot 4 (3s): Cut to mid-shot. @Element1 lets out a self-mocking smile, exhales smoke that obscures his face, and says, "A place like that must be lonely." Shot 5 (3s): Cut to long shot. @Element1 and @Element2 face each other, blurred headlights flowing between them. City noise drops to silence as they slowly fade into the glow.
Outputs
Example 3
Element/Reference Image
@Image

@Goro

@Kaiko

Prompt
[00:00 - 00:02] Medium shot: @Goro gestures emphatically with a lit cigarette while walking towards a locker, smoke curling around his hand as he punctuates each beat of his point. Audio: The faint, organic crackle of the cigarette tip under his words. [00:02 - 00:04] Close-up: @Goro's weathered face fills the frame, eyes wide, intensity sharpened, jaw working as he speaks like he's carving the truth into the air. Audio: Cigarette crackle continues; room tone low and tight. [00:04 - 00:06] Cutaway: @Kaiko, a young woman with a blonde buzzcut and a scar on her eyebrow, looks down at her athletic-taped hands: stoic, absorbing, refusing to react. Audio: Crackle softens slightly; her breath is barely audible. [00:06 - 00:08] Close-up: Goro's mouth forms the word "pop"; a small puff of white smoke escapes on the consonant. Audio: A tiny smoke-breath exhale overlays the cigarette's crackle. [00:08 - 00:10] Medium shot: @Goro leans his back against a row of dented industrial metal lockers, crossing his arms while still holding the cigarette, settling into authority, like the room belongs to him. Goro: "You opened it, pop, and heat hit your face. Now? Wax paper. Burger sweats, gets soggy. Bun dissolves into meat. Mush of good intentions. No boundary. No definition."
Outputs
From Kling AI Creative Partner @Nigel Watson
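Example 3 marks its shots with timecode ranges such as [00:00 - 00:02] instead of "Shot N (Ds)". A list of shot durations can be converted to that notation with a small helper. The function names here are hypothetical, and whether the model treats the two notations differently is not stated on this page.

```python
def timecode(seconds: int) -> str:
    """Format a whole number of seconds as MM:SS."""
    minutes, secs = divmod(seconds, 60)
    return f"{minutes:02d}:{secs:02d}"


def timecode_ranges(durations: list[int]) -> list[str]:
    """Turn per-shot durations into consecutive '[MM:SS - MM:SS]' labels."""
    ranges = []
    start = 0
    for duration in durations:
        end = start + duration
        ranges.append(f"[{timecode(start)} - {timecode(end)}]")
        start = end
    return ranges


# Five 2-second shots, matching the layout of Example 3.
for label in timecode_ranges([2, 2, 2, 2, 2]):
    print(label)
```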
All-in-One Reference 3.0 Showcase
Example 1
Reference Materials

Prompt
Lipstick fluid narrative on black background with product-forming finale
Output
Example 2
Reference Materials

Prompt
Five-shot rooftop boxing with persistent boxer identity
Output
Storyboard Narration 3.0 Showcase
Example 1
Reference

Prompt
Five-shot train dialogue with explicit line ordering
Output
Kling VIDEO 3.0 Omni Capabilities Upgrade
| Capability | Kling VIDEO O1 | Kling VIDEO 3.0 Omni |
|---|---|---|
| Text-to-Video | No Native Audio, No Multi-shot | ✅ Supports Native Audio and Multi-shot |
| Image-to-Video | | |
| Start & End Frames-to-Video | | |
| Multi-image Reference | | |
| Element Reference | | |
| Video Element Reference | Not supported | ✅ Supports uploading/recording video elements |
| Added Element Voice Control | Not supported | ✅ Supports adding voice to elements |
| Video Duration | Up to 10s | ✅ Up to 15s |
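The differences in the table can be captured as data so that a planned generation is checked against the chosen model before submission. This is a sketch under assumptions: the dictionary keys and the `unsupported` function are hypothetical names, and the `native_audio` and `multi_shot` flags are taken from the Text-to-Video row only, since the table leaves the other generation modes blank.

```python
# Values transcribed from the comparison table; key names are illustrative.
LIMITS = {
    "Kling VIDEO O1": {
        "max_seconds": 10,
        "native_audio": False,
        "multi_shot": False,
        "video_element": False,
        "element_voice": False,
    },
    "Kling VIDEO 3.0 Omni": {
        "max_seconds": 15,
        "native_audio": True,
        "multi_shot": True,
        "video_element": True,
        "element_voice": True,
    },
}


def unsupported(model: str, seconds: int, features: set[str]) -> list[str]:
    """List requested features (and duration) the model does not cover."""
    limits = LIMITS[model]
    # Features missing from the table are treated as unsupported.
    issues = [name for name in sorted(features) if not limits.get(name, False)]
    if seconds > limits["max_seconds"]:
        issues.append(f"duration over {limits['max_seconds']}s")
    return issues


request = {"multi_shot", "element_voice"}
print(unsupported("Kling VIDEO O1", 12, request))
print(unsupported("Kling VIDEO 3.0 Omni", 12, request))
```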
How To Use Kling O3 Edit AI Video Model on skills.video
Select the Kling O3 Edit model
Head to the create page and choose this model from the dropdown list.
Input your detailed prompt
Describe the scene, style, and motion you want. Adjust settings as needed.
Download your video
Click create, then download or share once the generation finishes.