Kling AI

Get better results from Text-to-Video. This guide reveals the complete prompt formula for mastering Subject, Movement, Scene, Camera Language, and Lighting.

By inputting a text passage, the Kling large model generates a 5-second or 10-second video that translates the text into visual imagery. It currently supports two modes of generation: "Standard Mode" for quicker video production and "Professional Mode" for superior image quality. "Kling" also supports three aspect ratios: 16:9, 9:16, and 1:1, to more diversely meet everyone's video creation requirements.

We recognize that "Prompt" serves as the key interactive language for the text-to-video model, and it directly dictates the content of the video produced by the model. Consequently, understanding and learning how to use effective Prompts for AI video creation is a goal for all users. As the new incarnation of the AI video model 2.0, "Kling" continues to evolve and improve. It's essential to explore continuously and tap into the full potential of Kling to adeptly utilize it and excel in AI video production. We have crafted a formula for Kling prompts for your reference:

💡

Prompt = Subject（Subject Description）+ Subject Movement + Scene（Scene Description）+（Camera Language + Lighting + Atmosphere)

-- optional

Subject: The subject is the main focus in the video, serving as an important embodiment of the theme. It can be people, animals, plants, objects, and so on；

Subject Description: Descriptions of the subject's appearance details and body posture can be listed using multiple short sentences. For example: Athletic performance, Hairstyle and color, Clothing and accessories, Facial features, Body posture and so on；

Subject Movement: Descriptions of the subject's movement status, including stillness and motion, should be straightforward and suitable for a 5-second video；

Scene: The scene represents the environment in which the subject is situated, encompassing the foreground, background, and other elements;

Scene Description: Scene descriptions for the subject's environment can be concise and focused, using a few short sentences to outline the setting without overwhelming the viewer. It should be suitable for what can be displayed within a 5-second video. Such as Indoor scene, Outdoor setting, Natural scene;

Camera Language: It pertains to employing various applications of the camera lens, along with the transitions and edits between shots, to communicate a narrative or message and to generate particular visual impacts and emotional tones. Techniques include ultra-wide angle shots, bokeh (background blur), close-ups, telephoto shots, low-angle shots, high-angle shots, aerial views, and depth of field, among others; (Note: This should be differentiated from camera motion control.)

Lighting: Light and shadow are the vital elements that imbue photographic works with soul. The application of light and shadow can make photos more profound and emotionally resonant, enabling us to create works with a rich sense of depth and expressive power. Techniques include:Ambient lighting, Morning light, Sunset, Interplay of light and shadow, Tyndall effect, Artificial lighting;

Atmosphere: Describing the atmosphere of the anticipated video footage can involve various elements to set the mood and tone.

The most fundamental components of the aforementioned formula are the subject, motion, and setting, which constitute the most straightforward and essential units for depicting a video scene. To provide a more detailed description of the subject and setting, one should enumerate various descriptive short sentences, maintaining the integrity of the elements intended to appear in the Prompt. "Kling" will then extrapolate from our expressions to produce a video that aligns with our vision.

Given "A giant panda is reading a book in a café," we can enrich the details of the subject and scene by adding: "A giant panda, wearing black-rimmed glasses, is reading a book in a café, with the book resting on a table where a steaming cup of coffee sits beside it, next to the café's window." This creates a more specific and manageable image. If you want to add some cinematic language and lighting ambience, we can also try to "Shot in medium range, with a blurred background and atmospheric lighting, a giant panda, adorned with black-rimmed glasses, is seen reading a book in a café. The book lies on a table, accompanied by a steaming cup of coffee, steaming hot, next to the cafe windows, movie-level color palette". The texture of the video generated in this way will be further enhanced, and it is possible to get results beyond expectations.

prompt

A giant panda is reading a book in a café.

A giant panda wearing black-framed glasses is reading a book in a café, with the book placed on the table. On the table, there is also a cup of coffee emitting steam, and next to it is the café's window.

In the shot, a medium shot with a blurred background and ambient lighting captures a scene where a giant panda, adorned with black-framed glasses, is reading a book in a café. The book rests on the table, accompanied by a cup of coffee that's steaming gently. Beside the cozy setting is the café's window, with a cinematic color grading applied to enhance the visual appeal.

video

The purpose of the formula is to help everyone more effectively describe the video scenes they envision. We can also let our imagination run wild and not be limited by the formula, to communicate freely and boldly with "Kling," which might yield even more astonishing outcomes! Here are some excellent examples shared by creators, let's check them out~

Some high-quality examples

--Video examples below are shared by Kling creators

video
Parameters	Prompt: A giant panda is eating hot pot with chopsticks, with the street as the background. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: A Pikachu is sitting on a chair, drinking coffee and reading a newspaper. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: A polar bear is playing the violin in the snow. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: A bee with a puppy's head Ratio: 16:9 Mode: Standard Mode length: 5s
video
prompt	Prompt: Morning mist, sunrise, lens flare, and a cool breeze. A young Chinese woman with exquisite facial features, her long hair blown by the wind, strands of hair scattered across her face, dressed in summer attire, with a seaside beach as the backdrop. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: Indoor shooting, close-up, a Chinese child is eating dumplings. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: A beautiful girl with Chinese style Ratio: 16:9 Mode: Standard Mode length: 10s	Prompt: A Chinese little girl is holding a pink balloon and smiling happily in the playground, with a slide in the background. Ratio: 16:9 Mode: Standard Mode length: 10s
video
prompt	Prompt: Aerial shot, blue waves pounding against the rocks, a magnificent and magnificent scene. Ratio: 16:9 Mode: Standard Mode length: 10s	Prompt: A medieval sailing ship sailing on the sea, a foggy night, bright moonlight, and an eerie atmosphere. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: First-person perspective, high-speed flight, symmetrical composition, rotation, countless lightning bolts amidst dark clouds, motion blur. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: The camera zooms into a beacon tower on the Great Wall, first-person perspective, high-speed flight, symmetrical composition, motion blur, and atmospheric lighting. Ratio: 16:9 Mode: Standard Mode length: 5s
video
prompt	Prompt: A space fighter jet speeds through a huge sci-fi internal tunnel, rushes out of the tunnel into space, and a space battle can be seen at the end of the tunnel. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: A racing car is racing on the surface of the moon against a space backdrop, with tilt-shift zoom effect. Ratio: 16:9 Mode: Professional Mode length: 5s	Prompt: Aerial shot of a cyberpunk city. Ratio: 16:9 Mode: Standard Mode length: 10s	Prompt: On an alien planet, the streetscape of a cyberpunk city, with futuristic buildings, the camera slowly advances forward, and there are pedestrians on the street. Ratio: 16:9 Mode: Professional Mode length: 5s
video
prompt	Prompt: A woman is engaged in a gunfight with someone in an alley, with a Blade Runner-style atmosphere, neon lights, and ambient lighting. Ratio: 16:9 Mode: Professional Mode length: 5s	Prompt: First-person perspective, a man driving a car on a night street with fireworks blooming ahead. Ratio: 16:9 Mode: Standard Mode length: 5s	Prompt: A circling camera shot captures a handsome young man dressed in ancient clothing, wearing white, seated by the pond with his eyes closed, meditating. Ratio: 16:9 Mode: Professional Mode length: 5s	Prompt: The back view of a woman, in a red long gown, standing on the rooftop, with buildings smoking in the distance. Ratio: 16:9 Mode: Standard Mode length: 5s

Tips
- Use simple words and sentence structures, avoiding overly complex language;
- Keep the visual content as simple as possible, aiming for a completion within 5 to 10 seconds;
- Using words like "Oriental mood," "China," and "Asia" can more easily generate a Chinese style and depict Chinese people;
- Current large video models are not sensitive to numbers, making it difficult to maintain consistency in counts, such as "10 puppies on the beach";
- For a split-screen scene, you can use a prompt like: "4 camera angles, representing spring, summer, autumn, and winter."；
- At the current stage, it is challenging to generate complex physical movements, such as the bouncing of a ball or the trajectory of a high-altitude throw；

（Updating, welcome to add more）

相关推荐

Standard Mode & Professional Mode

Camera Movement

Start and End Frames

Motion Brush

创作工具 ▼

开发者平台 ▼

关于我们 ▼