Kling v2.6 Pro

Adds Kling v2.6 Pro for image-to-video with optional native audio and GPT-5.2 for the AI editor agent, and improves video editor and preview handling.

Note: Kling v2.6 Pro is Kuaishou's latest image-to-video model with native audio-visual co-generation. Learn more at Kling AI and try it directly.

We've added Kling v2.6 Pro as a video model for image-to-video generation. Turn your images into cinematic video clips with optional native audio—voiceovers, sound effects, and ambient sounds generated in a single pass.

About Kling v2.6 Pro

Kling v2.6 Pro is Kuaishou's state-of-the-art image-to-video model with native audio-visual co-generation. Our integration brings the following benefits:

  • Native Audio-Visual Co-Generation: Generate visuals, voiceovers, sound effects, and ambient sounds in one pass—no separate dubbing workflow needed.
  • Image-to-Video: Transform static images into dynamic video clips with smooth, cinematic motion.
  • Character-Synced Speech: Lip movements are synchronized to the generated voiceover, so on-screen speech matches the audio.
  • Bilingual Support: Strong voice generation in both Chinese and English.
  • Professional Output: Up to 1080p HD, 30 FPS, with support for 16:9, 9:16, and 1:1 aspect ratios.

What's New

  • Kling v2.6 Pro: A new image-to-video model with optional native audio. Turn your images into video with Kling's latest-generation model.
  • GPT-5.2 for AI Editor Agent: GPT-5.2 is now available for the AI editor agent, offering more capable assistance when editing and generating videos.
  • Video Editor & Preview Improvements: Better handling of video previews and editor behavior for a smoother creation experience.
  • Agent Tooling: Kling v2.6 Pro is wired into the agent tool map so AI assistants can use it when generating videos. Voiceover vs. music tools are selected based on your audio preferences.
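
The agent tooling item can be pictured as a lookup plus a preference check. The following is a minimal Python sketch of that idea, not the product's actual implementation: the model id `kling-v2.6-pro`, the tool names, the `AudioPreference` values, and the `select_tools` function are all hypothetical names introduced for illustration.

```python
from enum import Enum


class AudioPreference(Enum):
    """Hypothetical audio preference values (assumed, not from the product)."""
    NONE = "none"
    VOICEOVER = "voiceover"
    MUSIC = "music"


# Hypothetical tool map: model id -> tool name exposed to the agent.
VIDEO_TOOL_MAP = {
    "kling-v2.6-pro": "generate_video_kling_v2_6_pro",
}

# Hypothetical audio tools keyed by preference; NONE has no entry.
AUDIO_TOOLS = {
    AudioPreference.VOICEOVER: "generate_voiceover",
    AudioPreference.MUSIC: "generate_music",
}


def select_tools(model_id: str, audio: AudioPreference) -> list[str]:
    """Return the video tool for the model, plus an audio tool if one is preferred."""
    if model_id not in VIDEO_TOOL_MAP:
        raise KeyError(f"no tool registered for model {model_id!r}")
    tools = [VIDEO_TOOL_MAP[model_id]]
    audio_tool = AUDIO_TOOLS.get(audio)
    if audio_tool is not None:
        tools.append(audio_tool)
    return tools
```

Under these assumptions, `select_tools("kling-v2.6-pro", AudioPreference.VOICEOVER)` returns the video tool followed by the voiceover tool, while `AudioPreference.NONE` returns the video tool alone.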

How It Works

Select Kling v2.6 Pro in your video model settings when creating or generating videos. Use an image as your starting frame and optionally enable native audio. The model supports 5- or 10-second clips and flexible aspect ratios. Our AI editor agent can now better understand prompts and recommend or use Kling v2.6 Pro when it's the right fit for your request.
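
The options described above can be expressed as a small settings check. This is a minimal Python sketch under stated assumptions rather than the product's API: the `KlingRequest` class, its field names, its default values, and the `validate` function are hypothetical. Only the allowed clip lengths (5 or 10 seconds) and aspect ratios (16:9, 9:16, 1:1) are taken from this note.

```python
from dataclasses import dataclass

# Values stated in this release note.
ALLOWED_DURATIONS_S = (5, 10)
ALLOWED_ASPECT_RATIOS = ("16:9", "9:16", "1:1")


@dataclass(frozen=True)
class KlingRequest:
    """Hypothetical settings object; field names and defaults are illustrative."""
    image_path: str              # starting frame (required)
    duration_s: int = 5          # assumed default
    aspect_ratio: str = "16:9"   # assumed default
    native_audio: bool = False   # native audio is optional


def validate(req: KlingRequest) -> list[str]:
    """Return a list of problems; an empty list means the settings look valid."""
    problems = []
    if not req.image_path:
        problems.append("a starting image is required")
    if req.duration_s not in ALLOWED_DURATIONS_S:
        problems.append(f"duration must be one of {ALLOWED_DURATIONS_S} seconds")
    if req.aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        problems.append(f"aspect ratio must be one of {ALLOWED_ASPECT_RATIOS}")
    return problems
```

Returning a list of problems instead of raising on the first one lets a settings form show every invalid field at once.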

Learn more about Kling and Kuaishou's AI video models at kling.kuaishou.com.