kling-v3 is Kuaishou’s unified multimodal video model. It offers native 4K/60fps output, AI Director multi-shot storyboarding, multilingual native audio, and strong character consistency, and it combines video understanding, generation, and editing in one workflow.
| Provider | Kling |
| Tasks | text-to-video · image-to-video |
| Starting from | 0.0740 USD / call |
Authentication uses a Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
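As a minimal sketch of the header format described above, the helper below builds the header in Python. The header name `Authorization` is an assumption based on the standard HTTP Bearer scheme (the page does not name the header), and `build_auth_headers` is a hypothetical helper, not part of any official SDK.

```python
def build_auth_headers(token: str) -> dict:
    """Return request headers carrying the token in 'Bearer <token>' form.

    Assumption: the header name is the standard HTTP 'Authorization' header.
    """
    token = token.strip()
    if not token:
        raise ValueError("auth token must not be empty")
    return {"Authorization": f"Bearer {token}"}


if __name__ == "__main__":
    # Placeholder token for illustration only.
    print(build_auth_headers("YOUR_TOKEN"))
```

The returned dictionary can be passed as the `headers` argument of whichever HTTP client you use.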
prompt is required for kling-v3
kling-v3 prompt is required for image-to-video
duration must be between 3 and 15 seconds
extends must be a valid object
extends.audio must be a boolean
extends.cfg_scale must be between 0 and 1
negative_prompt must be at most 2500 characters
first_frame_image must be a valid URL
last_frame_image must be a valid URL
size must map to aspect_ratio 16:9, 9:16 or 1:1
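The validation messages above can be mirrored client-side before a request is sent. The following is a sketch under several assumptions, not the service's actual validator: all fields are taken to be top-level keys of a JSON object (with `audio` and `cfg_scale` nested under `extends`), optional fields are only checked when present, range bounds are treated as inclusive, and `size` is assumed to be a `WIDTHxHEIGHT` string such as `1280x720`. The function name `validate_kling_v3_request` is hypothetical. Both task types share the first prompt message here.

```python
from fractions import Fraction
from urllib.parse import urlparse

# Aspect ratios named in the validation message: 16:9, 9:16 and 1:1.
ALLOWED_RATIOS = {Fraction(16, 9), Fraction(9, 16), Fraction(1, 1)}


def _is_number(value):
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, (int, float)) and not isinstance(value, bool)


def _is_url(value):
    # Rough check: a string with an http(s) scheme and a host part.
    if not isinstance(value, str):
        return False
    parts = urlparse(value)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def _size_ratio(size):
    # Assumption: size looks like "1280x720". Returns None if unparseable.
    try:
        width, height = (int(p) for p in str(size).lower().split("x"))
    except ValueError:
        return None
    if width <= 0 or height <= 0:
        return None
    return Fraction(width, height)


def validate_kling_v3_request(payload):
    """Return a list of error messages; an empty list means no rule failed."""
    errors = []

    prompt = payload.get("prompt")
    if not isinstance(prompt, str) or not prompt.strip():
        errors.append("prompt is required for kling-v3")

    if "duration" in payload:
        duration = payload["duration"]
        if not _is_number(duration) or not 3 <= duration <= 15:
            errors.append("duration must be between 3 and 15 seconds")

    if "extends" in payload:
        extends = payload["extends"]
        if not isinstance(extends, dict):
            errors.append("extends must be a valid object")
        else:
            if "audio" in extends and not isinstance(extends["audio"], bool):
                errors.append("extends.audio must be a boolean")
            if "cfg_scale" in extends:
                cfg = extends["cfg_scale"]
                if not _is_number(cfg) or not 0 <= cfg <= 1:
                    errors.append("extends.cfg_scale must be between 0 and 1")

    negative = payload.get("negative_prompt")
    if negative is not None and (
        not isinstance(negative, str) or len(negative) > 2500
    ):
        errors.append("negative_prompt must be at most 2500 characters")

    for field in ("first_frame_image", "last_frame_image"):
        if field in payload and not _is_url(payload[field]):
            errors.append(f"{field} must be a valid URL")

    if "size" in payload and _size_ratio(payload["size"]) not in ALLOWED_RATIOS:
        errors.append("size must map to aspect_ratio 16:9, 9:16 or 1:1")

    return errors
```

For example, `validate_kling_v3_request({"prompt": "a cat", "duration": 5, "size": "1280x720"})` returns an empty list, while a payload with `"size": "1024x768"` is rejected because 4:3 is not one of the listed aspect ratios.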