Alibaba

wan2.6

Alibaba’s multimodal video generation model series supporting role-play (reference-to-video), multi-shot narrative, audio-visual sync, and up to 15-second output, enabling creators to star in AI videos with their own appearance and voice.


Provider	Alibaba
Tasks	text-to-video · image-to-video
Starting from	0.3049 USD / call · Pricing details

POST /api/v1/video-generation

wan2.6

curl --request POST \
  --url https://api.linkai.one/api/v1/video-generation \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "wan2.6",
  "prompt": "<string>",
  "audio_url": "<string>",
  "duration": 5,
  "first_frame_image": "<string>",
  "resolution": "720P"
}
'

{
  "code": 0,
  "data": {
    "task_id": "cb6111cf-e89f-4978-b8e3-aa59c21cceff",
    "order_id": "019da9cf-d1db-78e1-ac23-526774d01193",
    "status": "processing",
    "price": 0.074
  },
  "msg": "success",
  "request_id": "250b66dd-e1fb-4bb6-b5f6-c668efadc35d"
}

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
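As a rough Python counterpart to the curl example, the sketch below builds the same authenticated POST request without sending it. The endpoint URL and header names come from the curl call; the helper name `build_request` is hypothetical, and only the standard library is assumed.

```python
import json
import urllib.request

# Endpoint taken from the curl example in this reference.
API_URL = "https://api.linkai.one/api/v1/video-generation"


def build_request(token: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) an authenticated POST request.

    `build_request` is an illustrative helper, not part of any official SDK.
    """
    if not token:
        raise ValueError("an auth token is required")
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Bearer scheme, as described under Authorizations.
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would submit the task; keeping construction separate from sending makes the headers easy to inspect first.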

Body

application/json

model

enum<string>

required

model is required

Available options:

wan2.6

prompt

string

required

prompt must be between 1 and 2000 characters

Required string length: 1 - 2000

audio_url

string

audio_url must be a valid http/https URL

duration

enum<integer>

duration must be 5, 10, or 15 seconds

Available options:

5

10

15

first_frame_image

string

first_frame_image is required

resolution

enum<string>

resolution must be 720P or 1080P

Available options:

720P

1080P
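The body constraints listed above can be checked client-side before a call is made. The following is a minimal sketch, assuming a hypothetical helper named `build_payload`; it enforces only the rules stated in this reference (prompt length, http/https audio URL, duration and resolution enums). Server defaults for omitted fields are not documented here, so optional fields are simply left out when not supplied, and `first_frame_image` is passed through unchanged when given.

```python
from urllib.parse import urlparse

ALLOWED_DURATIONS = {5, 10, 15}          # seconds
ALLOWED_RESOLUTIONS = {"720P", "1080P"}


def build_payload(prompt, first_frame_image=None, audio_url=None,
                  duration=None, resolution=None):
    """Return a request body dict, raising ValueError on invalid input.

    Hypothetical helper for illustration; not part of an official SDK.
    """
    if not 1 <= len(prompt) <= 2000:
        raise ValueError("prompt must be between 1 and 2000 characters")
    payload = {"model": "wan2.6", "prompt": prompt}

    if first_frame_image is not None:
        payload["first_frame_image"] = first_frame_image

    if audio_url is not None:
        parsed = urlparse(audio_url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError("audio_url must be a valid http/https URL")
        payload["audio_url"] = audio_url

    if duration is not None:
        if duration not in ALLOWED_DURATIONS:
            raise ValueError("duration must be 5, 10, or 15 seconds")
        payload["duration"] = duration

    if resolution is not None:
        if resolution not in ALLOWED_RESOLUTIONS:
            raise ValueError("resolution must be 720P or 1080P")
        payload["resolution"] = resolution

    return payload
```

Validating locally avoids spending a call on a request the API would reject anyway; the server remains the authority on what is accepted.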

Response

Task accepted and queued for generation.

code

integer

Example:

0

data

object


msg

string

Example:

"success"

request_id

string<uuid>

Example:

"250b66dd-e1fb-4bb6-b5f6-c668efadc35d"
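Reading the response might look like the sketch below. It assumes that `code` equal to 0 signals acceptance, as in the example response, and that any other value indicates an error; the error semantics are not spelled out in this reference, and `parse_submit_response` is a hypothetical helper name.

```python
import json


def parse_submit_response(body: str) -> dict:
    """Return the `data` object of an accepted task submission.

    Assumption: a non-zero `code` means the request was not accepted.
    """
    doc = json.loads(body)
    if doc.get("code") != 0:
        raise RuntimeError(
            f"request failed: code={doc.get('code')} "
            f"msg={doc.get('msg')} request_id={doc.get('request_id')}"
        )
    return doc["data"]


# Example body copied from the response sample in this reference.
example = """
{
  "code": 0,
  "data": {
    "task_id": "cb6111cf-e89f-4978-b8e3-aa59c21cceff",
    "order_id": "019da9cf-d1db-78e1-ac23-526774d01193",
    "status": "processing",
    "price": 0.074
  },
  "msg": "success",
  "request_id": "250b66dd-e1fb-4bb6-b5f6-c668efadc35d"
}
"""
task = parse_submit_response(example)
print(task["task_id"], task["status"])
```

Since the task is only queued at this point (`status` is `processing`), keeping `task_id` and `request_id` is useful for follow-up and support requests.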


