Skip to main content
Reference-to-video

Wan — Generate from reference

Submit reference video

POST
/services/aigc/video-generation/video-synthesis
curl --location 'https://dashscope-intl.aliyuncs.com/api/v1/services/aigc/video-generation/video-synthesis' \
  -H 'X-DashScope-Async: enable' \
  -H "Authorization: Bearer $DASHSCOPE_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
  "model": "wan2.6-r2v-flash",
  "input": {
    "prompt": "Character2 sits on a chair by the window, holding character3, and plays a soothing American country folk song next to character4. Character1 says to Character2: \"that sounds great\"",
    "reference_urls": [
      "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/en-US/20260205/aacgyk/wan-r2v-role1.mp4",
      "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/en-US/20260205/mmizqq/wan-r2v-role2.mp4",
      "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20260129/qpzxps/wan-r2v-object4.png",
      "https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20260129/wfjikw/wan-r2v-backgroud5.png"
    ]
  },
  "parameters": {
    "size": "1280*720",
    "duration": 10,
    "audio": true,
    "shot_type": "multi",
    "watermark": true
  }
}'
{
  "request_id": "<string>",
  "output": {
    "task_id": "<string>",
    "task_status": "PENDING"
  }
}
Generate natural, lifelike performance videos from multimodal input (text, image, or video). Use a person or object as the main character.
  • Basic capabilities: Set duration (2–10s), resolution (720P/1080P), and watermarks.
  • Character portrayal: Replicate appearance from reference image or video. Videos also replicate voice timbre. Supports single or multi-character performances.
  • Multi-shot narrative: Intelligent multi-shot scheduling maintains character consistency across dialogue and interactions.

Authorizations

string
header
required

DashScope API Key. Create one in the Qwen Cloud console.

Header Parameters

enum<string>
required

Must be enable to create an asynchronous task.

enable

Body

application/json
enum<string>
required

Model name.

wan2.6-r2v-flash,wan2.6-r2v
wan2.6-r2v-flash
object
required

Input data for reference-to-video generation.

object

Generation parameters for reference-to-video.

Response

200-application/json
string

Unique request identifier.

object