Skip to main content
Reranking

DashScope reranking

DashScope reranking API

POST
/services/rerank/text-rerank/text-rerank
curl --request POST \
  --url https://dashscope-intl.aliyuncs.com/api/v1/services/rerank/text-rerank/text-rerank \
  --header "Authorization: Bearer $DASHSCOPE_API_KEY" \
  --header "Content-Type: application/json" \
  --data '{
  "model": "qwen3-rerank",
  "input": {
    "query": "What is a rerank model?",
    "documents": [
      "Rerank models are widely used in search engines and recommendation systems. They sort candidate documents based on text relevance.",
      "Quantum computing is a cutting-edge field of computer science.",
      "The development of pre-trained language models has brought new advancements to rerank models."
    ]
  },
  "parameters": {
    "return_documents": true,
    "top_n": 2
  }
}'
{
  "output": {
    "results": [
      {
        "document": {
          "text": "<string>"
        },
        "index": 0,
        "relevance_score": 0.9334521178273196
      }
    ]
  },
  "usage": {
    "total_tokens": 0
  },
  "request_id": "85ba5752-1900-47d2-8896-23f99b13f6e1"
}
Rerank documents by semantic relevance to a query using qwen3-rerank. Uses a nested request structure with input and parameters wrappers.

Endpoint

  • HTTP: POST https://dashscope-intl.aliyuncs.com/api/v1/services/rerank/text-rerank/text-rerank
  • SDK base_http_api_url: https://dashscope-intl.aliyuncs.com/api/v1

Model overview

ModelMax DocumentsMax Tokens/DocMax Request TokensLanguagesPrice (per 1M tokens)Free QuotaUse Cases
qwen3-rerank5004,000120,000100+ languages$0.11M tokens (valid for 90 days)Text semantic search, RAG
Parameter definitions:
  • Max Tokens/Doc: Maximum token count per query or document. Content exceeding this limit is truncated, which may affect ranking accuracy.
  • Max Documents: Maximum number of documents per request.
  • Max Request Tokens: Calculated as Query Tokens x Document Count + Total Document Tokens. Must not exceed the limit.

Authorizations

string
header
required

Qwen Cloud API Key. Create one in the console.

Body

application/json
enum<string>
required

Model name. Must be qwen3-rerank.

qwen3-rerank
qwen3-rerank
object
required

Input data containing the query and documents to rank.

object

Optional parameters for the reranking request. The parameters wrapper is optional — top_n and instruct can be placed at the same level as model.

Response

200-application/json
object

Output wrapper containing ranked results.

object

Token usage statistics.

string

Unique request identifier.

85ba5752-1900-47d2-8896-23f99b13f6e1