Embedding & reranking models

Text embedding

Text-only search, RAG, or clustering → text-embedding-v4. Migrating an existing v3 index → text-embedding-v3 (compatible dimensions).

How many dimensions?

Large-scale search where storage matters → 256 or 512. General use → 1024 (default, good balance). Maximum accuracy on benchmarks → 1536 or 2048.

Multimodal embedding

Search images or videos by text query → tongyi-embedding-vision-plus.

Text-only data?

Use text-embedding-v4 instead — faster, cheaper, more dimension options. Multimodal embedding is for cross-modal retrieval (text↔image, text↔video).

Accuracy vs speed

Best accuracy → tongyi-embedding-vision-plus (dimensions up to 1152). Budget or latency-sensitive → tongyi-embedding-vision-flash (up to 768).

Reranking

Improve RAG precision → add qwen3-rerank after your embedding search. Re-scores top-N results with cross-attention for better ranking quality. Limits: 500 documents per request, 4,000 tokens per item, 30,000 tokens per request.

Migrate from closed-source models

Replacing OpenAI, Cohere, or Voyage embeddings? Use these Qwen Cloud equivalents.

Task	Closed-source examples	Qwen Cloud recommendation
Text embedding	OpenAI text-embedding-3-large, Voyage-3-large, Cohere embed-v4	`text-embedding-v4`
Multimodal embedding	Cohere embed-v4, Voyage multimodal	`tongyi-embedding-vision-plus`
Reranking	Cohere Rerank 3.5, Voyage rerank-2.5	`qwen3-rerank`

All models

Model	Use this when...	Dimensions	Max tokens
`text-embedding-v4`	Text search, RAG, clustering	64, 128, 256, 512, 768, 1024 (default), 1536, 2048	8,192
`text-embedding-v3`	Existing v3 index migration	512, 768, 1024 (default)	8,192
`tongyi-embedding-vision-plus`	Cross-modal search, best accuracy	1152, 1024, 512, 256, 128, 64	1,024
`tongyi-embedding-vision-flash`	Cross-modal search, budget	768, 512, 256, 128, 64	1,024
`qwen3-rerank`	Re-rank search results	—	4,000/item

Embedding & reranking models

Text embedding

How many dimensions?

Multimodal embedding

Text-only data?

Accuracy vs speed

Reranking

Migrate from closed-source models

All models

Learn more

Text embedding guide

Multimodal embedding API

​Text embedding

​How many dimensions?

​Multimodal embedding

​Text-only data?

​Accuracy vs speed

​Reranking

​Migrate from closed-source models

​All models

​Learn more

Text embedding guide

Multimodal embedding API

Text embedding

How many dimensions?

Multimodal embedding

Text-only data?

Accuracy vs speed

Reranking

Migrate from closed-source models

All models

Learn more