Skip to content

Rerank

POST https://www.kkiai.com/v1/rerank

Given a prompt, the model will return one or more predicted completions, and can also return the probability of alternative tokens at each position.

Create a completion for the provided prompt and parameters

Official documentation: https://docs.siliconflow.cn/cn/api-reference/rerank/create-rerank

Request Parameters

Authorization

Add the Authorization parameter to the Header, with its value being the Token concatenated after Bearer

Example: Authorization: Bearer ********************

Header Parameters

Parameter NameTypeRequiredDescriptionExample
Content-TypestringRequiredapplication/json
AcceptstringRequiredapplication/json
AuthorizationstringOptionalBearer {{YOUR_API_KEY}}
X-Forwarded-HoststringOptionallocalhost:5173

Body Parameters (application/json)

Parameter NameTypeRequiredDescription
modelstringRequired
documentsarray[string]Required
querystringRequired
top_nintegerRequired
instructstringRequired

Request Example

json
{
  "model": "qwen3-rerank",
  "documents": [
    "Text ranking models are widely used in search engines and recommendation systems. They rank candidate passages based on their relevance to a query.",
    "Quantum computing is an advanced field of computer science that studies computation using quantum-mechanical principles.",
    "The development of pretrained language models has brought significant improvements to text ranking models."
  ],
  "query": "What is a text ranking model?",
  "top_n": 2,
  "instruct": "Given a web search query, retrieve relevant passages that answer the query."
}

cURL Example

bash
curl --location --request POST 'https://www.kkiai.com/v1/rerank' \
--header 'Accept: application/json' \
--header 'Authorization: Bearer <YOUR_API_KEY>' \
--header 'Content-Type: application/json' \
--data-raw '{
  "model": "qwen3-rerank",
  "documents": [
    "Text ranking models are widely used in search engines and recommendation systems. They rank candidate passages based on their relevance to a query.",
    "Quantum computing is an advanced field of computer science that studies computation using quantum-mechanical principles.",
    "The development of pretrained language models has brought significant improvements to text ranking models."
  ],
  "query": "What is a text ranking model?",
  "top_n": 2,
  "instruct": "Given a web search query, retrieve relevant passages that answer the query."
}'

Response

🟢 200 OK

Response Body

Parameter NameTypeRequiredDescription
idstringRequired
objectstringRequired
createdintegerRequired
choicesarray[object]Required
  └ indexintegerOptional
  └ messageobjectOptional
  └ finish_reasonstringOptional
usageobjectRequired
  └ prompt_tokensintegerRequired
  └ completion_tokensintegerRequired
  └ total_tokensintegerRequired

Response Example

json
{
    "id": "chatcmpl-123",
    "object": "chat.completion",
    "created": 1677652288,
    "choices": [
        {
            "index": 0,
            "message": {
                "role": "assistant",
                "content": "\n\nHello there, how may I assist you today?"
            },
            "finish_reason": "stop"
        }
    ],
    "usage": {
        "prompt_tokens": 9,
        "completion_tokens": 12,
        "total_tokens": 21
    }
}