# Chat API

## Introduction
This is the foundational dialogue API: it generates AI assistant responses from user input. Through this single interface you can access all LinkAI capabilities:
- Support for binding applications or workflows to leverage their underlying knowledge bases and plugins
- One-click switching between all supported large language models
- Support for both streaming/non-streaming output, with OpenAI-compatible interface structure
- Support for multimodal input/output, allowing text and image inputs; text, image, video, and file outputs
## API Definition

### Endpoint

```
POST https://api.linkai.cloud/v1/chat/completions
```

### Request Headers
| Parameter | Value | Description |
| --- | --- | --- |
| Authorization | Bearer YOUR_API_KEY | Create an API Key following the API Authentication guide |
| Content-Type | application/json | Indicates a JSON-format request body |
### Request Body
| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| messages | list<object> | Yes | Message context list; each element has the structure `{"role": "user", "content": "Hello"}`. The `role` field can be "system", "user", or "assistant", and `content` cannot be empty |
| app_code | string | No | Code of the application or workflow. If omitted, the request is sent directly to the model without binding to a specific application |
| model | string | No | Model code. If not provided, the application's default model is used. See the Model List for all available models |
| temperature | float | No | Controls randomness. Range [0, 1]; higher values produce more creative responses, lower values more deterministic ones |
| top_p | float | No | Controls the sampling range (nucleus sampling). Default is 1 |
| frequency_penalty | float | No | Discourages repetition. Range [-2, 2], default 0 |
| presence_penalty | float | No | Encourages diversity. Range [-2, 2], default 0 |
| stream | bool | No | Whether to stream the output. Default is false |
Note:
- When specifying an application via `app_code`, the system uses the application's settings as the system prompt, the application's configured default model, and the application's temperature as the `temperature` value.
- When specifying a workflow via `app_code`, the workflow executes from the start node, and the output of the end node is returned through the interface.
Request example:
```json
{
  "app_code": "G7z6vKwp",
  "messages": [
    { "role": "user", "content": "Hello" }
  ]
}
```
Note: Replace `app_code` with your own application code or a public application code from the application marketplace, or omit it to use the underlying model capabilities directly.
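For reference, a request that sets the optional sampling parameters explicitly might look like the following. This is an illustrative sketch: the `app_code` is the placeholder from the example above, and the parameter values are arbitrary choices within the documented ranges:

```json
{
  "app_code": "G7z6vKwp",
  "temperature": 0.7,
  "top_p": 1,
  "frequency_penalty": 0,
  "presence_penalty": 0,
  "stream": false,
  "messages": [
    { "role": "system", "content": "You are a helpful assistant." },
    { "role": "user", "content": "Hello" }
  ]
}
```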
## Response

### Non-Streaming Response
By default, the interface returns all content at once after generation is complete:
```json
{
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I assist you today?"
      }
    }
  ],
  "usage": {
    "prompt_tokens": 9,
    "completion_tokens": 17,
    "total_tokens": 26
  }
}
```
Note: `choices.message.content` contains the AI's response. The `usage` section shows `prompt_tokens`, `completion_tokens`, and `total_tokens`, representing the token counts for the request, the response, and the total consumption respectively. A conversation's token calculation includes both request and response tokens; the request includes application settings, conversation history, knowledge base content, and the user's question. These token limits can be configured in Application Management.
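As a quick illustration of reading these fields, here is a minimal sketch that works directly on the parsed response; the `data` dict below stands in for `res.json()` from the Python example further down:

```python
# `data` mirrors the non-streaming response shown above.
data = {
    "choices": [
        {"index": 0, "message": {"role": "assistant", "content": "Hello! How can I assist you today?"}}
    ],
    "usage": {"prompt_tokens": 9, "completion_tokens": 17, "total_tokens": 26},
}

reply = data["choices"][0]["message"]["content"]
usage = data["usage"]
print(reply)
# total_tokens is the sum of the request and response token counts: 9 + 17 = 26
print(f"tokens used: {usage['prompt_tokens']} + {usage['completion_tokens']} = {usage['total_tokens']}")
```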
### Streaming Response

To enable streaming, set the `stream` parameter to true. Content is then returned in real time as the model generates it, which suits web pages, apps, and mini-programs:
```
data: {"choices": [{"index": 0, "delta": {"content": "Hello!"}, "finish_reason": null}], "session_id": null}

data: {"choices": [{"index": 0, "delta": {"content": " How"}, "finish_reason": null}], "session_id": null}

data: {"choices": [{"index": 0, "delta": {"content": " can"}, "finish_reason": null}], "session_id": null}

data: {"choices": [{"index": 0, "delta": {"content": " I"}, "finish_reason": null}], "session_id": null}

data: {"choices": [{"index": 0, "delta": {"content": " help"}, "finish_reason": null}], "session_id": null}

data: {"choices": [{"index": 0, "delta": {"content": " you?"}, "finish_reason": null}], "session_id": null}

data: {"choices": [{"index": 0, "delta": {}, "finish_reason": "stop", "usage": {"prompt_tokens": 9, "completion_tokens": 6, "total_tokens": 15}}], "session_id": null}

data: [DONE]
```

Note: The output `[DONE]` indicates the end of the stream.
### Error Responses
When an exception occurs, the API returns the following structure:
```json
{
  "error": {
    "message": "Invalid request: user message content is empty",
    "type": "invalid_request_error"
  }
}
```
Error types are determined by HTTP status codes and error messages:
| HTTP Status Code | Description |
| --- | --- |
| 400 | Request format error |
| 401 | Authentication failed; check that your API Key is correct |
| 402 | Application does not exist; check that the app_code parameter is correct |
| 403 | No access permission; for private applications, only the creator account has access |
| 406 | Insufficient account credits |
| 409 | Content moderation failed; the question, answer, or knowledge base may contain sensitive content |
| 503 | Interface call exception; contact customer service |
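As an illustration of acting on these codes, a caller might treat 503 as transient and everything else as fatal. This is a minimal sketch, not an official client; the retry policy is an assumption for illustration:

```python
import time

import requests

URL = "https://api.linkai.cloud/v1/chat/completions"
HEADERS = {
    "Authorization": "Bearer YOUR_API_KEY",
    "Content-Type": "application/json",
}

def chat(body: dict, retries: int = 2) -> dict:
    """POST a chat request, retrying on 503 and surfacing other errors."""
    for attempt in range(retries + 1):
        res = requests.post(URL, json=body, headers=HEADERS)
        if res.status_code == 200:
            return res.json()
        if res.status_code == 503 and attempt < retries:
            # Interface call exception: back off and retry (policy is an assumption).
            time.sleep(2 ** attempt)
            continue
        # 400/401/402/403/406/409 indicate a problem with the request or account;
        # retrying will not help, so surface the error to the caller.
        error = res.json().get("error", {})
        raise RuntimeError(
            f"{res.status_code} {error.get('type')}: {error.get('message')}"
        )

reply = chat({"messages": [{"role": "user", "content": "Hello"}]})
print(reply["choices"][0]["message"]["content"])
```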
## Model List
The complete list of supported models is available on the Model Management page:
| Model Code | Context Length | Description |
| --- | --- | --- |
| gpt-4.1 | 1000K | OpenAI 4.1 model |
| gpt-4.1-mini | 1000K | OpenAI 4.1 mini model |
| gpt-4.1-nano | 1000K | OpenAI 4.1 nano model |
| gpt-3.5 | 16K | OpenAI 3.5 model |
| gpt-4o-mini | 128K | OpenAI 4o-mini model |
| gpt-4o | 128K | OpenAI 4o model |
| gpt-4-turbo | 128K | OpenAI 4-turbo model |
| gpt-4 | 8K | OpenAI 4.0 model |
| o1-mini | 128K | Optimized for code, math, and reasoning scenarios |
| o1-preview | 128K | Optimized for complex reasoning tasks |
| claude-3-7-sonnet | 200K | Claude 3.7 model |
| claude-3-5-sonnet | 200K | Claude 3.5 model |
| claude-3-haiku | 200K | Claude 3 Haiku |
| claude-3-sonnet | 200K | Claude 3 Sonnet |
| claude-3-opus | 200K | Claude 3 Opus |
| gemini-2.5-pro | 1000K | Gemini 2.5 Pro |
| gemini-2.0-flash | 1000K | Gemini 2.0 Flash |
| gemini-1.5-flash | 1000K | Gemini 1.5 Flash |
| gemini-1.5-pro | 1000K | Gemini 1.5 Pro |
| deepseek-chat | 64K | DeepSeek-V3 conversation model |
| deepseek-reasoner | 64K | DeepSeek-R1 model, returns the thinking process |
| qwen3 | 128K | Qwen 3 |
| qwen-turbo | 8K | Qwen Turbo |
| qwen-plus | 32K | Qwen Plus |
| qwen-max | 8K | Qwen Max |
To use a model, pass its code in the `model` parameter. We recommend omitting the `model` parameter so that the default model configured in your application is used. For pricing, see Billing Rules.
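For example, a request that pins a specific model from the table above would look like this (the `app_code` is the placeholder from the earlier example, and the model code is one arbitrary choice from the list):

```json
{
  "app_code": "G7z6vKwp",
  "model": "gpt-4o-mini",
  "messages": [
    { "role": "user", "content": "Hello" }
  ]
}
```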
## Example Code

### Text Dialogue

#### 1. cURL Request

Non-streaming:
```bash
curl --request POST \
  --url https://api.linkai.cloud/v1/chat/completions \
  --header 'Authorization: Bearer YOUR_API_KEY' \
  --header 'Content-Type: application/json' \
  --data '{
    "app_code": "",
    "messages": [
      {
        "role": "user",
        "content": "Who are you?"
      }
    ]
  }'
```
Streaming:

```bash
curl --request POST \
  --url https://api.linkai.cloud/v1/chat/completions \
  --header 'Authorization: Bearer YOUR_API_KEY' \
  --header 'Content-Type: application/json' \
  --data '{
    "app_code": "",
    "messages": [
      {
        "role": "user",
        "content": "Who are you?"
      }
    ],
    "stream": true
  }'
```
Note: Replace `YOUR_API_KEY` with your own API Key and fill in your application code in `app_code`.
#### 2. Python Request

Non-streaming:
```python
import requests

url = "https://api.linkai.cloud/v1/chat/completions"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer YOUR_API_KEY"
}
body = {
    "app_code": "",
    "messages": [
        {
            "role": "user",
            "content": "Hello"
        }
    ]
}
res = requests.post(url, json=body, headers=headers)
if res.status_code == 200:
    reply_text = res.json()["choices"][0]["message"]["content"]
    print(reply_text)
else:
    error = res.json().get("error")
    print(f"Request error, status code={res.status_code}, error type={error.get('type')}, error message={error.get('message')}")
```
Streaming:

```python
import json

import requests

url = "https://api.linkai.cloud/v1/chat/completions"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer YOUR_API_KEY"
}
body = {
    "app_code": "",
    "messages": [
        {
            "role": "user",
            "content": "Write a design document for a login module"
        }
    ],
    "stream": True,
}
res = requests.post(url, json=body, headers=headers, stream=True)
for i in res.iter_lines():
    st = i.decode("utf-8")
    st = st.replace("data: ", "", 1)  # strip the SSE "data: " prefix
    if st:
        if st == "[DONE]":  # output ended
            break
        chunk = json.loads(st)
        if not chunk.get("choices"):
            continue
        chunk_message = chunk["choices"][0]["delta"].get("content")
        if chunk_message:
            print(chunk_message, end="")  # print each data segment as it arrives
```
Note:
- Replace `YOUR_API_KEY` with your own API Key and fill in your application code in `app_code`.
- If you are using the OpenAI SDK, you can integrate quickly by changing the `api_base` configuration. See OpenAI Compatibility for details.
### Image Recognition
Users can upload images and ask questions about them. Prerequisites:
- For application integration: The "Image Recognition" plugin must be enabled in the application
- For workflow integration: The workflow must use the "Image Recognition" plugin
```bash
curl https://api.linkai.cloud/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "app_code": "default",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "What is shown in this image?"
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "https://cdn.linkai.cloud/docs/vision-model-config.jpg"
            }
          }
        ]
      }
    ]
  }'
```
Note:
- Replace `YOUR_API_KEY` with your own API Key and replace the `app_code` value with your application or workflow code.
- The image URL must be a publicly accessible image address.
- Image editing calls work the same way as image recognition but require the GPT-Image-1 or AI Image Editing plugin. When you provide an image URL, the response includes the URL of the generated image.
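If you prefer Python over curl, the same multimodal request might look like the following; this is a minimal sketch mirroring the curl call above, using the sample `app_code` and image URL from that example:

```python
import requests

url = "https://api.linkai.cloud/v1/chat/completions"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer YOUR_API_KEY",
}
# content is a list mixing a text part and an image_url part,
# matching the multimodal message structure shown in the curl example.
body = {
    "app_code": "default",
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What is shown in this image?"},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://cdn.linkai.cloud/docs/vision-model-config.jpg"},
                },
            ],
        }
    ],
}
res = requests.post(url, json=body, headers=headers)
print(res.json()["choices"][0]["message"]["content"])
```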
## OpenAI Compatibility
This interface is fully compatible with OpenAI's input and output formats, so you can use the OpenAI SDK directly by simply setting the `api_base` and `api_key`:
1.x version:

```python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.linkai.cloud/v1",
    api_key="YOUR_API_KEY"
)
```

0.x version:

```python
import openai

openai.api_base = "https://api.linkai.cloud/v1"
openai.api_key = "YOUR_API_KEY"
```
If you need to specify an application while using the OpenAI SDK, append the `app_code` to the api_key with a "-" separator, for example: `Link_tOCJYmHxxm55eA1xs-Kv2fXJcH2`.
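Putting the pieces together, a complete call through the 1.x SDK might look like the following minimal sketch. The key here reuses the sample key-plus-`app_code` value from the note above as a placeholder, and the model code is an arbitrary choice from the Model List:

```python
from openai import OpenAI

# base_url points the SDK at LinkAI; the "-Kv2fXJcH2" suffix on the key
# appends an app_code, per the note above (both values are placeholders).
client = OpenAI(
    base_url="https://api.linkai.cloud/v1",
    api_key="Link_tOCJYmHxxm55eA1xs-Kv2fXJcH2",
)

completion = client.chat.completions.create(
    model="gpt-4o-mini",  # the bound application's default applies if the request omits a model
    messages=[{"role": "user", "content": "Hello"}],
)
print(completion.choices[0].message.content)
```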