:online
- Standard web search ($0.006 per request)
:online/linkup-deep
- Deep web search ($0.06 per request)
:memory
to any model namememory: true
:online:memory
:memory-<days>
(1..365) or header memory_expiration_days: <days>
; header takes precedencemodel_context_limit
.
model_context_limit
(number or numeric string)<think>
tags) from responses using the optional reasoning
parameter:
reasoning.exclude
is set to true
, the API removes content between <think>
and </think>
tags before returning the response. This works for both streaming and non-streaming requests.
"<think>The user is asking for a simple addition. 2+2 equals 4.</think>\n\nThe answer is 4."
With reasoning.exclude: "The answer is 4."
:reasoning-exclude
:reasoning-exclude
to the model
string.
{ "reasoning": { "exclude": true } }
:reasoning-exclude
is stripped before provider routing; all other suffixes remain active:reasoning-exclude
with existing suffixes; only :reasoning-exclude
is removed prior to provider routing:
:thinking
(and variants like …-thinking:8192
):online
and :online/linkup-deep
:memory
and :memory-<days>
claude-3-7-sonnet-thinking:8192:reasoning-exclude
gpt-4o:online:reasoning-exclude
claude-3-5-sonnet-20241022:memory-30:online/linkup-deep:reasoning-exclude
phala/*
) require byte-for-byte SSE passthrough for signature verification. For those models, streaming cannot be filtered; the suffix has no effect on the streaming bytes.Provider | Score |
---|---|
LinkUp Deep Search | 90.10% |
Exa | 90.04% |
Perplexity Sonar Pro | 86% |
LinkUp Standard Search | 85% |
Perplexity Sonar | 77% |
Tavily | 73% |
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Parameters for chat completion
The body is of type object
.
Chat completion response
The response is of type object
.