Creates a chat completion for the provided messages
A subscription variant of the endpoint is available at https://nano-gpt.com/api/subscription/v1/chat/completions (swap /api/v1 for /api/subscription/v1).

The /api/v1/chat/completions endpoint supports OpenAI-compatible function calling. You can describe callable functions in the tools array, control when the model may invoke them, and continue the conversation by echoing tool role messages that reference the assistant's chosen call.
- tools (optional array): Each entry must be { "type": "function", "function": { "name": string, "description"?: string, "parameters"?: JSON-Schema object } }. Only function tools are accepted. The serialized tools payload is limited to 200 KB (override via TOOL_SPEC_MAX_BYTES); violating the shape or size yields a 400 with tool_spec_too_large, invalid_tool_spec, or invalid_tool_spec_parse.
- tool_choice (optional string or object): Defaults to auto. Set "none" to guarantee no tool calls (the server also drops the tools payload upstream), "required" to force the next response to be a tool call, or { "type": "function", "function": { "name": "your_function" } } to pin the exact function.
- parallel_tool_calls (optional boolean): When true, the flag is forwarded to providers that support issuing multiple tool calls in a single turn. Models that ignore the flag fall back to sequential calls.
- messages[].tool_calls (assistant role): Persist the tool call metadata returned by the model so future turns can see which functions were invoked. Each item uses the OpenAI shape { id, type: "function", function: { name, arguments } }.
- messages[] with role: "tool": Respond to the model by sending { "role": "tool", "tool_call_id": "<assistant tool_calls id>", "content": "<JSON or text payload>" }. The server drops any tool response that references an unknown tool_call_id, so keep the IDs in sync.

If you send tool_choice: "none" together with a tools array, the request is accepted but the tools are omitted before hitting the model; invalid schemas or oversize payloads return the error codes above. Responses follow the standard OpenAI tool_calls schema, so consumers can reuse their existing parsing logic without changes.
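A minimal sketch of a tool-calling round trip. The get_weather function, its schema, and the NANOGPT_API_KEY environment variable are illustrative assumptions, not part of the API:

```python
import os, json, requests

API_URL = "https://nano-gpt.com/api/v1/chat/completions"
HEADERS = {"Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}"}  # assumed env var

# Describe a hypothetical callable function using the OpenAI tools shape.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

messages = [{"role": "user", "content": "What's the weather in Oslo?"}]
resp = requests.post(API_URL, headers=HEADERS, json={
    "model": "chatgpt-4o-latest",
    "messages": messages,
    "tools": tools,
    "tool_choice": "auto",
}).json()

assistant = resp["choices"][0]["message"]
if assistant.get("tool_calls"):
    call = assistant["tool_calls"][0]
    # Run your own implementation, then echo the result back with the same tool_call_id.
    result = {"city": "Oslo", "temp_c": 7}  # placeholder for a real lookup
    messages += [assistant, {
        "role": "tool",
        "tool_call_id": call["id"],
        "content": json.dumps(result),
    }]
    final = requests.post(API_URL, headers=HEADERS, json={
        "model": "chatgpt-4o-latest",
        "messages": messages,
        "tools": tools,
    }).json()
    print(final["choices"][0]["message"]["content"])
```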
The /api/v1/chat/completions endpoint accepts a full set of sampling and decoding knobs. All fields are optional; omit any you want to leave at provider defaults.
| Parameter | Range/Default | Description |
|---|---|---|
| temperature | 0–2 (default 0.7) | Classic randomness control; higher values explore more. |
| top_p | 0–1 (default 1) | Nucleus sampling that trims to the smallest set above top_p cumulative probability. |
| top_k | 1+ | Sample only from the top-k tokens each step. |
| top_a | provider default | Blends temperature and nucleus behavior; set only if a model calls for it. |
| min_p | 0–1 | Require each candidate token to exceed a probability floor. |
| tfs | 0–1 | Tail free sampling; 1 disables. |
| eta_cutoff / epsilon_cutoff | provider default | Drop tokens once they fall below the tail thresholds. |
| typical_p | 0–1 | Entropy-based nucleus sampling; keeps tokens whose surprise matches expected entropy. |
| mirostat_mode | 0/1/2 | Enable Mirostat sampling; set tau/eta when active. |
| mirostat_tau / mirostat_eta | provider default | Target entropy and learning rate for Mirostat. |
| Parameter | Range/Default | Description |
|---|---|---|
| max_tokens | 1+ (default 4000) | Upper bound on generated tokens. |
| min_tokens | 0+ (default 0) | Minimum completion length when provider supports it. |
| stop | string or string[] | Stop sequences passed upstream. |
| stop_token_ids | int[] | Stop generation on specific token IDs (limited provider support). |
| include_stop_str_in_output | boolean (default false) | Keep the stop sequence in the final text where supported. |
| ignore_eos | boolean (default false) | Continue even if the model predicts EOS internally. |
| Parameter | Range/Default | Description |
|---|---|---|
| frequency_penalty | -2 – 2 (default 0) | Penalize tokens proportional to prior frequency. |
| presence_penalty | -2 – 2 (default 0) | Penalize tokens based on whether they appeared at all. |
| repetition_penalty | -2 – 2 | Provider-agnostic repetition modifier; >1 discourages repeats. |
| no_repeat_ngram_size | 0+ | Forbid repeating n-grams of the given size (limited support). |
| custom_token_bans | int[] | Fully block listed token IDs. |
| Parameter | Range/Default | Description |
|---|---|---|
| logit_bias | object | Map token IDs to additive logits (OpenAI-compatible). |
| logprobs | boolean or int | Return token-level logprobs where supported. |
| prompt_logprobs | boolean | Request logprobs on the prompt when available. |
| seed | integer | Make completions repeatable where the provider allows it. |
These controls can be combined (for example temperature + top_p + top_k), but overly narrow settings may lead to early stops.
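A minimal sketch combining several samplers in one request; the exact values and the NANOGPT_API_KEY environment variable are illustrative, not recommendations:

```python
import os, requests

resp = requests.post(
    "https://nano-gpt.com/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}"},  # assumed env var
    json={
        "model": "chatgpt-4o-latest",
        "messages": [{"role": "user", "content": "Write a haiku about fjords."}],
        "temperature": 0.8,        # explore a bit more than the 0.7 default
        "top_p": 0.9,              # nucleus sampling
        "top_k": 40,               # cap the candidate pool per step
        "max_tokens": 200,
        "frequency_penalty": 0.3,  # discourage verbatim repeats
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```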
Web search can be enabled by appending a suffix to the model value or via the linkup object in the request body. If linkup.enabled is true, it takes precedence over any model suffix.

Append one of these suffixes to the model value:

- :online (default web search, standard depth)
- :online/linkup (Linkup, standard)
- :online/linkup-deep (Linkup, deep)
- :online/tavily (Tavily, standard)
- :online/tavily-deep (Tavily, deep)
- :online/exa-fast (Exa, fast)
- :online/exa-auto (Exa, auto)
- :online/exa-neural (Exa, neural)
- :online/exa-deep (Exa, deep)
- :online/kagi (Kagi, standard, search)
- :online/kagi-web (Kagi, standard, web)
- :online/kagi-news (Kagi, standard, news)
- :online/kagi-search (Kagi, deep, search)

:online without an explicit provider uses the default web search backend.
Alternatively, send a linkup object in the request body. This works with or without a model suffix and controls web search across all providers.
linkup fields:
- enabled (boolean, required to activate web search)
- provider (string): linkup | tavily | exa | kagi
- depth (string): standard or deep; Exa additionally accepts fast, auto, neural, deep (use standard if you want auto)
- search_context_size or searchContextSize (string): low | medium | high (default: medium)
- kagiSource or kagi_source (string, Kagi only): web | news | search
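A minimal sketch enabling web search through the linkup object; the chosen values and the NANOGPT_API_KEY environment variable are illustrative:

```python
import os, requests

resp = requests.post(
    "https://nano-gpt.com/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}"},  # assumed env var
    json={
        "model": "chatgpt-4o-latest",             # no :online suffix needed when linkup.enabled is true
        "messages": [{"role": "user", "content": "Summarize today's top AI news."}],
        "linkup": {
            "enabled": True,                      # required to activate web search
            "provider": "tavily",                 # linkup | tavily | exa | kagi
            "depth": "standard",
            "search_context_size": "medium",
        },
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```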
{"type":"image_url","image_url":{"url":"https://..."}}{"type":"image_url","image_url":{"url":"data:image/png;base64,...."}}image/png, image/jpeg, image/jpg, image/webp.) are auto‑normalized into structured parts server‑side....BASE64... with your image bytes.
Streaming responses emit data: { ... } lines until a final terminator. Usage metrics appear only when requested: set stream_options.include_usage to true for streaming responses, or send "include_usage": true on non-streaming calls.
Note: Prompt-caching helpers implicitly force include_usage, so cached requests still receive usage data without extra flags.
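A minimal sketch requesting usage on a non-streaming call (for streams, set stream_options.include_usage instead); the NANOGPT_API_KEY environment variable is an assumption:

```python
import os, requests

resp = requests.post(
    "https://nano-gpt.com/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}"},  # assumed env var
    json={
        "model": "chatgpt-4o-latest",
        "messages": [{"role": "user", "content": "Hello!"}],
        "include_usage": True,  # non-streaming form; streams use stream_options.include_usage
    },
)
data = resp.json()
print(data["choices"][0]["message"]["content"])
print(data.get("usage"))  # prompt/completion token counts
```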
Prompt caching follows the same model as Anthropic's /v1/messages: you must place cache_control objects on the content blocks you want the model to reuse, or instruct NanoGPT to do it for you via the prompt_caching helper.
Note: NanoGPT’s automatic failover system ensures high availability but may occasionally cause cache misses. If you’re seeing unexpected cache misses in your usage logs, see the “Cache Consistency with stickyProvider” section below.
The prompt_caching / promptCaching helper accepts these options:
| Parameter | Type | Default | Description |
|---|---|---|---|
| enabled | boolean | — | Enable prompt caching |
| ttl | string | "5m" | Cache time-to-live: "5m" or "1h" |
| cut_after_message_index | integer | — | Zero-based index; cache all messages up to and including this index |
| stickyProvider | boolean | false | New: When true, disable automatic failover to preserve cache consistency. Returns 503 error instead of switching services. |
- Each cache_control marker caches the full prefix up to that block. Place them on every static chunk (system messages, tool definitions, large contexts) you plan to reuse.
- Two TTLs are available: 5m (1.25× one-time billing) and 1h (2× one-time). Replays show the discounted tokens inside usage.prompt_tokens_details.cached_tokens.
- The anthropic-beta: prompt-caching-2024-07-31 header is mandatory on requests that include caching metadata.
- cut_after_message_index is zero-based and NanoGPT does not guess: everything at or before that index is cached, everything after is not. Switch back to explicit cache_control blocks if you need multiple cache breakpoints or mixed TTLs in the same payload.
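A sketch using the prompt_caching helper to cache a long static system prompt; the model choice and index are illustrative:

```python
import os, requests

resp = requests.post(
    "https://nano-gpt.com/api/v1/chat/completions",
    headers={
        "Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}",  # assumed env var
        "anthropic-beta": "prompt-caching-2024-07-31",               # required with caching metadata
    },
    json={
        "model": "claude-3-5-sonnet-20241022",
        "messages": [
            {"role": "system", "content": "...large static instructions..."},
            {"role": "user", "content": "First question about the document."},
        ],
        "prompt_caching": {
            "enabled": True,
            "ttl": "1h",                   # "5m" (default) or "1h"
            "cut_after_message_index": 0,  # cache everything up to and including message 0
        },
    },
)
print(resp.json().get("usage"))  # cached_tokens appears under prompt_tokens_details on replays
```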
Cache Consistency with stickyProvider

The stickyProvider option controls what happens when the primary service fails:
- stickyProvider: false (default) — If the primary service fails, NanoGPT automatically retries with a backup service. Your request succeeds, but the cache may be lost (you'll pay full price for that request and need to rebuild the cache).
- stickyProvider: true — If the primary service fails, NanoGPT returns a 503 error instead of failing over. Your cache remains intact for when the service recovers. A request using this mode is sketched below.
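A sketch opting into sticky routing to protect the cache; retry handling of the 503 is up to the caller and the helper function name here is illustrative:

```python
import os, time, requests

def cached_completion(messages):
    """Retry on 503 rather than letting the request fail over and lose the cache."""
    body = {
        "model": "claude-3-5-sonnet-20241022",
        "messages": messages,
        "prompt_caching": {
            "enabled": True,
            "cut_after_message_index": 0,
            "stickyProvider": True,  # return 503 instead of failing over
        },
    }
    headers = {
        "Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}",  # assumed env var
        "anthropic-beta": "prompt-caching-2024-07-31",
    }
    for attempt in range(3):
        resp = requests.post("https://nano-gpt.com/api/v1/chat/completions",
                             headers=headers, json=body)
        if resp.status_code != 503:
            return resp.json()
        time.sleep(2 ** attempt)  # primary provider is down; wait and retry
    raise RuntimeError("Primary provider unavailable; cache preserved but no completion.")
```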
To get usage data back, make sure include_usage is true in the payload or that prompt caching is enabled.

Memory

- Append :memory to any model name, or set memory: true in the request body.
- Suffixes compose; for example, :online:memory enables both web search and memory.
- Set an expiration with :memory-<days> (1..365) or the header memory_expiration_days: <days>; the header takes precedence.
- See also model_context_limit below.
model_context_limit (number or numeric string)

Thinking model suffixes (for example :thinking or -thinking:8192) are normalized before dispatch, but the response contract remains the same.
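A sketch enabling memory with a 30-day expiration via the model suffix; the body flag memory: true works as well, and the NANOGPT_API_KEY environment variable is an assumption:

```python
import os, requests

resp = requests.post(
    "https://nano-gpt.com/api/v1/chat/completions",
    headers={
        "Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}",  # assumed env var
        # Alternatively set expiration here; the header takes precedence over the suffix:
        # "memory_expiration_days": "30",
    },
    json={
        "model": "chatgpt-4o-latest:memory-30",  # remember this conversation for 30 days
        "messages": [{"role": "user", "content": "Remember that my favorite color is teal."}],
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```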
Three base URLs expose reasoning output in different shapes:

- https://nano-gpt.com/api/v1/chat/completions — default endpoint that streams internal thoughts through choices[0].delta.reasoning (and repeats them in message.reasoning on completion). Recommended for apps like SillyTavern that understand the modern response shape.
- https://nano-gpt.com/api/v1legacy/chat/completions — legacy contract that swaps the field name to choices[0].delta.reasoning_content / message.reasoning_content for older OpenAI-compatible clients. Use this for LiteLLM's OpenAI adapter to avoid downstream parsing errors.
- https://nano-gpt.com/api/v1thinking/chat/completions — reasoning-aware models write everything into the normal choices[0].delta.content stream so clients that ignore reasoning fields still see the full conversation transcript. This is the preferred base URL for JanitorAI.

Streaming responses place the assistant reply in choices[0].delta.content and the thought process in choices[0].delta.reasoning (plus optional delta.reasoning_details). Reasoning deltas are dispatched before or alongside regular content, letting you render both panes in real-time.
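A sketch of consuming the default endpoint's stream and separating reasoning from reply text; the terminator check follows the usual OpenAI-style convention and the model name is one of the examples from this page:

```python
import os, json, requests

resp = requests.post(
    "https://nano-gpt.com/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}"},  # assumed env var
    json={
        "model": "claude-3-7-sonnet-thinking:8192",
        "messages": [{"role": "user", "content": "Why is the sky blue?"}],
        "stream": True,
    },
    stream=True,
)

for line in resp.iter_lines():
    if not line or not line.startswith(b"data: "):
        continue
    payload = line[len(b"data: "):]
    if payload == b"[DONE]":          # assumed OpenAI-style stream terminator
        break
    chunk = json.loads(payload)
    if not chunk.get("choices"):
        continue
    delta = chunk["choices"][0].get("delta", {})
    if delta.get("reasoning"):
        print("[thinking]", delta["reasoning"], end="", flush=True)
    if delta.get("content"):
        print(delta["content"], end="", flush=True)
```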
choices[0].message.content contains the assistant reply and choices[0].message.reasoning (plus reasoning_details when available) contains the full chain-of-thought. Non-streaming requests reuse the same formatter, so the reasoning block is present as a dedicated field.
reasoning: { "exclude": true } to strip the reasoning payload from both streaming deltas and the final message. With this flag set, delta.reasoning and message.reasoning are omitted entirely.
reasoning_effort

| Value | Description |
|---|---|
| none | Disables reasoning entirely |
| minimal | Allocates ~10% of max_tokens for reasoning |
| low | Allocates ~20% of max_tokens for reasoning |
| medium | Allocates ~50% of max_tokens for reasoning (default when reasoning is enabled) |
| high | Allocates ~80% of max_tokens for reasoning |
The reasoning_effort parameter can be passed at the top level:
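For example (a sketch; the model name and prompt are illustrative):

```python
import os, requests

resp = requests.post(
    "https://nano-gpt.com/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['NANOGPT_API_KEY']}"},  # assumed env var
    json={
        "model": "claude-3-7-sonnet-thinking:8192",
        "messages": [{"role": "user", "content": "Plan a three-day trip to Bergen."}],
        "max_tokens": 4000,
        "reasoning_effort": "low",  # ~20% of max_tokens reserved for reasoning
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```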
It can also be nested inside the reasoning object:
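Equivalent nested form, as a sketch; the nested key name effort is an assumption based on the top-level parameter name, not confirmed by this page:

```python
body = {
    "model": "claude-3-7-sonnet-thinking:8192",
    "messages": [{"role": "user", "content": "Plan a three-day trip to Bergen."}],
    "max_tokens": 4000,
    "reasoning": {
        "effort": "low",      # assumed nested equivalent of reasoning_effort
        # "exclude": True,    # optionally strip reasoning from the response
    },
}
```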
Passing reasoning_effort to models that don't support reasoning has no effect (the parameter is ignored).
:reasoning-exclude suffix

Append :reasoning-exclude to the model name to hide reasoning output.
{ "reasoning": { "exclude": true } }:reasoning-exclude suffix is stripped before the request is routed; other suffixes remain active:reasoning-exclude composes safely with the other routing suffixes you already use:
- :thinking (and variants like …-thinking:8192)
- :online and :online/linkup-deep
- :memory and :memory-<days>

Examples:

- claude-3-7-sonnet-thinking:8192:reasoning-exclude
- gpt-4o:online:reasoning-exclude
- claude-3-5-sonnet-20241022:memory-30:online/linkup-deep:reasoning-exclude

Clients that expect the legacy reasoning_content field can opt in per request. Set reasoning.delta_field to "reasoning_content", or use the top-level shorthands reasoning_delta_field / reasoning_content_compat if updating nested objects is difficult. When the toggle is active, every streaming and non-streaming response exposes reasoning_content instead of reasoning, and the modern key is omitted. The compatibility pass is skipped if reasoning.exclude is true, because no reasoning payload is emitted. If you cannot change the request payload, target https://nano-gpt.com/api/v1legacy/chat/completions instead; the legacy endpoint keeps reasoning_content without extra flags. LiteLLM's OpenAI adapter should point here to maintain compatibility. For clients that ignore reasoning-specific fields entirely, use https://nano-gpt.com/api/v1thinking/chat/completions so the full text appears in the standard content stream; this is the correct choice for JanitorAI.
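A sketch opting into the legacy field name on the default endpoint; the delta_field value is quoted from the text above and the rest of the body is illustrative:

```python
body = {
    "model": "claude-3-7-sonnet-thinking:8192",
    "messages": [{"role": "user", "content": "Explain quicksort."}],
    "stream": True,
    "reasoning": {"delta_field": "reasoning_content"},  # responses expose reasoning_content, not reasoning
    # Top-level shorthand alternative if nested objects are awkward to set:
    # "reasoning_delta_field": "reasoning_content",
}
```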
Some models (phala/*) require byte-for-byte SSE passthrough for signature verification. For those models, streaming cannot be filtered; the suffix has no effect on the streaming bytes.

youtube_transcripts (boolean), default false (opt-in). Set youtube_transcripts to true (the string "true" is also accepted) to fetch transcripts. Only set it to true when you want the system to retrieve and bill for transcripts.
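A sketch enabling transcript fetching for a linked video; the URL is a placeholder:

```python
body = {
    "model": "chatgpt-4o-latest",
    "messages": [{
        "role": "user",
        "content": "Summarize this talk: https://www.youtube.com/watch?v=VIDEO_ID",  # placeholder URL
    }],
    "youtube_transcripts": True,  # opt in; transcript retrieval is billed
}
```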
URL scraping is requested separately with scraping: true; YouTube transcripts do not require scraping: true.

Web search provider scores:

| Provider | Score |
|---|---|
| LinkUp Deep Search | 90.10% |
| Exa | 90.04% |
| Perplexity Sonar Pro | 86% |
| LinkUp Standard Search | 85% |
| Perplexity Sonar | 77% |
| Tavily | 73% |
Select a provider with the model suffix or the linkup object.

Authorization: Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Parameters for chat completion
model
The model to use for completion. Append ':online' for web search ($0.005/request) or ':online/linkup-deep' for deep web search ($0.05/request). Examples:
"chatgpt-4o-latest"
"chatgpt-4o-latest:online"
"chatgpt-4o-latest:online/linkup-deep"
"claude-3-5-sonnet-20241022:online"
messages
Array of message objects with role and content

stream
Whether to stream the response

temperature
Classic randomness control. Accepts any decimal between 0-2. Lower numbers bias toward deterministic responses, higher values explore more aggressively
Range: 0 <= x <= 2

max_tokens
Upper bound on generated tokens
Range: x >= 1

top_p
Nucleus sampling. When set below 1.0, trims candidate tokens to the smallest set whose cumulative probability exceeds top_p. Works well as an alternative to tweaking temperature
Range: 0 <= x <= 1

frequency_penalty
Penalizes tokens proportionally to how often they appeared previously. Negative values encourage repetition; positive values discourage it
Range: -2 <= x <= 2

presence_penalty
Penalizes tokens based on whether they appeared at all. Good for keeping the model on topic without outright banning words
Range: -2 <= x <= 2

repetition_penalty
Provider-agnostic repetition modifier (distinct from OpenAI penalties). Values >1 discourage repetition
Range: -2 <= x <= 2

top_k
Caps sampling to the top-k highest probability tokens per step

top_a
Combines top-p and temperature behavior; leave unset unless a model description explicitly calls for it

min_p
Ensures each candidate token probability exceeds a floor (0-1). Helpful for stopping models from collapsing into low-entropy loops
Range: 0 <= x <= 1

tfs
Tail free sampling. Values between 0-1 let you shave the long tail of the distribution; 1.0 disables the feature
Range: 0 <= x <= 1

eta_cutoff
Cut probabilities as soon as they fall below the specified tail threshold

epsilon_cutoff
Cut probabilities as soon as they fall below the specified tail threshold

typical_p
Typical sampling (aka entropy-based nucleus). Works like top_p but preserves tokens whose surprise matches the expected entropy
Range: 0 <= x <= 1

mirostat_mode
Enables Mirostat sampling for models that support it. Set to 1 or 2 to activate
Allowed values: 0, 1, 2

mirostat_tau
Mirostat target entropy parameter. Used when mirostat_mode is enabled

mirostat_eta
Mirostat learning rate parameter. Used when mirostat_mode is enabled

min_tokens
For providers that support it, enforces a minimum completion length before stop conditions fire
Range: x >= 0

stop
Stop sequences. Accepts string or array of strings. Values are passed directly to upstream providers

stop_token_ids
Numeric array that lets callers stop generation on specific token IDs. Not supported by many providers

include_stop_str_in_output
When true, keeps the stop sequence in the final text. Not supported by many providers

ignore_eos
Allows completions to continue even if the model predicts EOS internally. Useful for long creative writing runs

no_repeat_ngram_size
Extension that forbids repeating n-grams of the given size. Not supported by many providers
Range: x >= 0

custom_token_bans
List of token IDs to fully block

logit_bias
Object mapping token IDs to additive logits. Works just like OpenAI's version

logprobs
When true or a number, forwards the request to providers that support returning token-level log probabilities

prompt_logprobs
Requests logprobs on the prompt itself when the upstream API allows it

seed
Numeric seed. Wherever supported, passes the value to make completions repeatable

cut_after_message_index
Helper to tag the leading messages for Claude prompt caching. NanoGPT injects cache_control blocks on each message up to the specified index before forwarding to Anthropic.