Rate Limits
Information about API rate limits
The NanoGPT API currently imposes a limit of 25 requests per second. If you need a higher limit, reach out to us at support@nano-gpt.com and let us know which model(s) you use and roughly what capacity you need.
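To stay under the 25 requests per second limit on the client side, one option is to throttle outgoing calls before they are sent. The sketch below is a minimal sliding-window limiter, not an official NanoGPT client feature. The class name `SlidingWindowLimiter` and its parameters are hypothetical, and the code assumes a single-threaded caller (it is not thread-safe).

```python
import time
from collections import deque

MAX_REQUESTS_PER_SECOND = 25  # limit stated in this documentation


class SlidingWindowLimiter:
    """Hypothetical helper: lets at most `max_calls` proceed in any `period` seconds."""

    def __init__(self, max_calls=MAX_REQUESTS_PER_SECOND, period=1.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self._clock = clock      # injectable so the limiter can be tested without waiting
        self._sleep = sleep
        self._stamps = deque()   # start times of recent calls, oldest first

    def acquire(self):
        """Block until a request slot is free, then record the call."""
        while True:
            now = self._clock()
            # Forget calls that have already left the window.
            while self._stamps and now - self._stamps[0] >= self.period:
                self._stamps.popleft()
            if len(self._stamps) < self.max_calls:
                self._stamps.append(now)
                return
            # Window is full: wait until the oldest recorded call expires.
            self._sleep(self.period - (now - self._stamps[0]))
```

Create one limiter per process and call `limiter.acquire()` immediately before each API request. The first 25 calls in any one-second window pass straight through, and later calls wait until a slot frees up.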
If your use case requires more than 1 billion tokens daily of a specific model, reach out to us at support@nano-gpt.com. That volume should be no problem, and we might be able to offer a discount.
If these rate limits change in the future, this documentation will be updated accordingly.