OpenAI GPT API and Anthropic Claude API response time tracker
The charts below track the response times of the main large language model APIs : OpenAI (gpt-4o, gpt-4-turbo, gpt-4, gpt-3.5-turbo) and Anthropic Claude (claude-3-5-sonnet).
The response times are measured by generating a maximum of 512 tokens with a randomized prompt every 10 minutes in 3 locations. The maximum response time is capped at 60 seconds but could be higher in reality.
OpenAI GPT APIs
Anthropic Claude APIs
How to get a faster response time?
- Choose a model with a faster response time
- Try again outside of peak hours
- Break down your executions into smaller ones
We are not affiliated with OpenAI or Anthropic.
Please refer to their official status pages for official information:
https://status.openai.com/
https://status.anthropic.com/