OpenAI GPT API and Anthropic Claude API response time tracker

The charts below track the response times of the main large language model APIs : OpenAI (gpt-4o, gpt-4-turbo, gpt-4, gpt-3.5-turbo) and Anthropic Claude (claude-3-5-sonnet).

The response times are measured by generating a maximum of 512 tokens with a randomized prompt every 10 minutes in 3 locations. The maximum response time is capped at 60 seconds but could be higher in reality.

OpenAI GPT APIs

GPT for Work

Anthropic Claude APIs

GPT for Work

How to get a faster response time?

  • Choose a model with a faster response time
  • Try again outside of peak hours
  • Break down your executions into smaller ones


We are not affiliated with OpenAI or Anthropic.
Please refer to their official status pages for official information:
https://status.openai.com/
https://status.anthropic.com/