OpenAI GPT API, Anthropic Claude API and Gemini API response time tracker

The charts below track the response times of the main large language model APIs: OpenAI GPT, Anthropic Claude and Gemini.

The response times are measured by generating a maximum of 512 tokens with a randomized prompt every 10 minutes in 3 locations. The maximum response time is capped at 60 seconds but could be higher in reality.