LLM Leaderboard: Compare & Check Latest API Prices for LLMs

Compare and check the latest prices for LLM (Large Language Model) APIs from leading providers such as OpenAI, Mistral, Anthropic, Google, Meta, Perplexity, and more. Evaluate and rank the performance of more than 50 LLMs across key metrics, including quality, context window, price, and knowledge cutoff. This in-depth comparison helps users identify the LLM best suited to their specific needs and budget.

LLM Leaderboard Highlights:

Quality: The highest-quality models are GPT-4o and Llama 3.1 405B (both scoring 100). These are followed by Claude 3.5 Sonnet (98), then Gemini 1.5 Pro and Llama 3.1 70B (95 each).

Context Window: The models with the largest context windows are Gemini 1.5 Pro (2 Million) and Gemini 1.5 Flash (1 Million). These are followed by Codestral-Mamba and Jamba Instruct.

Price ($ per M tokens): OpenChat 3.5 ($0.14) and Phi-3 Medium 14B ($0.14) are the cheapest models, followed by Gemma 7B ($0.15) and Llama 3.1 8B ($0.16).

| MODEL | CREATOR | LICENSE | QUALITY | CONTEXT | INPUT $/1M | OUTPUT $/1M | KNOWLEDGE | FREE TRIAL |
|---|---|---|---|---|---|---|---|---|
| GPT-4o | OpenAI | Proprietary | 100 | 128k | $5.00 | $15.00 | Oct 2023 | https://openrouter.ai/chat?models=openai/gpt-4o |
| GPT-4 Turbo | OpenAI | Proprietary | 94 | 128k | $10.00 | $30.00 | Dec 2023 | https://openrouter.ai/chat?models=openai%2Fgpt-4-turbo |
| GPT-4o Mini | OpenAI | Proprietary | 88 | 128k | $0.15 | $0.60 | Oct 2023 | https://openrouter.ai/chat?models=openai/gpt-4o-mini |
| GPT-4 | OpenAI | Proprietary | 84 | 8k | $30.00 | $60.00 | Sep 2021 | https://openrouter.ai/chat?models=openai/gpt-4 |
| GPT-3.5 Turbo Instruct | OpenAI | Proprietary | 60 | 4k | $1.50 | $2.00 | Sep 2021 | https://openrouter.ai/chat?models=openai/gpt-3.5-turbo-instruct |
| GPT-3.5 Turbo | OpenAI | Proprietary | 59 | 16k | $0.50 | $1.50 | Sep 2021 | https://openrouter.ai/chat?models=openai/gpt-3.5-turbo-0125 |
| Gemini 1.5 Pro | Google | Proprietary | 95 | 2m | $3.50 | $10.50 | Nov 2023 | https://openrouter.ai/chat?models=google%2Fgemini-pro-1.5 |
| Gemini 1.5 Flash | Google | Proprietary | 84 | 1m | $0.35 | $1.05 | Nov 2023 | https://openrouter.ai/chat?models=google%2Fgemini-flash-1.5 |
| Gemma 2 (27B) | Google | Open | 78 | 8k | $0.80 | $0.80 | Jun 2024 | https://openrouter.ai/chat?models=google/gemma-2-27b-it |
| Gemma 2 (9B) | Google | Open | 71 | 8k | $0.20 | $0.20 | Jun 2024 | https://openrouter.ai/chat?models=google/gemma-2-9b-it |
| Gemma 7B | Google | Open | 45 | 8k | $0.15 | $0.15 | | https://openrouter.ai/chat?models=google/gemma-7b-it |
| Gemini 1.0 Pro | Google | Proprietary | 62 | 33k | $0.50 | $1.50 | Nov 2023 | https://openrouter.ai/chat?models=google%2Fgemini-pro |
| Llama 3.1 (405B) | Meta | Open | 100 | 128k | $5.33 | $9.50 | Dec 2023 | https://openrouter.ai/chat?models=meta-llama/llama-3.1-405b-instruct |
| Llama 3.1 (70B) | Meta | Open | 95 | 128k | $0.89 | $0.89 | Dec 2023 | https://openrouter.ai/chat?models=meta-llama/llama-3.1-70b-instruct |
| Llama 3 (70B) | Meta | Open | 83 | 8k | $0.90 | $0.90 | Dec 2023 | https://openrouter.ai/chat?models=meta-llama/llama-3-70b-instruct |
| Llama 3.1 (8B) | Meta | Open | 66 | 128k | $0.16 | $0.16 | Dec 2023 | https://openrouter.ai/chat?models=meta-llama/llama-3.1-8b-instruct |
| Llama 3 (8B) | Meta | Open | 64 | 8k | $0.20 | $0.20 | Mar 2023 | https://openrouter.ai/chat?models=meta-llama/llama-3-8b-instruct |
| Llama 2 Chat (70B) | Meta | Open | 57 | 4k | $0.95 | $1.00 | Sep 2022 | https://openrouter.ai/chat?models=meta-llama/llama-2-70b-chat |
| Llama 2 Chat (13B) | Meta | Open | 39 | 4k | $0.25 | $0.28 | Sep 2022 | https://openrouter.ai/chat?models=meta-llama/llama-2-13b-chat |
| Llama 2 Chat (7B) | Meta | Open | 29 | 4k | $0.20 | $0.20 | Sep 2022 | https://huggingface.co/meta-llama/Llama-2-7b-chat-hf |
| Mistral Large 2 | mistral-ai | Open | 91 | 128k | $3.00 | $9.00 | | https://chat.mistral.ai/ |
| Codestral | mistral-ai | Open | | 33k | $1.00 | $3.00 | | https://chat.mistral.ai/ |
| Codestral-Mamba | mistral-ai | Open | | 256k | $0.25 | $0.25 | | https://openrouter.ai/chat?models=mistralai/codestral-mamba |
| Mistral Large | mistral-ai | Proprietary | 76 | 33k | $4.00 | $12.00 | | https://openrouter.ai/chat?models=mistralai/mistral-large |
| Mixtral 8x22B | mistral-ai | Open | 71 | 65k | $1.20 | $1.20 | Sep 2021 | https://openrouter.ai/chat?models=mistralai/mixtral-8x22b-instruct |
| Mistral Small | mistral-ai | Proprietary | 71 | 33k | $1.00 | $3.00 | | https://openrouter.ai/chat?models=mistralai/mistral-small |
| Mistral Medium | mistral-ai | Proprietary | 70 | 33k | $2.70 | $8.10 | | https://openrouter.ai/chat?models=mistralai/mistral-medium |
| Mistral NeMo | mistral-ai | Open | 64 | 128k | $0.30 | $0.30 | Apr 2024 | https://openrouter.ai/chat?models=mistralai/mistral-nemo |
| Mixtral 8x7B | mistral-ai | Open | 61 | 33k | $0.50 | $0.55 | Dec 2023 | https://openrouter.ai/chat?models=mistralai/mixtral-8x7b-instruct |
| Mistral 7B | mistral-ai | Open | 40 | 33k | $0.20 | $0.20 | Dec 2023 | https://openrouter.ai/chat?models=mistralai/mistral-7b-instruct |
| Claude 3.5 Sonnet | Anthropic | Proprietary | 98 | 200k | $3.00 | $15.00 | Apr 2024 | https://openrouter.ai/chat?models=anthropic%2Fclaude-3.5-sonnet |
| Claude 3 Opus | Anthropic | Proprietary | 93 | 200k | $15.00 | $75.00 | Aug 2023 | https://openrouter.ai/chat?models=anthropic%2Fclaude-3-opus |
| Claude 3 Sonnet | Anthropic | Proprietary | 80 | 200k | $3.00 | $15.00 | Aug 2023 | https://openrouter.ai/chat?models=anthropic%2Fclaude-3-sonnet |
| Claude 3 Haiku | Anthropic | Proprietary | 74 | 200k | $0.25 | $1.25 | Aug 2023 | https://openrouter.ai/chat?models=anthropic%2Fclaude-3-haiku |
| Claude 2.0 | Anthropic | Proprietary | 70 | 100k | $8.00 | $24.00 | | https://openrouter.ai/chat?models=anthropic%2Fclaude-2 |
| Claude Instant | Anthropic | Proprietary | 63 | 100k | $0.80 | $2.40 | | https://openrouter.ai/chat?models=anthropic%2Fclaude-instant-1 |
| Claude 2.1 | Anthropic | Proprietary | 55 | 200k | $8.00 | $24.00 | | https://openrouter.ai/chat?models=anthropic%2Fclaude-2.1 |
| Sonar Large | Perplexity-ai | Proprietary | | 33k | $1.00 | $1.00 | | https://openrouter.ai/chat?models=perplexity/llama-3-sonar-large-32k-chat |
| Sonar Small | Perplexity-ai | Proprietary | | 33k | $0.20 | $0.20 | | https://openrouter.ai/chat?models=perplexity/llama-3-sonar-small-32k-chat |
| OpenChat 3.5 | Openchat | Open | 50 | 8k | $0.14 | $0.14 | | https://openrouter.ai/chat?models=openchat/openchat-7b |
| Command Light | Cohere | Proprietary | | 4k | $0.30 | $0.60 | | https://coral.cohere.com/ |
| Command | Cohere | Proprietary | | 4k | $1.25 | $2.00 | | https://openrouter.ai/chat?models=cohere/command |
| Command-R+ | Cohere | Open | 75 | 128k | $3.00 | $15.00 | Mar 2024 | https://openrouter.ai/chat?models=cohere/command-r-plus |
| Command-R | Cohere | Open | 63 | 128k | $0.50 | $1.50 | Mar 2024 | https://openrouter.ai/chat?models=cohere/command-r |
| Phi-3 Medium (14B) | Microsoft | Open | | 128k | $0.14 | $0.14 | Oct 2023 | https://openrouter.ai/chat?models=microsoft/phi-3-medium-128k-instruct |
| Reka Core | Reka | Proprietary | 90 | 128k | $3.00 | $15.00 | Nov 2023 | https://chat.reka.ai/ |
| Reka Flash | Reka | Proprietary | 78 | 128k | $0.80 | $2.00 | Nov 2023 | https://chat.reka.ai/ |
| Reka Edge | Reka | Proprietary | 60 | 64k | $0.40 | $1.00 | Nov 2023 | https://chat.reka.ai/ |
| DBRX | Databricks | Open | 62 | 33k | $1.20 | $1.20 | Dec 2023 | https://openrouter.ai/chat?models=databricks/dbrx-instruct |
| Jamba Instruct | Ai21 | Open | 63 | 256k | $0.50 | $0.70 | Mar 2024 | https://openrouter.ai/chat?models=ai21/jamba-instruct |
| Arctic | Snowflake | Open | 55 | 4k | $2.40 | $2.40 | | https://openrouter.ai/chat?models=snowflake/snowflake-arctic-instruct |
| Qwen2 (72B) | Alibaba | Open | 83 | 128k | $0.90 | $0.90 | | https://openrouter.ai/chat?models=qwen/qwen-2-7b-instruct |
| Yi-Large | 01ai | Proprietary | 81 | 32k | $3.00 | $3.00 | | https://openrouter.ai/chat?models=01-ai/yi-large |
| DeepSeek-Coder-V2 | Deepseek | Open | | 128k | $0.14 | $0.28 | | https://openrouter.ai/chat?models=deepseek/deepseek-coder |
| DeepSeek-V2 | Deepseek | Open | 82 | 128k | $0.14 | $0.28 | | https://openrouter.ai/chat?models=deepseek/deepseek-chat |
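
The table writes context windows in shorthand (8k, 128k, 2m) and prices with a leading dollar sign. To work with these values programmatically, a small helper along the following lines may be useful. This is a minimal sketch: the function names `parse_context` and `parse_price` are hypothetical, and it assumes "k" means roughly 1,000 tokens and "m" roughly 1,000,000, so the results are approximations of the providers' exact limits.

```python
def parse_context(text):
    """Convert shorthand such as '8k', '128k', or '2m' to an approximate token count.

    Assumption: 'k' = 1,000 and 'm' = 1,000,000 (rounded figures, not exact limits).
    """
    text = text.strip().lower()
    if not text:
        raise ValueError("empty context value")
    multipliers = {"k": 1_000, "m": 1_000_000}
    suffix = text[-1]
    if suffix in multipliers:
        return int(float(text[:-1]) * multipliers[suffix])
    return int(text)


def parse_price(text):
    """Convert a price cell such as '$0.14' to a float in US dollars per 1M tokens."""
    return float(text.strip().lstrip("$"))


# Values taken from the Gemini 1.5 Pro row of the table.
print(parse_context("2m"), parse_price("$3.50"))
```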

Key Definitions

  • Quality: The index shows the average performance across Chatbot Arena, MMLU, and MT-Bench, adjusted for comparison.
  • Context Window: The maximum total number of tokens allowed for both input and output combined. The limit for output tokens is often much lower and varies depending on the model.
  • Input Price: Cost of the tokens sent to the API as part of the request or message, shown in US dollars per million tokens.
  • Output Price: Cost of the tokens produced by the model (output from the API), shown in US dollars per million tokens.
  • Knowledge: The knowledge cutoff is the date after which a model's training data contains no new information.
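
As an illustration of how the price and context window definitions combine, the sketch below estimates the cost of a single request and checks that it fits within a model's context window. The function names and the token counts are hypothetical; the prices and context size are the GPT-4o figures from the table, with 128k taken as roughly 128,000 tokens.

```python
def request_cost_usd(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimate the cost of one request from prices quoted per million tokens."""
    return (input_tokens * input_price_per_m + output_tokens * output_price_per_m) / 1_000_000


def fits_context(input_tokens, output_tokens, context_window):
    """The context window limits input and output tokens combined."""
    return input_tokens + output_tokens <= context_window


# GPT-4o row: $5.00 per 1M input tokens, $15.00 per 1M output tokens, 128k context.
# The request size (2,000 tokens in, 500 tokens out) is an assumed example.
cost = request_cost_usd(2_000, 500, 5.00, 15.00)
print(f"${cost:.4f}")                      # prints "$0.0175"
print(fits_context(2_000, 500, 128_000))   # prints "True"
```

Because output tokens are usually priced higher than input tokens, workloads that generate long responses can cost noticeably more than the input price alone suggests.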

Frequently Asked Questions

What is the LLM Leaderboard?

The LLM Leaderboard is a comprehensive tool designed to compare various Large Language Models (LLMs) based on multiple key metrics such as performance on benchmarks, specific capabilities, price, and other relevant factors.

Which LLM API is the best?

The “best” LLM API depends on your specific needs and budget. Our LLM price comparison tool helps you evaluate factors like quality, price, and features to find the ideal fit.

How do I compare LLM prices?

To compare LLM prices, use our LLM Leaderboard tool on the official ChatGBT website and check the input and output cost per million tokens for each model. Compare these costs against your budget and consider the balance between price and the quality and performance you need.
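
One way to make this comparison concrete is to estimate what a given workload would cost on each candidate model and rank the models that meet a minimum quality score. The sketch below is only an illustration: the `Model` class, the function names, the monthly token volumes, and the quality threshold are all assumptions, while the quality scores and prices are copied from a handful of rows in the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    quality: int         # quality index from the table
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens


# A few rows copied from the leaderboard table.
MODELS = [
    Model("GPT-4o", 100, 5.00, 15.00),
    Model("GPT-4o Mini", 88, 0.15, 0.60),
    Model("Claude 3.5 Sonnet", 98, 3.00, 15.00),
    Model("Llama 3.1 (70B)", 95, 0.89, 0.89),
    Model("Gemini 1.5 Flash", 84, 0.35, 1.05),
]


def workload_cost(model, input_tokens, output_tokens):
    """Estimated cost in USD for a workload of the given token volumes."""
    return (input_tokens * model.input_price + output_tokens * model.output_price) / 1_000_000


def rank_by_cost(models, input_tokens, output_tokens, min_quality=0):
    """Keep models at or above min_quality, sorted from cheapest to most expensive."""
    eligible = [m for m in models if m.quality >= min_quality]
    return sorted(eligible, key=lambda m: workload_cost(m, input_tokens, output_tokens))


# Assumed workload: 50M input tokens and 10M output tokens per month, quality of at least 90.
ranked = rank_by_cost(MODELS, 50_000_000, 10_000_000, min_quality=90)
for m in ranked:
    print(f"{m.name}: ${workload_cost(m, 50_000_000, 10_000_000):,.2f}")
```

For this assumed workload the cheapest eligible model is Llama 3.1 (70B), followed by Claude 3.5 Sonnet and GPT-4o. Changing the input-to-output ratio or the quality threshold can change the ordering, which is why the comparison should be run against your own expected usage.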