Question 1

What is the cheapest LLM for chatbots?

Accepted Answer

Llama 3.1 8B Instruct (Meta) is the cheapest generally-available model we track for chatbots, at $0.02 per 1M input tokens and $0.03 per 1M output tokens — about $0.29/month for a busy chatbot handling ~10M input and ~3M output tokens a month. Ministral 3 3B is the next cheapest at $0.52/month.

Question 2

How is "cheapest for chatbots" calculated?

Accepted Answer

We price a representative monthly workload — a busy chatbot handling ~10M input and ~3M output tokens a month — against every generally-available model, then rank by total cost. All prices are USD per 1M tokens, sourced from official provider documentation.

Question 3

Is the cheapest model always the right choice for chatbots?

Accepted Answer

No. Price is one axis; quality, latency, rate limits and reliability matter too. Use this ranking to shortlist, then test the top candidates on your own chatbots workload before committing. Cost is easy to measure — fit is not.

#	Model	Context	Input $/M	Output $/M	Monthly cost
1	Llama 3.1 8B Instruct Meta	128K	$0.02	$0.03	$0.29 ◎
2	Ministral 3 3B Mistral	—	$0.04	$0.04	$0.52
3	Amazon Nova Micro Amazon	128K	$0.035	$0.14	$0.77
4	Command R7B Cohere	128K	$0.037	$0.15	$0.82
5	Amazon Nova Lite Amazon	300K	$0.06	$0.24	$1.32
6	Qwen-Flash Alibaba	1M	$0.05	$0.4	$1.70
7	Llama 4 Scout (17B-16E Instruct) Meta	10M	$0.1	$0.3	$1.90
8	Ministral 3 8B Mistral	256K	$0.15	$0.15	$1.95
9	Llama 3.3 70B Instruct Meta	128K	$0.1	$0.32	$1.96
10	Qwen3.5-Flash Alibaba	1M	$0.1	$0.4	$2.20
11	Ministral 3 14B Mistral	—	$0.2	$0.2	$2.60
12	Llama 4 Maverick (17B-128E Instruct) Meta	1M	$0.15	$0.6	$3.30

Cheapest LLM for a chatbot

Cheapest models for chatbots

Cheapest LLM for chatbots

Cheapest LLM for a chatbot

Cheapest models for chatbots

Cheapest LLM for chatbots

Get alerted when a cheaper model for chatbots ships