What is the best LLM API for cheap api?

Based on our verified pricing data, the cheapest model that qualifies is Granite 4.0 Micro by IBM at $0.017/Mtok input. See the full ranking above for more options.

How often are prices updated?

Prices are verified against official provider pricing pages 3 times daily (8am, 2pm, 8pm UTC) by our automated scraper pipeline.

Pricing / Best For / Cheapest LLM APIs by Token Cost

Cheapest LLM APIs by Token Cost

The cheapest LLM APIs ranked by token cost. Find budget-friendly models under $1/Mtok for high-volume production workloads.

50 models qualify Showing top 15 Sorted by blended cost

Granite 4.0 Micro

IBM

$0.017 in $0.112 out

$0.065/Mtok blended

128K ctx

LFM2 24B A2B

Together

$0.030 in $0.120 out

$0.075/Mtok blended

128K ctx

GLM-OCR

Zhipu

$0.030 in $0.030 out

$0.030/Mtok blended

128K ctx

Cost calculator for this use case

Tokens per day

Input/output ratio: 70/30

Days per month

🥇 Granite 4.0 Micro $—

🥈 LFM2 24B A2B $—

🥉 GLM-OCR $—

Full ranking — top 15 models

#	Model	Provider	Input $/Mtok	Output $/Mtok	Blended	Context
1	Granite 4.0 Micro	IBM	$0.017	$0.112	$0.065	128K	→
2	LFM2 24B A2B	Together	$0.030	$0.120	$0.075	128K	→
3	GLM-OCR	Zhipu	$0.030	$0.030	$0.030	128K	→
4	Nova Micro	Amazon	$0.035	$0.140	$0.088	128K	→
5	Qwen-Turbo	Alibaba	$0.050	$0.200	$0.125	1M	→
6	Nova Lite	Amazon	$0.060	$0.240	$0.150	300K	→
7	Granite 4 H Small	IBM	$0.060	$0.250	$0.155	128K	→
8	Baichuan M2-32B	Baichuan	$0.070	$0.070	$0.070	33K	→
9	GPT OSS 20B	Fireworks	$0.070	$0.300	$0.185	128K	→
10	GLM-4.7-FlashX	Zhipu	$0.070	$0.400	$0.235	128K	→
11	Gemini 2.5 Flash	Google	$0.075	$0.300	$0.188	1M	→
12	Mistral Small 3.2 24B	Mistral	$0.080	$0.200	$0.140	128K	→
13	Gemini 2.5 Flash-Lite	Google	$0.100	$0.400	$0.250	1M	→
14	Ministral 3 3B	Mistral	$0.100	$0.100	$0.100	128K	→
15	Voxtral Small 24B	Mistral	$0.100	$0.300	$0.200	128K	→

How models are selected

Budget-category models, sorted by input price per million tokens.

Prices are per million tokens (Mtok), sourced directly from official provider pricing pages and verified by our automated scraper pipeline that runs 3x daily. "Blended cost" is the average of input and output pricing — a quick proxy for typical 50/50 usage patterns.

Cheapest LLM APIs by Token Cost

Granite 4.0 Micro

LFM2 24B A2B

GLM-OCR

Cost calculator for this use case

Full ranking — top 15 models

How models are selected

Other use case rankings