Jun 25, 2026

Nemotron 70B Instruct vs Mixtral 8x7B Instruct

Side-by-side API pricing comparison · NVIDIA vs Mistral

🏆 Nemotron 70B Instruct is 81.5% cheaper on blended cost ($0.100 vs $0.540/Mtok)

Nemotron 70B Instruct

by NVIDIA

Status: Current, open weights
Input: $0.100/Mtok
Output: $0.100/Mtok
Blended avg: $0.100/Mtok (cheaper)
Context: 128K tokens
Modality: text
Parameters: 70B
Released: Jun 1, 2025

Mixtral 8x7B Instruct

by Mistral

Status: Current, open weights
Input: $0.540/Mtok
Output: $0.540/Mtok
Blended avg: $0.540/Mtok
Context: 32K tokens
Modality: text
Parameters: 46.7B (8x7B MoE)
Released: Jul 1, 2024

Cost at scale (50/50 input/output split)

Volume         Nemotron 70B Instruct   Mixtral 8x7B Instruct   Savings
1M tokens      $0.10                   $0.54                   $0.44 (81.5%)
10M tokens     $1.00                   $5.40                   $4.40 (81.5%)
100M tokens    $10.00                  $54.00                  $44.00 (81.5%)
1,000M tokens  $100.00                 $540.00                 $440.00 (81.5%)
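
The figures in this table follow from a blended rate weighted by the input/output split, multiplied by the token volume. A minimal sketch in Python, assuming the 50/50 split stated in the heading; the function names `blended_rate` and `cost_usd` are illustrative and not part of any provider API.

```python
def blended_rate(input_per_mtok: float, output_per_mtok: float,
                 input_share: float = 0.5) -> float:
    """Weighted average price per million tokens, in USD."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_per_mtok * input_share + output_per_mtok * (1.0 - input_share)


def cost_usd(total_tokens: int, rate_per_mtok: float) -> float:
    """Cost in USD for a token volume at a per-million-token rate."""
    return total_tokens / 1_000_000 * rate_per_mtok


# Listed prices from the comparison ($/Mtok, input and output).
nemotron = blended_rate(0.100, 0.100)
mixtral = blended_rate(0.540, 0.540)

for volume in (1_000_000, 10_000_000, 100_000_000, 1_000_000_000):
    a = cost_usd(volume, nemotron)
    b = cost_usd(volume, mixtral)
    print(f"{volume // 1_000_000:>5}M tokens  ${a:>7.2f}  ${b:>7.2f}  saves ${b - a:>7.2f}")
```

Because both models list the same price for input and output, the split does not change the result here; `input_share` only matters for models whose input and output rates differ.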

Summary

Nemotron 70B Instruct by NVIDIA costs $0.100/Mtok input and $0.100/Mtok output, with a 128K-token context window. It supports text input.

Mixtral 8x7B Instruct by Mistral costs $0.540/Mtok input and $0.540/Mtok output, with a 32K-token context window. It supports text input.

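On a blended cost basis, Nemotron 70B Instruct is 81.5% cheaper than Mixtral 8x7B Instruct ($0.100 vs $0.540/Mtok); put the other way, Mixtral costs 5.4 times as much. Nemotron 70B Instruct also has a larger context window (128K vs 32K tokens).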
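
A price cannot fall by more than 100%, so it helps to keep two percentages apart: the savings measured against the more expensive price, and the premium measured against the cheaper one. The sketch below computes both from the blended rates listed above; the helper names `percent_cheaper` and `percent_more_expensive` are hypothetical and chosen only for illustration.

```python
def percent_cheaper(cheaper: float, pricier: float) -> float:
    """Savings relative to the more expensive price, in percent."""
    return (pricier - cheaper) / pricier * 100.0


def percent_more_expensive(cheaper: float, pricier: float) -> float:
    """Premium relative to the cheaper price, in percent."""
    return (pricier - cheaper) / cheaper * 100.0


# Blended $/Mtok from the comparison above.
nemotron, mixtral = 0.100, 0.540

print(f"Nemotron is {percent_cheaper(nemotron, mixtral):.1f}% cheaper")
print(f"Mixtral is {percent_more_expensive(nemotron, mixtral):.0f}% more expensive")
print(f"Price ratio: {mixtral / nemotron:.1f}x")
```

With these inputs the savings come to about 81.5%, matching the cost-at-scale table, while the 440% figure is the premium Mixtral carries over Nemotron.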

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.