The $360K question about Large Language Model economics

June 22, 2023
The purpose of this article is to educate the reader about how Large Language Model (LLM) pricing works. It is motivated by our conversations with multiple companies using LLMs commercially: we realized in these conversations that LLM economics is often misunderstood, leaving huge scope for optimization.

Do you realize that the same task can cost $3,500 with one model or $1,260,000 with another? The difference does come with a difference in performance, but it leaves a lot of room in the middle to think about the trade-off between cost and performance: is the task one where a cheaper model is good enough?

We have found companies, time and again, overestimating or underestimating their spend on Large Language Models. So here, we will try to understand the cost of running some of the popular large language models and how their pricing works.

ℹ️ The purpose of this blog is not to educate the reader on LLMs or their performance. This is a math-intensive blog focused on understanding LLM pricing. For simplicity, we will not compare performance across these models.

Summarizing Wikipedia

The sample for pricing analysis

To understand how pricing for LLMs works, we will compare the cost incurred for the same task across models: summarizing Wikipedia to half of its size.

Size of the task

We will use some approximations to simplify the calculations and make them easy to follow.

Size of the Wikipedia Corpus

  • ~ 6 Million articles in total
  • ~ 750 Words per article
  • ~ 1000 tokens per article

Tokens are sub-word units of text whose boundaries do not necessarily align with the start or end of words. The token is the unit into which OpenAI APIs break input text before processing it; tokens can include trailing spaces and even sub-words.
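
If you want to verify the ~1,000-tokens-per-article approximation yourself, a minimal sketch with OpenAI's tiktoken library (assuming the cl100k_base encoding used by the GPT-3.5/GPT-4 family) looks like this:

```python
# pip install tiktoken
import tiktoken

# cl100k_base is the tokenizer used by the GPT-3.5/GPT-4 family of models
enc = tiktoken.get_encoding("cl100k_base")

text = "Wikipedia is a free online encyclopedia, created and edited by volunteers."
tokens = enc.encode(text)

# Typical English text averages roughly 0.75 words per token,
# which is how ~750 words per article becomes ~1,000 tokens.
print(len(text.split()), "words ->", len(tokens), "tokens")
```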

The expected size of the summarized output

For this task, we assume for simplicity that each article is compressed to exactly half its size. Hence the outputs we expect are as follows (a quick sanity check in code follows the figure below):

  • ~6 Million articles
  • ~375 words per summarized article
  • ~500 Tokens per article
Sample Task: Summarising Wikipedia Articles
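
Under these assumptions, the total volume of the task works out as follows (a quick sketch of the arithmetic; the numbers are the approximations listed above):

```python
# Approximate size of the summarization task, per the assumptions above
articles = 6_000_000               # ~6 Mn Wikipedia articles
input_tokens_per_article = 1_000   # ~750 words ≈ 1,000 tokens
output_tokens_per_article = 500    # summary at half the original size

total_input_tokens = articles * input_tokens_per_article    # 6 Bn tokens
total_output_tokens = articles * output_tokens_per_article  # 3 Bn tokens

print(f"Input:  {total_input_tokens / 1e9:.0f} Bn tokens")
print(f"Output: {total_output_tokens / 1e9:.0f} Bn tokens")
```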

Understanding the costs

What this task would cost with different models

Levers of pricing in OpenAI/3rd Party APIs

OpenAI and other 3rd-party APIs usually charge on two levers if you want to run inference through their APIs:

Input Cost

This cost depends on the number of tokens (explained above) passed as context/prompt/instruction to the API.

Output Cost

This cost is based on the number of tokens the API returns as a response.

For a task like summarization, you need to pass the entire document or excerpt to be summarized to the model, so the number of tokens in the prompt, and hence the input cost, can become significant.
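
Both levers reduce to one simple formula, which we reuse for every API-priced model below. A minimal sketch (the function name is illustrative; the rates come from each model's unit costs listed below):

```python
def api_cost(input_tokens: float, output_tokens: float,
             input_rate_per_mn: float, output_rate_per_mn: float) -> float:
    """Total API cost in dollars, given token counts and $/Mn-token rates."""
    return (input_tokens / 1e6) * input_rate_per_mn \
         + (output_tokens / 1e6) * output_rate_per_mn

# GPT-4 8K example: 6 Bn input tokens at $30/Mn, 3 Bn output tokens at $60/Mn
print(api_cost(6e9, 3e9, 30, 60))  # 360000.0
```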

Basis of the cost incurred with self-hosted models

With self-hosted models, the user needs to provision and manage the machines that run the model. Though this may include the cost of managing those resources, the pricing is relatively easy to understand, since it is simply the running cost of the machine (usually what the cloud provider charges, unless you have your own on-prem cluster).

Cost of Machine

This is the cost of provisioning the machine required to run/host the model. Since most of these models are larger than what can run on a laptop or a single local device, using a cloud provider for these machines is the most common option.

Cloud providers offer such instances, though users might face availability issues, since these models require GPUs.

Instance costs are published by AWS, Google Cloud, and Microsoft Azure (see each provider's pricing page for current rates).

Spot instances

Cloud providers sell their spare capacity at 40-90% discounts compared to on-demand instances, with the caveat that a spot instance can be reclaimed at short notice.

Comparing the cost of different models

GPT 4 - 8K context length

Unit Costs

| Input Cost (/Mn tokens) | Output Cost (/Mn tokens) |
| --- | --- |
| $30 | $60 |

Cost Formula

Cost = Tokens per Article (in 1000s) × No. of Articles (in 1000s) × Unit Cost (per Mn tokens)

Cost of Input

1K (tokens/article) × 6,000K (articles) × $30 (/Mn tokens) = $180,000

Cost of Output

0.5K (tokens/article) × 6,000K (articles) × $60 (/Mn tokens) = $180,000

Total Cost

Input Cost + Output Cost = $360,000

GPT 4 - 32K context length

Unit Costs

| Input Cost (/Mn tokens) | Output Cost (/Mn tokens) |
| --- | --- |
| $60 | $120 |

Cost Formula

Cost = Tokens per Article (in 1000s) × No. of Articles (in 1000s) × Unit Cost (per Mn tokens)

Cost of Input

1K (tokens/article) × 6,000K (articles) × $60 (/Mn tokens) = $360,000

Cost of Output

0.5K (tokens/article) × 6,000K (articles) × $120 (/Mn tokens) = $360,000

Total Cost

Input Cost + Output Cost = $720,000

Anthropic Claude V1

Unit Costs

| Input Cost (/Mn tokens) | Output Cost (/Mn tokens) |
| --- | --- |
| $11 | $32 |

Cost Formula

Cost = Tokens per Article (in 1000s) × No. of Articles (in 1000s) × Unit Cost (per Mn tokens)

Cost of Input

1K (tokens/article) × 6,000K (articles) × $11 (/Mn tokens) = $66,000

Cost of Output

0.5K (tokens/article) × 6,000K (articles) × $32 (/Mn tokens) = $96,000

Total Cost

Input Cost + Output Cost = $162,000

InstructGPT - DaVinci

Unit Costs

| Input Cost (/Mn tokens) | Output Cost (/Mn tokens) |
| --- | --- |
| $20 | $20 |

Cost Formula

Cost = Tokens per Article (in 1000s) × No. of Articles (in 1000s) × Unit Cost (per Mn tokens)

Cost of Input

1K (tokens/article) × 6,000K (articles) × $20 (/Mn tokens) = $120,000

Cost of Output

0.5K (tokens/article) × 6,000K (articles) × $20 (/Mn tokens) = $60,000

Total Cost

Input Cost + Output Cost = $180,000

Curie

Unit Costs

| Input Cost (/Mn tokens) | Output Cost (/Mn tokens) |
| --- | --- |
| $2 | $2 |

Cost Formula

Cost = Tokens per Article (in 1000s) × No. of Articles (in 1000s) × Unit Cost (per Mn tokens)

Cost of Input

1K (tokens/article) × 6,000K (articles) × $2 (/Mn tokens) = $12,000

Cost of Output

0.5K (tokens/article) × 6,000K (articles) × $2 (/Mn tokens) = $6,000

Total Cost

Input Cost + Output Cost = $18,000

Self-Hosted 7B Model

Unit Costs

| Cost of Running Machine (/hr, spot A100-80GB) |
| --- |
| $10 |

Cost Formula

Cost = Machine Hours Required × Cost per Hour, where Machine Hours = Total Tokens ÷ Throughput (tokens/hr)

Cost of Input

Assuming a batched 7B model on an A100-80GB can ingest roughly 170 Mn prompt tokens per hour: 6,000 Mn (tokens) ÷ 170 Mn (tokens/hr) × $10 (/hr) ≈ $350

Cost of Output

Generation is much slower than prompt ingestion; assuming roughly 17 Mn generated tokens per hour: 3,000 Mn (tokens) ÷ 17 Mn (tokens/hr) × $10 (/hr) ≈ $1,750

Total Cost

Input Cost + Output Cost ≈ $2,100

(The throughput figures above are assumptions chosen to be consistent with the summary table below; actual throughput depends on batching, sequence lengths, and the serving stack.)
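
The same back-of-the-envelope calculation in code (the throughput figures are assumptions, chosen to be consistent with the totals above, not measured benchmarks):

```python
def self_hosted_cost(total_tokens: float, tokens_per_hour: float,
                     hourly_rate: float) -> float:
    """Machine cost in dollars: hours of compute needed times the hourly rate."""
    machine_hours = total_tokens / tokens_per_hour
    return machine_hours * hourly_rate

HOURLY_RATE = 10.0  # $/hr for a spot A100-80GB

# Assumed throughputs for a batched 7B model (see the caveat above)
input_cost = self_hosted_cost(6e9, 170e6, HOURLY_RATE)   # ~$353
output_cost = self_hosted_cost(3e9, 17e6, HOURLY_RATE)   # ~$1,765

print(f"~${input_cost + output_cost:,.0f} total")  # ~$2,100
```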

Fine Tuning Models

Most enterprise use cases need models fine-tuned on their own data for particular tasks. Multiple companies have reported that fine-tuned open-source models are on par with, or sometimes even better than, 3rd-party APIs like OpenAI for a specific task.

Fine Tuned DaVinci

Cost of summarisation with a fine-tuned DaVinci model (per the summary table below: ~$180K for fine-tuning, $720K for input, $360K for output)

Total Cost

Fine-Tuning Cost + Input Cost + Output Cost = $1,260,000

Fine Tuned Curie

Cost of summarisation with a fine-tuned Curie model (per the summary table below: ~$18K for fine-tuning, $72K for input, $36K for output)

Total Cost

Fine-Tuning Cost + Input Cost + Output Cost = $126,000

Self Hosted, Fine Tuned, 7B Model

Cost of summarisation with a self-hosted, fine-tuned 7B model (per the summary table below: ~$1,400 for fine-tuning, $350 for input, $1,750 for output)

Total Cost

Fine-Tuning Cost + Input Cost + Output Cost = $3,500

Putting it all together

| Pretrained / Fine-Tuned | Model Name | Params* | Fine-Tuning Cost ($) | Input Cost ($) | Output Cost ($) | Total Cost ($) |
| --- | --- | --- | --- | --- | --- | --- |
| Pretrained | GPT-4 32K | 1 Tn+ | NA | 360K | 360K | 720K |
| Pretrained | GPT-4 8K | 1 Tn+ | NA | 180K | 180K | 360K |
| Pretrained | DaVinci | 175 Bn | NA | 120K | 60K | 180K |
| Pretrained | Claude v1 | 52 Bn | NA | 66K | 96K | 162K |
| Pretrained | Curie | 13 Bn | NA | 12K | 6K | 18K |
| Pretrained | Self-hosted 7B | 7 Bn | NA | 350 | 1,750 | 2.1K |
| Fine-Tuned | DaVinci | 175 Bn | 180K | 720K | 360K | 1.26M |
| Fine-Tuned | Curie | 13 Bn | 18K | 72K | 36K | 126K |
| Fine-Tuned | Self-hosted 7B | 7 Bn | 1,400 | 350 | 1,750 | 3.5K |
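
The API-priced rows of this table can be reproduced with the formula used throughout. A minimal sketch (the fine-tuned DaVinci/Curie usage rates of $120 and $12 per Mn tokens are implied by the table, i.e., each input cost divided by 6,000 Mn tokens):

```python
# Token volumes for the task, from the assumptions above
IN_TOK, OUT_TOK = 6e9, 3e9  # 6 Bn input tokens, 3 Bn output tokens

# (input $/Mn tokens, output $/Mn tokens) for each API-priced model
rates = {
    "GPT-4 32K": (60, 120),
    "GPT-4 8K": (30, 60),
    "DaVinci": (20, 20),
    "Claude v1": (11, 32),
    "Curie": (2, 2),
    "Fine-tuned DaVinci": (120, 120),  # plus ~$180K one-time fine-tuning
    "Fine-tuned Curie": (12, 12),      # plus ~$18K one-time fine-tuning
}

for model, (in_rate, out_rate) in rates.items():
    total = (IN_TOK / 1e6) * in_rate + (OUT_TOK / 1e6) * out_rate
    print(f"{model:>20}: ${total:>12,.0f}")
```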

Things to notice from the pricing:

  1. DaVinci and Curie models are ~7X more expensive if you fine-tune them for your use case
  2. Cost increases by ~2X with an increase in context window
  3. Cost of using a model increases with the number of parameters of the model

Effect of fine-tuning on performance

We use the following benchmark to analyze the effect of fine-tuning on model performance. It is interesting to notice that:

  1. Lower parameter models can also perform better than larger models when fine-tuned for a particular use case.
  2. Significant cost saving is possible without harming the performance much if the right trade-off is established between cost and performance.

| Task Type | Best 6B/7B OOTB Model (Few-shot) | MoveLM 7B (Zero-shot) | GPT-3.5 Turbo (Zero-shot) | GPT-3.5 Turbo (Few-shot) | GPT-4 (Zero-shot) | GPT-4 (Few-shot) |
| --- | --- | --- | --- | --- | --- | --- |
| Relevance - internal dataset | 0.33 | 0.93 | 0.84 | 0.84 | 0.92 | 0.95 |
| Extraction - structured output for queries | 0.38 | 0.98 | 0.22 | 0.72 | 0.38 | 0.73 |
| Reasoning - custom triggering | 0.62 | 0.93 | 0.87 | 0.88 | 0.9 | 0.88 |
| Classification - domain of user query | 0.21 | 0.79 | 0.6 | 0.73 | 0.7 | 0.76 |
| Extraction - structured output from entity typing | 0.83 | 0.87 | 0.9 | 0.89 | 0.89 | 0.89 |

What We Are Doing

TrueFoundry believes the future of LLMs is the co-existence of open-source and commercial LLMs within the same application!

We believe in a state of applications where easier tasks are handled by lightweight open-source LLMs, while more complex tasks, or those requiring distinct capabilities (e.g., web search, API calls, etc.) offered only by closed-source commercial LLMs, are delegated to the latter.

If you are using OpenAI

We help reduce the number of tokens sent to OpenAI APIs. We decided to work on this because:

  1. We noticed more than half the cost was from processing context/prompt tokens.
  2. Not all words are necessary. LLMs are great at working with incomplete sentences.

Hence, TrueFoundry is building a compression API to save ~30% of OpenAI costs.

Compression in OpenAI
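
As a toy illustration of the second point above, dropping low-information words from a prompt reduces tokens while preserving most of the meaning. This is just a sketch of the idea, not TrueFoundry's actual compression API:

```python
# A toy prompt "compressor": drop common stopwords before sending to the API.
# This only illustrates the idea; it is not TrueFoundry's compression API.
STOPWORDS = {"a", "an", "the", "is", "are", "was", "were", "of", "to",
             "and", "or", "in", "on", "that", "this", "it", "as", "by"}

def compress_prompt(text: str) -> str:
    """Remove stopwords; LLMs usually cope well with such incomplete sentences."""
    return " ".join(w for w in text.split() if w.lower() not in STOPWORDS)

prompt = "Summarize the following article that is about the history of Wikipedia."
print(compress_prompt(prompt))  # "Summarize following article about history Wikipedia."
```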

If you want to use Open Source LLMs

We simplify running these models within your own infrastructure through the following offerings:

  1. Model Catalogue: Open-source LLMs optimized for inference & fine-tuning.
  2. Drop-in APIs: These can be directly swapped for the HuggingFace & OpenAI APIs you already use in your applications.
  3. Cost Optimisation: Across clouds on K8s, leveraging your cloud credits or budget.

TrueFoundry's Open Source LLM
