Pricing - OpenCompress

How pricing works

OpenCompress uses a pay-for-savings model. We charge 20% of the money we save you. If compression doesn’t save you anything on a particular request, there’s no additional fee.

Your cost = LLM cost (on compressed tokens) + 20% × savings

Example

Say you send a 10,000-token prompt that we compress to 6,000 tokens using GPT-4o ($2.50/1M input tokens):

	Tokens	Cost
Without OpenCompress	10,000 input	$0.0250
With OpenCompress	6,000 input	$0.0150
Savings	4,000 tokens	$0.0100
OpenCompress fee (20%)		$0.0020
Your total cost		$0.0170
Net savings		$0.0080 (32%)

You save 32% on this call. We earn $0.002. Everyone wins.

Router vs BYOK pricing

Router Mode

You pay us: LLM cost + 20% feeWe handle the LLM provider. Simple, no setup.

BYOK Mode

You pay us: 20% fee onlyYou pay your LLM provider directly. We only charge the compression fee.

Billing details

Minimum deposit: $10
Deposit options: $10,$ 50, $100,$ 500, $1,000, or custom amount
Payment: Stripe (credit card)
Balance: Prepaid, deducted per request
Refunds: Available for unused balance

Output savings

When we compress your input, the model’s output also tends to be shorter. We estimate output savings conservatively:

output_savings = input_compression_rate × 0.5

This means a 40% input compression results in an estimated 20% shorter output. Both input and output savings are factored into your total savings calculation.

How Billing WorksPer-request billing with full transparency.

​How pricing works

​Example

​Router vs BYOK pricing