How Billing Works

Per-request billing

Every API call is billed individually. After each request completes, we:

Measure the original and compressed token counts
Calculate the cost difference (savings)
Charge the actual LLM cost + 20% of savings (router mode) or just the 20% fee (BYOK)
Deduct from your prepaid balance
Log the full breakdown in your dashboard

Billing formula

original_cost = original_input_tokens × input_price + estimated_original_output × output_price
actual_cost   = compressed_input_tokens × input_price + actual_output_tokens × output_price
savings       = max(0, original_cost - actual_cost)
fee           = savings × 0.20

# Router mode
total_charged = actual_cost + fee

# BYOK mode
total_charged = fee

What you see in the dashboard

Each request in your usage history shows:

Field	Description
Model	The model used
Input tokens	Original → compressed
Output tokens	Actual tokens generated
Compression rate	% of input tokens saved
Savings	Dollar amount saved
Charged	Amount deducted from balance

Balance protection

If your balance reaches $0, API requests return 402 Insufficient balance
We never charge more than your available balance
No surprise charges — you control how much you deposit

Monitoring usage

Track your spending in real time:

Dashboard: opencompress.ai/dashboard shows balance, monthly stats, and per-request logs
API: Usage data is included in the usage field of every response

Getting Started

Integration

Features

Billing

Per-request billing

Billing formula

What you see in the dashboard

Balance protection

Monitoring usage

Getting Started

Integration

Features

Billing

Documentation Index

​Per-request billing

​Billing formula

​What you see in the dashboard

​Balance protection

​Monitoring usage

Per-request billing

Billing formula

What you see in the dashboard

Balance protection

Monitoring usage