Per-request billing
Every API call is billed individually. After each request completes, we:

- Measure the original and compressed token counts
- Calculate the cost difference (savings)
- Charge the actual LLM cost plus 20% of savings (router mode), or only the 20% savings fee (BYOK, bring your own key)
- Deduct from your prepaid balance
- Log the full breakdown in your dashboard
Billing formula
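The charge described in the steps above can be sketched as follows. This is a minimal illustration, not the service's actual implementation. It assumes that "savings" means the cost at the original token count minus the cost at the compressed token count, and that savings are floored at zero. The function name, parameter names, and mode strings are hypothetical.

```python
from decimal import Decimal

# 20% of savings, as stated in the billing steps.
FEE_RATE = Decimal("0.20")


def amount_charged(original_cost: Decimal, compressed_cost: Decimal,
                   mode: str = "router") -> Decimal:
    """Return the amount deducted from the prepaid balance for one request.

    original_cost   -- assumed: cost of the request at the original token count
    compressed_cost -- assumed: actual LLM cost after compression
    mode            -- "router" or "byok" (hypothetical identifiers)
    """
    # Assumption: savings are floored at zero if compression does not help.
    savings = max(original_cost - compressed_cost, Decimal("0"))
    fee = FEE_RATE * savings
    if mode == "router":
        # Router mode: actual LLM cost plus the fee on savings.
        return compressed_cost + fee
    if mode == "byok":
        # BYOK: only the fee on savings is charged.
        return fee
    raise ValueError(f"unknown mode: {mode!r}")
```

Under these assumptions, a request that would cost $1.00 uncompressed and $0.60 compressed is charged $0.68 in router mode and $0.08 in BYOK mode.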
What you see in the dashboard
Each request in your usage history shows:

| Field | Description |
|---|---|
| Model | The model used |
| Input tokens | Original → compressed |
| Output tokens | Actual tokens generated |
| Compression rate | % of input tokens saved |
| Savings | Dollar amount saved |
| Charged | Amount deducted from balance |
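As a rough illustration of the Compression rate field, the percentage of input tokens saved can be computed from the original and compressed counts. The function name is hypothetical, and returning zero for an empty input is an assumption rather than documented behavior.

```python
def compression_rate(original_tokens: int, compressed_tokens: int) -> float:
    """Percentage of input tokens saved by compression."""
    if original_tokens <= 0:
        # Assumption: an empty input reports 0% rather than raising.
        return 0.0
    return 100 * (original_tokens - compressed_tokens) / original_tokens
```

For example, compressing 1,000 input tokens down to 600 gives a compression rate of 40%.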
Balance protection
- If your balance reaches $0, API requests return `402 Insufficient balance`
- We never charge more than your available balance
- No surprise charges — you control how much you deposit
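A client may want to treat the 402 response as a distinct condition, since retrying will not succeed until funds are deposited. The sketch below is transport-agnostic and takes only the HTTP status code. The exception class and function name are hypothetical, not part of any SDK.

```python
class InsufficientBalanceError(Exception):
    """Raised when a request is rejected with 402 Insufficient balance."""


def raise_for_balance(status_code: int) -> None:
    """Raise if the response status indicates an empty prepaid balance."""
    if status_code == 402:
        # Retrying is pointless until the balance is topped up.
        raise InsufficientBalanceError(
            "Balance is $0; deposit funds before retrying."
        )
```

Calling `raise_for_balance(response_status)` right after each request lets the caller stop issuing requests and prompt for a deposit instead of looping on retries.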
Monitoring usage
Track your spending in real time:

- Dashboard: opencompress.ai/dashboard shows balance, monthly stats, and per-request logs
- API: Usage data is included in the `usage` field of every response
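Reading the `usage` field programmatically might look like the sketch below. It assumes the response body is JSON with `usage` as a top-level object. The keys inside `usage` are not listed here, so the function returns whatever is present without assuming any names.

```python
import json


def extract_usage(response_body: str) -> dict:
    """Return the `usage` object from a JSON response body, or {} if absent."""
    payload = json.loads(response_body)
    # Assumption: `usage` is a top-level key holding a JSON object.
    usage = payload.get("usage") if isinstance(payload, dict) else None
    return usage if isinstance(usage, dict) else {}
```

Logging the returned dictionary per request gives a local record to compare against the dashboard's per-request logs.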