How quota consumption is calculated
When a request completes, OpenOpen8 calculates the quota consumed using this formula:- tokens — the number of tokens processed in the request (input + output, or as determined by the provider)
- model_ratio — a multiplier specific to the model used
- group_ratio — a multiplier applied to your account’s group
Model ratios
Each model has two ratios:| Ratio | What it controls |
|---|---|
| Input ratio | Multiplier applied to input (prompt) tokens |
| Output ratio | Multiplier applied to output (completion) tokens |
gpt-4o has an input ratio of 5 and an output ratio of 15, a request that uses 1,000 input tokens and 500 output tokens would consume (1000 × 5) + (500 × 15) = 12,500 quota units.
You can view the effective pricing for each model in the OpenOpen8 dashboard. Ratios are set to closely match actual upstream provider costs.
Group ratios
Group ratios apply a blanket multiplier to all requests from users in a given group. This enables:- Discounted tiers — a group ratio below
1.0means credits are consumed more slowly - Standard tiers — the default group ratio is
1 - Premium tiers — a group ratio above
1.0for higher-cost access
5 combined with a group ratio of 0.5 results in an effective multiplier of 2.5.
Checking your effective rates
To understand how much a specific model costs in credits:- Log in to the OpenOpen8 dashboard at openopen8.ai.
- Check the model pricing information available in the dashboard.
- Review your usage in Logs to see actual credit consumption per request.