Token Usage

Token Usage shows how much AI processing your workspace has consumed, broken down by time period, feature, and individual user. Use this page to monitor your usage, understand costs, and plan your subscription accordingly.


What Is a Token?

Tokens are the unit Saifa AI uses to measure AI processing. Every time AI reads a message, generates a reply, or processes a request, it consumes tokens — both for the input it receives and the output it produces.

Token basics (English):

| Content | Approximate Tokens |
| --- | --- |
| 1 word | ~1.3 tokens |
| 1 sentence | ~20 tokens |
| 1 paragraph | ~100 tokens |
| 1 page of text | ~700 tokens |
| 1 image | ~1,200 tokens |

📌 1 token ≈ 4 characters ≈ 0.75 words in English. Thai text may use more tokens per word because Thai script is split into more tokens than English text of similar length.
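The rules of thumb above can be turned into a quick estimator. This is a minimal sketch, not Saifa AI's actual tokenizer: the function names are hypothetical, and the results are only rough approximations for English text.

```python
import math

CHARS_PER_TOKEN = 4    # rule of thumb: 1 token ≈ 4 characters
TOKENS_PER_WORD = 1.3  # rule of thumb: 1 word ≈ 1.3 tokens

def estimate_tokens_by_chars(text: str) -> int:
    """Rough token estimate from the character count."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)

def estimate_tokens_by_words(text: str) -> int:
    """Rough token estimate from the word count."""
    return math.ceil(len(text.split()) * TOKENS_PER_WORD)

message = "Where is my order? I placed it last Monday."
print(estimate_tokens_by_chars(message))
print(estimate_tokens_by_words(message))
```

The two estimates will usually differ slightly; treat either one as a ballpark figure rather than a billing value.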


AI Model Tiers

Saifa AI processes requests across three model tiers, each with different capability and token cost levels. Output tokens cost 4–6× more per token than input tokens across every tier.

| Tier | Best For | Saifa Tokens per Call (250 in / 150 out) |
| --- | --- | --- |
| Lite | Simple greetings, short FAQs, fast replies | 29 tokens |
| Quick | Standard customer support, product questions | 100 tokens |
| Think | Complex analysis, long conversations, detailed answers | 350 tokens |

Cost per unit — Input side:

| Content | Lite | Quick | Think |
| --- | --- | --- | --- |
| 1 word | 0.03 | 0.13 | 0.67 |
| 1 sentence | 0.5 | 2 | 10 |
| 1 paragraph | 2.5 | 10 | 50 |
| 1 page | 17.5 | 70 | 350 |
| 1 image | 30 | 120 | 600 |

💡 Most customer service conversations (greetings, order status, FAQs) use the Lite or Quick tier. Complex multi-turn conversations or AI analysis features use Think.
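To estimate input-side cost for content of other sizes, the per-unit table can be reduced to a per-token rate for each tier. The sketch below derives those rates from the "1 paragraph" row (~100 tokens costing 2.5, 10, and 50). The rates and the `input_cost` helper are assumptions derived from that row, not an official price list.

```python
# Input-side cost per raw token, derived from the "1 paragraph" row
# (~100 raw tokens -> 2.5 / 10 / 50). Derived values, not official rates.
INPUT_RATE = {
    "Lite": 2.5 / 100,
    "Quick": 10 / 100,
    "Think": 50 / 100,
}

def input_cost(raw_tokens: float, tier: str) -> float:
    """Approximate input-side cost for a given number of raw tokens."""
    return raw_tokens * INPUT_RATE[tier]

# One page of text is roughly 700 raw tokens.
for tier in INPUT_RATE:
    print(tier, round(input_cost(700, tier), 2))
```

The derived rates reproduce the "1 page" and "1 image" rows of the table, which suggests the input side scales linearly with token count.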

🔢 How Token Usage Is Calculated

Every time Saifa AI processes a message, it uses tokens — small units of text that represent both the input and the output of a conversation.

What counts as tokens?

Each AI request includes:

  • Input tokens 📥: Everything sent to the AI, including:

    • Customer message

    • Previous conversation (chat history)

    • System instructions

    • Connected knowledge or data

  • Output tokens 📤: The response generated by the AI
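The breakdown above can be sketched as a small data structure. The class, its field names, and the sample numbers are hypothetical and chosen only to illustrate how the parts add up to a 250 in / 150 out request.

```python
from dataclasses import dataclass

@dataclass
class AIRequest:
    """Token counts for one AI request (all values are illustrative)."""
    customer_message: int
    chat_history: int
    system_instructions: int
    knowledge: int
    output: int

    @property
    def input_tokens(self) -> int:
        # Everything sent to the AI counts as input.
        return (self.customer_message + self.chat_history
                + self.system_instructions + self.knowledge)

    @property
    def total_tokens(self) -> int:
        return self.input_tokens + self.output

request = AIRequest(customer_message=20, chat_history=120,
                    system_instructions=80, knowledge=30, output=150)
print(request.input_tokens, request.total_tokens)
```

Note that the customer's own message is often the smallest part of the input.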


💸 Why Token Usage Can Increase

Token usage may vary from message to message. Here are the main reasons:

1. Longer conversations = more context

As conversations continue, more previous messages are included to maintain context. 👉 This increases input tokens, even if the latest message is short.
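A short simulation shows the effect. It assumes, for illustration only, that the full chat history is resent on every turn and that every message and reply has the same size; the document does not state how much history Saifa AI actually includes.

```python
def input_tokens_per_turn(system_tokens: int, message_tokens: int,
                          reply_tokens: int, turns: int) -> list[int]:
    """Input tokens for each turn when all earlier messages and replies are resent."""
    history = 0
    per_turn = []
    for _ in range(turns):
        per_turn.append(system_tokens + history + message_tokens)
        # The customer message and the AI reply both join the history.
        history += message_tokens + reply_tokens
    return per_turn

print(input_tokens_per_turn(system_tokens=80, message_tokens=20,
                            reply_tokens=100, turns=4))
```

Under these assumptions the input grows by a fixed amount each turn even though every customer message stays the same length.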


2. More detailed responses = more output tokens

If the AI provides longer or more detailed answers, it uses more tokens. 👉 Output tokens are more expensive because they require more processing.


3. Language differences

Some languages (like Thai) use more tokens per sentence than English. 👉 The same message may cost more depending on the language used.


4. Complex requests use more AI processing

More advanced tasks (e.g. explanations, recommendations, or analysis) require:

  • More context

  • Longer responses

👉 This results in higher total token usage.


⚙️ How Saifa Calculates Cost

Saifa uses a weighted model:

  • Input tokens = standard cost

  • Output tokens = higher cost (typically 4–6× more)

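So total usage is roughly:

Total usage ≈ (input tokens × input rate) + (output tokens × output rate), where the output rate is typically 4–6× the input rate.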
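As a rough illustration of the weighting, the sketch below expresses usage in input-token equivalents, before any tier rate is applied. The function name and the default multiplier of 5 are assumptions; only the 4–6× range comes from this page.

```python
def weighted_usage(input_tokens: int, output_tokens: int,
                   output_multiplier: float = 5.0) -> float:
    """Usage in input-token equivalents: output tokens are weighted more heavily."""
    return input_tokens + output_tokens * output_multiplier

# A call with 250 input tokens and 150 output tokens, at both ends of the range.
low = weighted_usage(250, 150, output_multiplier=4)
high = weighted_usage(250, 150, output_multiplier=6)
print(low, high)
```

Because of the weighting, the 150 output tokens account for most of the total even though the input is larger.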


Top-Level Metrics

Three summary cards at the top of the page show your workspace's current token consumption at a glance.

Total Tokens Used

The cumulative number of tokens your workspace has consumed since the start of the current billing period, shown alongside the percentage of your plan limit used.

Example: 10,617 tokens used = 10.6% of limit
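The percentage is simply usage divided by the plan limit. In the sketch below, the limit of 100,000 tokens is an inference from the example (10,617 tokens shown as 10.6%), not a documented plan size, and the function name is hypothetical.

```python
def percent_of_limit(used: int, limit: int) -> float:
    """Share of the plan limit consumed, rounded to one decimal place."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    return round(used / limit * 100, 1)

print(percent_of_limit(10_617, 100_000))
```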


Avg Daily Tokens

The average number of tokens consumed per day over the last 7 days, with a percentage change compared to the previous 7-day period.

Use this to spot unusual spikes — a sudden increase may indicate a new integration going live, a high-volume campaign, or unexpected AI usage.


Transaction of Token

The number of individual AI API calls made in the last 30 days, with a percentage change compared to the previous 30-day period.

Each transaction represents one complete AI request-and-response cycle — for example, one AI auto-reply sent to a customer, or one Saifa Assistant query.


Token Usage Limit

A progress bar below the summary cards shows your current usage against your plan's total token limit.

When you approach your limit, Saifa AI will notify you. If you exceed your limit, AI features may pause until your next billing cycle or until you upgrade your plan.

📌 To increase your token limit, go to Settings → Subscription Plan.
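If you track usage outside the dashboard, a simple status check might look like the sketch below. The 80% warning level mirrors the tip at the end of this article; the point at which Saifa AI itself sends notifications is not stated, so treat the threshold and the status labels as assumptions.

```python
def usage_status(used: int, limit: int, warn_ratio: float = 0.8) -> str:
    """Classify usage against the plan limit."""
    ratio = used / limit
    if ratio >= 1.0:
        return "limit reached"   # AI features may pause
    if ratio >= warn_ratio:
        return "review plan"     # time to look at Settings → Subscription Plan
    return "ok"

print(usage_status(10_617, 100_000))
print(usage_status(85_000, 100_000))
```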


Usage Trends Tab

Path: Token Usage → Usage Trends

The Usage Trends tab shows token consumption and costs over time in the charts described below.

Monthly Token Usage

An area chart showing total tokens consumed per month over the last 7 months. Hover over any month to see the exact token count.

Use this to identify which months had high AI activity and correlate spikes with campaigns, product launches, or team growth.

Toggle between:

  • Token Usage — volume of tokens consumed

  • Compare — overlay with the previous equivalent period


Monthly Costs

A bar chart showing API cost in USD per month over the last 7 months. Hover over any bar to see the exact cost for that month.

Use this alongside Monthly Token Usage to understand your cost efficiency — if token usage increased but cost stayed flat, your conversations are using lighter model tiers effectively.
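One way to compare the two charts is to compute cost per 1,000 tokens for each month. The monthly figures below are invented for illustration, and the helper name is hypothetical.

```python
def cost_per_1k_tokens(tokens: int, cost_usd: float) -> float:
    """USD cost per 1,000 tokens for one month."""
    return cost_usd / tokens * 1000

# (month, tokens, cost in USD): illustrative values only
monthly = [("Month 1", 40_000, 2.00), ("Month 2", 60_000, 2.10)]

for month, tokens, cost in monthly:
    print(month, round(cost_per_1k_tokens(tokens, cost), 3))
```

A falling cost per 1,000 tokens while volume rises is consistent with more traffic being handled by lighter tiers.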


Daily Usage (Last 7 Days)

A line chart showing token consumption by day of the week for the past 7 days. Use this to identify your busiest days and ensure you have adequate AI capacity during peak periods.


Mode Breakdown Tab

Path: Token Usage → Mode Breakdown

The Mode Breakdown tab shows how tokens are distributed across the different Saifa AI features.

Usage by Mode

A donut chart and detailed breakdown table showing token consumption split by feature:

| Mode | What It Covers |
| --- | --- |
| General Chat | Saifa Assistant conversations: writing, planning, analysis |
| Thunderstorm | Instant content generation feature |
| Customer Support Reply | AI auto-replies sent to customers in the inbox |
| Customer Support Suggestion | AI reply suggestions shown to agents (AI Assist ✨) |

Understanding your mode distribution helps you identify which features consume the most tokens and whether usage matches your expectations.
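The distribution shown in the donut chart is each mode's share of total tokens. The sketch below uses the mode names from the table with invented token counts; the function name is hypothetical.

```python
def mode_shares(usage_by_mode: dict[str, int]) -> list[tuple[str, float]]:
    """Each mode's percentage of total tokens, largest first."""
    total = sum(usage_by_mode.values())
    shares = [(mode, round(tokens / total * 100, 1))
              for mode, tokens in usage_by_mode.items()]
    return sorted(shares, key=lambda item: item[1], reverse=True)

usage = {  # illustrative token counts
    "General Chat": 2000,
    "Thunderstorm": 500,
    "Customer Support Reply": 6000,
    "Customer Support Suggestion": 1500,
}

for mode, share in mode_shares(usage):
    print(f"{mode}: {share}%")
```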


User Request Table

Below the donut chart, a table shows token usage broken down by individual workspace member:

| Column | Description |
| --- | --- |
| User | Team member name |
| Total Request | Number of AI requests made by this user |
| Last Request | Date and time of their most recent AI interaction |

Use this to understand which team members are actively using AI features and whether usage is distributed appropriately across your team.
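If you copy the table into a script, two quick checks are each member's share of requests and who has not used AI recently. The sample rows, the 14-day cutoff, and the function names are all hypothetical.

```python
from datetime import datetime, timedelta

# (user, total requests, last request): illustrative rows
rows = [
    ("Agent A", 420, datetime(2024, 5, 20, 9, 30)),
    ("Agent B", 35, datetime(2024, 5, 19, 16, 5)),
    ("Agent C", 5, datetime(2024, 4, 2, 11, 0)),
]

def request_share(rows) -> dict[str, float]:
    """Each user's percentage of all AI requests."""
    total = sum(count for _, count, _ in rows)
    return {user: round(count / total * 100, 1) for user, count, _ in rows}

def inactive_users(rows, now: datetime, days: int = 14) -> list[str]:
    """Users whose last request is older than the cutoff."""
    cutoff = now - timedelta(days=days)
    return [user for user, _, last in rows if last < cutoff]

now = datetime(2024, 5, 21, 12, 0)
print(request_share(rows))
print(inactive_users(rows, now))
```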


Recent Activity Tab

Path: Token Usage → Recent Activity

The Recent Activity tab shows a log of individual API calls — every AI request made in your workspace, listed in reverse chronological order.

Recent API Requests Table

| Column | Description |
| --- | --- |
| Mode | The feature and sub-mode used (e.g. chat / general_chat) |
| Tokens | Total tokens consumed by this request (input + output) |
| Cost | USD cost of this individual API call |
| Datetime | Exact date and time the request was made |
| Status | Success (completed) or error status |

Use this log to audit specific requests, investigate unexpected token spikes, or verify that all calls completed successfully.
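An audit of this kind can be sketched in a few lines. The dictionary keys, sample values, and the "twice the average" spike rule are assumptions made for illustration; the page does not describe an export format.

```python
# Illustrative log entries; field names are hypothetical.
log = [
    {"mode": "chat / general_chat", "tokens": 420, "cost": 0.0021, "status": "success"},
    {"mode": "chat / general_chat", "tokens": 3900, "cost": 0.0195, "status": "success"},
    {"mode": "chat / general_chat", "tokens": 0, "cost": 0.0, "status": "error"},
]

def failed_requests(log: list[dict]) -> list[dict]:
    """Requests that did not complete successfully."""
    return [entry for entry in log if entry["status"] != "success"]

def token_spikes(log: list[dict], factor: float = 2.0) -> list[dict]:
    """Requests that used more than `factor` times the average token count."""
    average = sum(entry["tokens"] for entry in log) / len(log)
    return [entry for entry in log if entry["tokens"] > factor * average]

print(len(failed_requests(log)))
print([entry["tokens"] for entry in token_spikes(log)])
```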

You can control how many entries appear per page using the Show dropdown, and paginate through the full history using the arrows at the bottom right.


Tips for Managing Token Usage

Check Usage Trends monthly — compare your current month to the previous one to catch unexpected growth early, before you hit your limit.

Use Mode Breakdown to optimize costs — if Customer Support Reply is consuming most of your tokens, review whether your Knowledge Base can be made more concise so AI gives shorter, more efficient answers.

Monitor Daily Usage during campaigns — if you're running a promotion or product launch, check daily usage that week to ensure you won't hit your limit mid-campaign.

Review User Request to identify heavy users — if one team member accounts for a disproportionate share of General Chat tokens, check whether they're using Saifa Assistant efficiently or running unnecessary queries.

Upgrade before you hit the limit — don't wait for AI to pause. When your usage bar reaches 80%, go to Settings → Subscription Plan to review your options.
