The Grok API, developed by xAI, provides access to the powerful Grok 4 and Grok 4 Heavy large language models. Built for advanced reasoning, multimodal understanding, and long-context tasks, Grok’s pricing model is designed to accommodate a wide range of developers—from independent builders to large-scale enterprise teams.
This article breaks down the latest Grok API pricing structure (as of July 2025), including token-based costs, subscription tiers, usage considerations, and how it compares to other major LLM APIs like GPT-4o and Claude 4.
Grok API pricing is usage-based, calculated per million tokens. Charges apply to both input tokens (what you send to the API) and output tokens (what the model returns).
Token Type | Price per 1M Tokens |
---|---|
Input Tokens | $3.00 |
Output Tokens | $15.00 |
Cached Input Tokens | $0.75 |
Input Tokens: Charged at $3.00 per million tokens submitted in your request.
Output Tokens: Charged at $15.00 per million tokens generated by Grok 4.
Cached Input Tokens: If you reuse recently processed prompts, you're charged only $0.75 per million tokens—a significant discount.
xAI offers two main subscription tiers tailored to different usage levels:
Full access to Grok 4 API
Increased daily and monthly usage limits
Access to the full 256,000-token context window
Suitable for developers, startups, and product teams
Access to Grok 4 Heavy, the most advanced variant with multi-agent collaboration
Higher usage quotas
Early access to new features and experimental tools
Dedicated technical support and SLAs
Ideal for enterprise and AI infrastructure teams
Model | Context Window | Input Price | Output Price | Cached Input Price |
---|---|---|---|---|
Grok 4 | 256,000 tokens | $3.00 / 1M | $15.00 / 1M | $0.75 / 1M |
Grok 3 | 131,072 tokens | $3.00 / 1M | $15.00 / 1M | N/A |
Grok 3-mini | 131,072 tokens | $0.30 / 1M | $0.50 / 1M | N/A |
Grok 4 is best for long-form, reasoning-intensive applications like legal parsing, coding assistants, and research tools.
Grok 3-mini is optimized for lightweight tasks, offering budget-friendly performance at scale.
Some users may receive monthly free credits for testing and low-scale development.
Ideal for experimentation and prototyping without initial costs.
Custom pricing and volume discounts are available for large-scale or high-volume use cases.
Enterprises using the SuperGrok Heavy tier benefit from SLAs, support agreements, and white-glove onboarding.
Compared to top-tier competitors, Grok 4 offers a cost-effective blend of scale and intelligence:
Model | Input | Output | Max Context | Function Calling | Multimodal | Cached Input |
---|---|---|---|---|---|---|
Grok 4 | $3.00 / 1M | $15.00 / 1M | 256,000 tokens | ✅ Yes | ✅ Yes (vision) | ✅ Yes |
GPT-4o | ~$5.00 / 1M | ~$15.00 / 1M | 128,000 tokens | ✅ Yes | ✅ Yes | ❌ No |
Claude 3 Sonnet | $3.00 / 1M | $15.00 / 1M | 200,000+ tokens | ⚠️ Limited | ❌ No | ❌ No |
Key Advantage of Grok:
The cached input pricing and 256K token context window make Grok 4 especially efficient for long documents, repeat queries, or real-time data workflows.
User Type | Recommended Plan |
---|---|
Solo developers / hobbyists | Basic plan + per-token pricing |
Startups & SaaS teams | SuperGrok ($300/year) |
Enterprises & platforms | SuperGrok Heavy ($3,000/year) |
In 2025, Grok API offers both free-tier access and premium subscription plans designed to match different usage levels:
Plan | Access Level | Ideal For |
---|---|---|
Free Tier | Limited usage, no subscription | Hobbyists, light testing |
SuperGrok ($300/year) | Full Grok 4 access, higher limits | Developers, startups, SaaS tools |
SuperGrok Heavy ($3,000/year) | Access to Grok 4 Heavy + dedicated support | Enterprises, AI platforms, research labs |
Free Plan Limitations:
Access to Grok 3 or 3-mini (not Grok 4)
Lower rate limits
No access to function calling, multimodal features, or extended context window
Premium tiers unlock Grok 4, support 256,000-token contexts, offer enterprise features, and come with higher quotas and early access to new capabilities.
Grok 4’s pricing is primarily token-based, meaning your costs are driven by how much data you send and receive. The main cost drivers include:
Input Tokens: $3.00 per million tokens (text you send to the API)
Output Tokens: $15.00 per million tokens (responses generated by Grok 4)
Cached Input Tokens: $0.75 per million tokens (for repeated or stored prompts)
Additional cost factors:
Subscription plan (SuperGrok vs. Heavy)
Volume of requests
Length of prompts/responses
Use of real-time tools or streaming features
Whether you use direct access (xAI) or proxies (OpenRouter, CometAPI)
The pricing gap reflects the computational complexity involved:
Input tokens are relatively simple to process—Grok only needs to read and contextualize them.
Output tokens require model inference, reasoning, and generation, which is significantly more compute-intensive.
Hence, output is billed at $15.00 per million tokens, while input is only $3.00, and cached input (already processed text) is even lower at $0.75.
This model encourages developers to optimize responses, reuse prompt templates, and minimize unnecessary output generation.
For large-scale projects (e.g., AI platforms, autonomous agents, or customer-facing chat apps), enterprise-level Grok API pricing offers both opportunity and risk:
SuperGrok Heavy Plan ($3,000/year) unlocks Grok 4 Heavy with parallel tool use and multi-agent coordination.
Custom pricing discounts for high-volume users.
Dedicated support, SLAs, and higher reliability for mission-critical systems.
More stable cost-per-performance at scale due to predictable billing structure.
High output token costs may require aggressive optimization in generation-heavy workflows.
You’ll need to architect token-efficient prompts and implement caching or token reuse where possible.
For large-scale use, success comes from understanding usage patterns, optimizing prompt design, and potentially working with xAI on custom enterprise agreements.
While xAI has not publicly announced changes beyond July 2025, several factors may influence future pricing:
Rollout of Grok 4 Image generation or voice features may introduce new token types or costs.
Introduction of tiered pricing based on latency or SLA level.
Adjustment to match OpenAI’s GPT-5 or Anthropic Claude pricing.
Token bundling plans or pre-purchased credits at a discount.
Loyalty programs or discounts for long-term users.
Better compression, fine-tuning, or smart caching may reduce token usage over time.
Developers with long-term projects should monitor pricing updates via:
xAI's email communications or documentation changelogs
The Grok API pricing model is built to support flexible and scalable AI development—from early-stage experimentation to production-grade systems. With competitive token costs, generous context limits, and optional enterprise features, Grok 4 delivers excellent value for both small developers and large organizations.
If you're looking to build advanced AI agents, research assistants, or automated systems with deep reasoning and real-time multimodal capabilities, Grok 4 offers a pricing structure that supports your vision.