xAI introduces Grok 4 Fast, an artificial intelligence model that improves token efficiency by 40% and reduces costs by 98%, combining large context window, unified reasoning and non-reasoning architecture, web and X search integration.
xAI has announced Grok 4 Fast, a new version of its language model that seeks to maintain Grok 4's performance with greater efficiency in token usage. According to the company, Grok 4 Fast uses 40% fewer tokens compared to its predecessor, translating into cost reductions of up to 98% in their internal benchmarks.
The model incorporates a 2-million-token context window, allowing it to handle large volumes of information in a single workflow. Its unified architecture combines extended reasoning modes and quick responses in one model, controlled through system instructions, without needing to alternate between different configurations.
Grok 4 Fast includes native tool integration capabilities, trained with reinforcement learning. It can perform web and X (formerly Twitter) searches, as well as process multimedia content, including images and videos, to synthesize information in real time. Additionally, the model allows controlled code execution according to system instructions.
Two variants are offered for developers: grok-4-fast-reasoning and grok-4-fast-non-reasoning, both with the 2-million-token context window. The model is available on the grok.com platform, including all users, and can also be integrated through OpenRouter, Vercel AI Gateway, and xAI API.
In independent LMArena evaluations, Grok 4 Fast reached first position in the Search Arena with 1163 Elo points and eighth place in the Text Arena, showing efficiency and performance comparable to Grok 4 in different test scenarios.
With this update, xAI seeks to offer a language model that combines efficiency, unification of reasoning modes, and search capabilities, maintaining performance standards similar to its previous version but with lower token consumption and greater accessibility for developers and users.
AI assistant developed by xAI that combines text processing and image generation. Integrated with X platform for real-time data access and DeepSearch function for advanced ...
15/01/2026
Replit has launched Mobile Apps on Replit, a feature that allows users to describe an idea, create the application, and publish it completely on the ...
14/01/2026
Google has introduced Personal Intelligence, a feature that allows Gemini to access information from applications like Gmail, Google Photos, and ...
07/01/2026
OpenAI has introduced ChatGPT Health, a dedicated experience that allows users to connect their medical records and wellness apps to obtain ...
05/01/2026
Amazon introduces Alexa.com, a new platform that brings its Alexa+ artificial intelligence assistant to web browsers and completes its multi-platform ...