models April 23, 2026 3 min read

Grok Voice Think Fast 1.0: xAI Speech Agent Tops τ-voice Bench by 20 Points

On April 23, 2026, xAI launched grok-voice-think-fast-1.0, scoring 67.3% on τ-voice Bench, a 20+ point lead over Gemini 3.1 Flash Live (43.8%) and GPT Realtime 1.5 (35.3%) in real-world voice scenarios.


AI DayaHimour Team

April 23, 2026

On April 23, 2026, xAI released grok-voice-think-fast-1.0, a voice agent that immediately took the top spot on τ-voice Bench with a 67.3% overall score. Gemini 3.1 Flash Live trailed at 43.8% and GPT Realtime 1.5 at 35.3%, leaving a gap of 23.5 percentage points to the nearest rival. The scores were measured under demanding real-world conditions: background noise, heavy accents, rapid interruptions, and natural turn-taking.

The previous model, grok-voice-fast-1.0, reached only 38.3%. The 29-point jump to 67.3% reflects an architectural change rather than incremental tuning: a “silent thinking” layer that reasons in the background without adding to response time.

Four Hard Problems Solved at Once

xAI structured the agent around four bottlenecks that have historically kept voice agents out of production:

Accurate data collection in chaos: Grok Voice Think Fast 1.0 reliably extracts email addresses, phone numbers, and account numbers even when the speaker is fast, heavily accented, or corrects themselves mid-statement. It handles these corrections the way a human listener naturally does.

Orchestrating 28 tools: A single agent can call up to 28 tools spanning hundreds of workflows, including service credits, device replacements, plan changes, and billing disputes. The caller never senses the backend complexity.

Hidden real-time reasoning: The model performs edge-case inference in the background before speaking, which sharply reduces confidently stated wrong answers. When asked “Which months contain the letter X?” it correctly answered “None,” while other models confidently guessed February.

Noise resilience: Tested over compressed phone codecs, street noise, and strong regional dialects, the model natively supports more than 25 languages, making it ready for global contact centers.
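The month question from the hidden-reasoning example is easy to verify by hand or in code. The snippet below is only a minimal sketch of that check, not a description of how the model reasons; the `MONTHS` list and the `months_containing` helper are illustrative names introduced here.

```python
# English month names, hardcoded so the result does not depend on locale settings.
MONTHS = [
    "January", "February", "March", "April", "May", "June",
    "July", "August", "September", "October", "November", "December",
]


def months_containing(letter: str) -> list[str]:
    """Return the month names that contain `letter`, ignoring case."""
    needle = letter.lower()
    return [month for month in MONTHS if needle in month.lower()]


print(months_containing("x"))  # []
print(months_containing("y"))  # ['January', 'February', 'May', 'July']
```

The empty list for "x" matches the answer "None" quoted above; "February" contains no "x", so a model that names it is guessing.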

τ-voice Bench Performance

The table below illustrates the gulf between Grok Voice Think Fast 1.0 and its competitors:

| Sector | Grok Voice Think Fast 1.0 | Gemini 3.1 Flash Live | GPT Realtime 1.5 |
|---|---|---|---|
| Telecom (plans, billing, troubleshooting) | 73.7% | 40.4% | 21.1% |
| Airline (bookings, delays, complex itineraries) | 66% | 64% | 36% |
| Retail (orders, returns, noise) | 62.3% | 45.6% | 38.6% |

In telecom, Grok leads Gemini by 33.3 points, a margin rarely seen in AI benchmarks. Airline is the closest contest: Gemini reaches 64%, yet Grok stays ahead at 66% on tasks that involve itinerary changes.
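The per-sector margins can be recomputed from the published scores. The sketch below uses only the figures reported in this article; the dictionary layout and the `lead_over_runner_up` helper are illustrative choices, not part of any benchmark tooling.

```python
GROK = "Grok Voice Think Fast 1.0"

# Scores in percent, copied from the figures reported in this article.
SCORES = {
    "Telecom": {GROK: 73.7, "Gemini 3.1 Flash Live": 40.4, "GPT Realtime 1.5": 21.1},
    "Airline": {GROK: 66.0, "Gemini 3.1 Flash Live": 64.0, "GPT Realtime 1.5": 36.0},
    "Retail": {GROK: 62.3, "Gemini 3.1 Flash Live": 45.6, "GPT Realtime 1.5": 38.6},
    "Overall": {GROK: 67.3, "Gemini 3.1 Flash Live": 43.8, "GPT Realtime 1.5": 35.3},
}


def lead_over_runner_up(row: dict[str, float]) -> float:
    """Return Grok's margin, in percentage points, over the best other model."""
    best_rival = max(score for model, score in row.items() if model != GROK)
    return round(row[GROK] - best_rival, 1)


for sector, row in SCORES.items():
    print(f"{sector}: +{lead_over_runner_up(row):.1f} points")
# Telecom: +33.3 points
# Airline: +2.0 points
# Retail: +16.7 points
# Overall: +23.5 points
```

The spread is wide: the headline lead is driven mostly by telecom, while the airline margin is only two points.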

[Chart: τ-voice Bench, April 2026. Grok Voice Think Fast 1.0 scores: Overall 67.3%, Retail 62.3%, Airline 66%, Telecom 73.7%.]

xAI did not wait for market validation: it deployed the model into Starlink’s live operations. The phone line +1 (888) GO STARLINK now uses Grok for sales and customer support, with the following reported results:

  • A 20% sales conversion rate within the call itself.
  • A 70% self-resolution rate for support inquiries.

These metrics move voice agents from pilot-stage promise to full-scale operation. At a time when call centers worldwide are under increasing strain, a model handling hundreds of workflows with a single orchestration layer is a strategic necessity, not a luxury.

Pricing: $0.05/min with OpenAI Realtime API Compatibility

The model costs $0.05 per minute ($3.00 per hour) through wss://api.x.ai/v1/realtime, roughly half the price of the OpenAI Realtime API. It is compatible with the OpenAI Realtime API protocol, so developers can switch with minimal changes. Teams can run up to 100 concurrent sessions, each limited to 30 minutes. The previous grok-voice-fast-1.0 is now deprecated, and migration to the new model is recommended.
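For capacity planning, the published price and limits can be captured in a few lines. This is a rough sketch under stated assumptions: the `VoicePlan` class and its helpers are hypothetical names introduced here, billing is assumed to be linear in minutes with no rounding rules, and a call longer than 30 minutes is assumed to need a fresh session. None of this is confirmed by xAI's documentation.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class VoicePlan:
    """Figures quoted in the article; the class itself is illustrative only."""
    url: str = "wss://api.x.ai/v1/realtime"
    model: str = "grok-voice-think-fast-1.0"
    price_per_minute_usd: float = 0.05
    max_session_minutes: int = 30
    max_concurrent_sessions: int = 100


def estimate_cost(plan: VoicePlan, minutes: float) -> float:
    """Estimated cost in USD, assuming simple linear per-minute billing."""
    return round(minutes * plan.price_per_minute_usd, 2)


def sessions_needed(plan: VoicePlan, minutes: float) -> int:
    """Sessions required for one call, assuming a reconnect at the session limit."""
    return max(1, math.ceil(minutes / plan.max_session_minutes))


def fits_concurrency(plan: VoicePlan, simultaneous_calls: int) -> bool:
    """True if the peak number of simultaneous calls stays within the team limit."""
    return simultaneous_calls <= plan.max_concurrent_sessions


plan = VoicePlan()
print(estimate_cost(plan, 60))      # 3.0
print(sessions_needed(plan, 45))    # 2
print(fits_concurrency(plan, 120))  # False
```

A contact center peaking above 100 simultaneous calls would need to queue callers or arrange a higher limit, so the concurrency check is worth running against real traffic data before migrating.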

A New Baseline for Voice Agents

Eighteen months after entering voice, xAI has not just caught up; it has built a tangible lead. The telecom results, in particular, foreshadow a future where traditional call centers become the exception. Whether competitors erase the 20-point gap will define the next chapter, but the combination of hidden reasoning, aggressive pricing, and live Starlink validation makes grok-voice-think-fast-1.0 feel less like a benchmark champion and more like an operational turning point.
