models April 23, 2026 3 min read

Grok Voice Think Fast 1.0: xAI Speech Agent Tops τ-voice Bench by 20 Points

On April 23, 2026, xAI launched grok-voice-think-fast-1.0, scoring 67.3% on τ-voice Bench, a 20+ point lead over Gemini 3.1 Flash Live (43.8%) and GPT Realtime 1.5 (35.3%) in real-world voice scenarios.

فريق AI DayaHimour

April 23, 2026

Grok Voice Think Fast 1.0: xAI Speech Agent Tops τ-voice Bench by 20 Points

On April 23, 2026, xAI released grok-voice-think-fast-1.0, a voice agent that not only launched but immediately dominated the τ-voice Bench with a 67.3% overall score. Gemini 3.1 Flash Live trailed at 43.8% and GPT Realtime 1.5 at 35.3% — a gap of more than 23 percentage points. Those numbers were produced under the worst possible real-world conditions: background noise, heavy accents, rapid interruptions, and natural turn-taking.

The previous model, grok-voice-fast-1.0, reached only 38.3%. The leap to 67.3% is architectural, not incremental, thanks to a “silent thinking” layer that leaves response time untouched.

Four Hard Problems Solved at Once

xAI structured the agent around four bottlenecks that have historically kept voice agents out of production:

Accurate collection in chaos — Grok Voice Think Fast 1.0 reliably extracts email addresses, phone numbers, and account numbers even when the speaker is fast, heavily accented, or corrects themselves mid-statement. It handles these corrections the way a human listener naturally does.

Orchestrating 28 tools — A single agent can call up to 28 tools spanning hundreds of workflows: service credits, device replacements, plan changes, billing disputes. The caller never senses the backend complexity.

Hidden real-time reasoning — The model performs edge-case inference in the background before speaking, virtually eliminating hallucinated confidence. When asked “Which months contain the letter X?” it correctly answered “None,” while other models confidently guessed February.

Noise resilience — Tested over compressed phone codecs, street noise, and sharp dialects, the model supports more than 25 authentic languages, making it ready for global contact centers.

τ-voice Bench Performance

The table below illustrates the gulf between Grok Voice Think Fast 1.0 and its competitors:

Sector	Grok Voice Think Fast 1.0	Gemini 3.1 Flash Live	GPT Realtime 1.5
Telecom (plans, billing, troubleshooting)	73.7%	40.4%	21.1%
Airline (bookings, delays, complex itineraries)	66%	64%	36%
Retail (orders, returns, noise)	62.3%	45.6%	38.6%

In telecom, a 33-point gap is a rarity in AI benchmarks. Even in airline — where Gemini comes close at 64% — Grok stayed ahead under the pressure of itinerary changes.

τ-voice Bench — April 2026

τ-voice Overall 67.3%

Retail 62.3%

Airline 66%

Telecom 73.7%

Overall

Retail

Airline

Telecom

Field-Tested with Starlink

xAI didn’t wait for market validation — it deployed the model into Starlink’s live operations. The phone line +1 (888) GO STARLINK now uses Grok for sales and customer support, delivering:

20% sales conversion during the call itself.
70% self-resolution rate for support inquiries.

These metrics move voice agents from pilot-stage promise to full-scale operation. At a time when call centers worldwide are under increasing strain, a model handling hundreds of workflows with a single orchestration layer is a strategic necessity, not a luxury.

Pricing: $0.05/min with OpenAI Realtime API Compatibility

The model costs $0.05 per minute ($3.00 per hour) through wss://api.x.ai/v1/realtime, roughly half the price of the OpenAI Realtime API. It is compatible with the OpenAI Realtime API standard, allowing developers to switch with minimal changes. Teams get up to 100 concurrent sessions and a 30-minute session limit. The previous grok-voice-fast-1.0 is now deprecated, and migration to the new model is recommended.

A New Baseline for Voice Agents

Eighteen months after entering voice, xAI didn’t just catch up — it built a tangible lead. The telecom results, in particular, foreshadow a future where traditional call centers become the exception. Whether competitors erase the 20-point gap will define the next chapter, but the combination of hidden reasoning, aggressive pricing, and live Starlink validation makes grok-voice-think-fast-1.0 feel less like a benchmark champion and more like an operational turning point.

Grok Voice Think Fast 1.0xAIvoice agentτ-voice Bench2026

Share this article:

Grok Voice Think Fast 1.0: xAI Speech Agent Tops τ-voice Bench by 20 Points

Four Hard Problems Solved at Once

τ-voice Bench Performance

Field-Tested with Starlink

Pricing: $0.05/min with OpenAI Realtime API Compatibility

A New Baseline for Voice Agents

Related Articles

Comprehensive Comparison of the Most Powerful AI Models in 2026: GPT-5.4 vs Claude Opus 4.6 vs Gemini 3.1 vs Grok 4 vs DeepSeek V4

GPT-5.5: OpenAI's Smartest Model Redefines Agentic Work

Midjourney v8 — The Release That Settled the Debate: 'Have Synthetic Images Reached Cinema Quality?'