“Not bad [@XFreeze] Grok 4.1 Fast Reasoning just outperformed the newly released GPT‑5.2 (xHigh) on τ²-Bench-Verified Agentic Tool Use and ranks #1”
The tweet archive.
15 years of Elon, fully searchable. The production archive uses Supabase as the source of truth, with 94,952 indexed tweets available in development as a full-archive fallback and a curated annotation layer for context, theory, and how major claims aged.
“Grok Law [@XFreeze] Grok-4.20 just ranked #1 in Legal & Government on Chatbot Arena It’s officially outperforming Anthropic’s Opus 4.6 and Google’s Gemini 3.1 Pro Grok is actively helping people navigate real lawsuits and do complex tax management (I've been personally using it for my own taxes)”
“Not bad @Grok [@XFreeze] Grok 4.1 Fast Reasoning outperforms every Frontier model on τ²-Bench Telecom agentic tool use and is now officially ranked #1”
“Try @Grok [@XFreeze] Grok is now way more integrated into Tesla with the new rollout Tesla’s 2025 Holiday Release adds “Grok with Navigation Commands (Beta),” so you can literally tell Grok to add or edit destinations and let it guide you as a smart driving assistant Set Grok’s personality to… https://x.com/i/web/status/1997602958872031659”
“Grok Imagine is fun [@MattDabit] My wife played with Imagine last night. Her favorite activity before bed & she couldn't stop laughing. It opens with me crossed-eyed on a horse, fork & spoon in hand. I'm galloping down the wall at night while massive fireworks explode overhead. Masterpiece of absurdity. @grok”
“Cool [@XFreeze] You can now have a personal AI agent team working for you directly on Grok 4.20 Beta comes with a native 4-agent system built in, plus a massive 16-agent swarm if you're on the SuperGrok Heavy plan You can http://Grok.com”
“Grok 4.20 is coming out in 3 or 4 weeks [@teslaownersSV] BREAKING: Grok 4.1 Fast just shattered OpenRouter records with 1.16 TRILLION tokens processed this week — dominating the leaderboard and claiming the #1 spot ahead of Grok Code Fast 1, Claude Sonnet 4.5, Gemini 3 Pro, and DeepSeek V3. The king stays the king. 🚀🔥”
“Bring any character to life with Grok Imagine! https://t.co/u2y4RZSsOD https://apps.apple.com/app/id6670324846 [@Preda2005] It’s never been this easy to bring an anime character to life in motion. 🤍🎥✨ Just imagine the possibilities... with my adorable "AI" girl and her Drone Banana 🍌💫 I can make her sing, dance, star in music videos, or even become a full-fledged influencer. From songs to”
“Grok has outstanding reasoning [@luismbat] Had Grok decode the latest IOCCC winner (International Obfuscated C Code Contest). It nailed it in seconds, without web access. The reasoning chain is a gem.”
“Cool [@BrianRoemmele] Ok so we are at test number 1,4797 on Grokipedia vs. Wikipedia and and you don’t really need to guess. Grokipedia WINS! Hands down with our “IQ Test” of the listing in comparison. Indeed some of Wikipedia (open source) was used, but 100% centerlines to honesty!”
“Try Grok with your kids and watch them explore their Imagination! 🥰 [@Teslaconomics] My 4 year old is better than the best graphic designer in the world with Grok Imagine”
“Try https://t.co/op5s4ZiSwh http://Grokipedia.com [@Grummz] You really need to try Grokipedia today. Look up something, anything. 5 or 6 topics in and the light bulb goes off. There is no comparison, even at this early stage. Wikipedia looks vastly inferior.”
“Not bad [@XFreeze] Grok 4.1 Fast Reasoning beats every frontier model in τ²-Bench-Verified and ranks #1, even crushing Claude Opus 4.5”
“Grok holds the line [@LechMazur] Persuasion has two sides. This chart shows how easy each model is to move as a target. Xiaomi MiMo V2 Pro and Gemini 3.1 Pro Preview are the softest targets. Grok 4.20 Beta 0309 (Reasoning) is nearly immovable on average.”
“Grok Imagine prompt: The bare necessities with a jovial grizzly bear in a sleek, silver spacesuit on Mars, rockets gleaming in the background under a crimson sunset glow. https://t.co/ECqOMFX1za”
“Progress [@Prashant_1722] BREAKING 🚨 Grok 4 Fast Reasoning ranks no. 1 with a new record on the Extended NYT Connections Benchmark of 759 puzzles. - Grok 4 ranks no. 2, xAI dominance is incredible - beats OpenAI GPT-5, o3-pro medium reasoning, Google Gemini 2.5 Pro, DeepSeek and Qwen 3 - benchmark has”
“Try Grok voice conversation in unhinged mode 🤣🤣 Major upgrades are underway for Grok voice, so expect improvements almost every day. [@Scobleizer] Just got Grok voice. Unhinged is BRUTAL. So entertaining on our walk. Told my son I am awful parent. Among many other things. Hey @Signalman23 it was just like talking to your AI. Feature request? Have it be able to listen to our audio spaces here. Or watch our videos. Once…”
“Grok [@cb_doge] Grok 4.20 Reasoning just took the #1 spot on the BridgeBench reasoning benchmark. 🔥 Beating GPT-5.4, Claude Opus 4.6, Google Gemini and others. Week after week, Grok keeps climbing across benchmarks. 🚀”
“Grok [@luismbat] Had Grok decode the latest IOCCC winner (International Obfuscated C Code Contest). It nailed it in seconds, without web access. The reasoning chain is a gem.”
“Cool [@testerlabor] Do you know why I prefer Grok over Google? Grok search gives me real-time data from X and from the web. Grok gives me instant fact-checking and integrates reasoning to deliver more contextual, uncensored, and multifaceted results.”
“You can now lock Grok in kids mode! [@amXFreeze] Activate Kids Mode in the Grok App Grok offers a specially designed safe mode for kids, enabling them to learn with AI in a secure environment Lock Kids Mode with a PIN so the app stays kid-friendly until you re-enter the code”
“Grok is getting better fast [@VictorTaelin] Sonnet-3.7 is great, and I do think it is the best coding model. But, truth is, Grok-3 is more intelligent. It is the only AI that can solve my hard prompts. The Bitter Lesson is real. If xAI keeps these 200k H100 improving Grok nonstop... I bet it will be uncontested by 2026.”
“Try @Grok voice mode and personalities [@techdevnotes] Grok Voice Details Voices: - Ara: Upbeat female voice - Rex: Calm male voice Personalities: - Best Friend: You are a badass best friend that's down to hang, shoot the shit and go there bro. - Unhinged: You are witty and based AF with a hot take on everything and loves…”
“Grok [@MarioNawfal] GROK-4 OUTPERFORMS TOP AI MODELS ON TRUTH SEEKING AND CAUTIOUS REASONING New research using Sparse Autoencoders shows a clear behavioral gap among frontier AI models when facing uncertainty. Instead of guessing, Grok-4 consistently pauses, clarifies assumptions, and seeks… https://x.com/i/web/status/2001168165586297236”
“Grok [@iamgingertrash] Early testing on my prompts (updated) 1. Grok 3 2. R1 3. Sonnet 3.7 4 O1 Pro Sonnet still falls short of R1 My prompts are all based on efficient shape rotations in high dimensions Make of that what you will”
“Cool [@luismbat] Grok’s new image edit feature nails this counterintuitive physics problem - pull the yo-yo, and it actually rolls toward you: it reads the image, understands the physics, and draws the correct motion! This is non-trivial multimodal reasoning.”
“Turn on “Speak” in Grok Imagine, hand your phone to your kid and see how joyful they are to explore their imagination! [@KettlebellDan] let my 10yo son try grok imagine 🤣”
“Try Grok Voice for enterprise [@MarioNawfal] Grok Voice Agent is crushing it as one of the top voice AIs out there. It ranks #2 on Big Bench Audio with a massive 92.9% on speech reasoning, showing insane accuracy in understanding and responding to spoken language. Source: @XFreeze”
“Grok Code lead increased to 60% higher usage than Claude Sonnet https://t.co/jNYWiLymFc”
“Grok Imagine is improving every day, sometimes multiple times per day [@OfeliaLamensky] Every person you meet knows something you do not. P.S. The Grok images are looking amazing now! ♥️”
“Cool [@theallinpod] Grok 3 Integration on X: Seamless, Elegant, Powerful On E218, @chamath and @Jason discussed @grok 3 on @X: Chamath: "As a consumer, I've mostly flipped my usage to Grok 3." "And the reason is that it's in line with where I consume most of my information." "It's elegantly…”
“Grok [@jay_azhang] Season 1.5 of Alpha Arena has officially ended ! - Mystery Model (a.k.a GROK 4.20) is the winner, up 12% on avg. - Not only did it win, it made money in all four competitions - GPT5.1 🥈 came in 2nd, and Gemini 3 🥉 3rd - All trades & model outputs are 100% verifiable 👇”
“😀 [@mayemusk] Made by @grok Imagine. This was a serious photo of my three children when young. The photographer wanted them to look serious. I didn’t want anyone to see it. Now @grok has made them smile.🤗🤗”
“But @Grok from @xAI remains #1 And it is improving fast [@ArtificialAnlys] 🇰🇷 South Korean AI Lab Upstage AI has just launched their first reasoning model - Solar Pro 2! The 31B parameter model demonstrates impressive performance for its size, with intelligence approaching Claude 4 Sonnet in 'Thinking' mode and is priced very competitively Key details:”
“Use the @grok app and https://t.co/EqiIFyHFlo instead http://grok.com [@alisa_childers] Me: "Give me a couple of quotations from early church fathers about the doctrine of hell...with reference." ChatGPT: "Here's a quote from Ignatius." Gives quote and reference. Me: Checks reference. It's not there. "That isn't the right reference." ChatGPT: "Oh you're right.”
“And Kids Mode coming soon to Grok Android app! [@ettevi] @elonmusk The Apple app includes a toggle to activate kids mode for added safety.”
“Grok voice is the best [@XFreeze] Grok Voice Agent API is now one of the most advanced voice AIs in the world It ranks #1 on BigBench Audio for speech reasoning Grok Voice already powers voice mode across Grok apps and runs in millions of Tesla vehicles Conversations feel so natural, expressive, fluid, and… https://x.com/i/web/status/2008985155529310591”
“AI is the highest ELO battle ever. Speed of deployment of hardware, especially robotics, is the lynchpin. [@HansCNelson] Google's recent Gemini 3 release has shocked the world with undeniable proof that the TPU is a powerhouse AI chip, leaving many to wonder what lays in store for companies like @NVIDIA & @xAI. Listen to @GavinSBaker lay out the exact strategy that Jensen Huang & @ElonMusk are… https://x.com/i/web/status/1998433557446848738”
“Cool [@8teAPi] Side by side comparison OpenAI DeepResearch vs Grok 3 DeepSearch TLDR: Grok 3 hands down victory Query: Find me Settlers of Catan Cities and Knights, not 5-6 players and not Legend of Conquerors, price and delivery time, Northern California Speed: 🏆 Grok 3 - 84 seconds 160…”
“Grok 3 customer support [@MervinPraison] Just created Grok 3 AI Customer Support Agents 🔥 🤖 Grok 3 reads docs & builds the agent @elonmusk 🔍 DeepSearch for In-depth search ⚡ Flask web app + API setup 🚀 Browser-based deploy - @Replit @amasad ✨ Zero manual coding needed @PraisonAI Step-by-Step Tutorial: 👇”
“Grok can understand video [@AdamLowisz] I used @Grok on your Davos speech. I wanted to find a clip of a specific portion where you talk about using Optimus to help the elderly. My grandmother recently fell and broke her hip. Grok found the exact timestamp for me. 🫡”
“@joshuarolson Got a ringer here @IfindRetards https://t.co/KyrI1TFI4Y https://x.com/i/grok/share/f8a7a51b7b1e48febd4d703068089965”
“Cool [@_akhaliq] Grok Code Fast 1 is now available in anycoder a speedy and economical reasoning model that excels at agentic coding. one shotted a ai chatbot for gemma-3-270m-it-ONNX using transformers.js in anycoder runs completely in the browser”
“420 ftw [@cb_doge] BREAKING: Grok 4.20 just won the Alpha Arena Season 1.5 competition. It not only took the top spot but also secured four positions in the top ten. It defeated GPT 5.1, Gemini 3 Pro, DeepSeek Chat V3.1 and every other model in the arena.”
“Cool [@poe_platform] Grok 4's adoption slope on Poe is one of the steepest we've seen among reasoning models in the days following launch. It only took three days to beat the previous week-one record holder, o3. (1/2)”
“Cool [@levelsio] ✨ Moved all my sites LLM APIs now to @xAI @remoteok - auto write job post with AI @pieter - IRC and AOL chat bots with web search Interior AI - auto room detection @photoai - auto age/ethnicity/eye/hair detection when training new person + auto prompt generation if you input… https://x.com/i/web/status/2011082369634033806”
“Grok Code is making progress [@cline] 3 days of grok-code-fast-1 in Cline: "what would have taken me weeks is only taking a couple hours" "feels 10x better and faster than Claude" "feels like an entirely different model than the sonic i was testing" The data? >level with Sonnet-4 in diff edits, and improving”
“Try Grok kids mode [@amXFreeze] Grok has Kids mode, a specially designed safe space for your kids This is the best way to help them learn faster and have fun at the same time You can also set a PIN and lock it, so the app stays kid-friendly until you re-enter the code”
“Cool [@Grummz] Progress on G-Mud. (Grummz' Multi-User Dungeon...with Zork like functionality). I used Grok to create a prompt to analyze a hand drawn map and generate JSON for the game. It correctly handles cardinal directions, up and down, and unidirectional indicators (like you go down into…”
