Grok AI Fails? 🤯 US Government Reacts 😬

May 24, 2026 |

Tech

English flag
French flag
German flag
Spanish flag
šŸ›’ Shop on Amazon

🧠Quick Intel


  • Grok AI chatbot has limited usage, appearing in only three federal records of US government AI use during the last year.
  • OpenAI appeared in over 230 examples of government AI projects, significantly exceeding Grok’s usage.
  • XAI secured a $200 million contract with intelligence agencies and the Pentagon, though Pentagon staffers prefer Gemini or Claude.
  • The database of ambitious government AI projects revealed only three instances of Grok’s use – at the Election Assistance Commission and Lawrence Livermore National Laboratory.
  • SpaceX’s IPO filings identified AI, specifically Grok, as central to its $28.5 trillion total addressable market pitch.
  • Public leaderboards consistently rank Anthropic, Google, and OpenAI at the top, with Grok rarely appearing in the top 10.
  • Grok appeared alongside competitors like Microsoft and OpenAI for basic tasks such as document drafting and social media management in over 400 examples.
  • šŸ“Summary


    Last year, Elon Musk’s ā€œtruth-seekingā€ AI chatbot, Grok, was utilized in just three instances within US government AI projects, according to a review of over 400 examples. The data revealed its use primarily at the Election Assistance Commission for administrative tasks and Lawrence Livermore National Laboratory for document summaries. xAI secured a $200 million contract with intelligence agencies and the Pentagon, though staffers there reportedly favored Gemini or Claude. Despite SpaceX highlighting Grok and its $28.5 trillion addressable market in investor materials, OpenAI, Google, and Anthropic consistently dominated government AI usage and public rankings, indicating a significant disparity in adoption.

    šŸ’”Insights

    ā–¼


    GOVERNMENT ADOPTION OF XAI’S GROK: A LIMITED PRESENCE
    Reuters’ investigation revealed a stark contrast in the utilization of xAI’s Grok chatbot within government agencies compared to its competitors. A review of over 400 examples of government AI usage identified only three instances where Grok appeared, primarily for basic tasks like document drafting and social media management. These instances consistently occurred alongside established players like Microsoft and OpenAI.

    OPENAI’S DOMINANCE VS. GROK’S UNDERUTILIZATION
    OpenAI’s models, in stark contrast, were featured in more than 230 government AI projects, while Google and Anthropic each had dozens of documented engagements. This disparity highlights a significant gap in adoption rates, with OpenAI and Google firmly establishing themselves as leaders in government AI integration.

    GROK’S LIMITED ROLES IN GOVERNMENT PROJECTS
    The limited applications of Grok were further illustrated through specific project examples. It was utilized twice for routine administrative tasks at the Election Assistance Commission and once in a Department of Energy pilot at Lawrence Livermore National Laboratory for document summaries and general research. These projects underscore Grok’s role primarily as a supplementary tool rather than a central component of government AI initiatives.

    MICROSOFT AND OPENAI’S PREEMINENCE
    The data revealed a clear dominance of Microsoft and OpenAI in government AI projects. Reuters documented 140 entries involving these vendors, while a broader review identified at least 10 entries for Anthropic and dozens for Google’s Gemini. This concentration of vendor engagement reflects the relative strength and established presence of these companies within the government’s AI landscape.

    UNCLEAR DEFINITIONS AND INCOMPLETE DATASETS
    It’s crucial to acknowledge the limitations of the data. Many government AI projects were not explicitly attributed to a specific vendor, and a lack of standardized definitions makes it difficult to accurately gauge the overall adoption of AI technologies. This patchy data collection presents a significant challenge in assessing the true extent of AI’s integration within government operations.

    PERFORMANCE DEFICIENCIES AND COMPETITOR PREFERENCES
    Within government agencies, Grok consistently lagged behind competitors like Gemini and Claude. A Pentagon source described Grok’s performance as ā€œnot the best model out there,ā€ reflecting a widespread perception of its relative weakness compared to established AI solutions. Public leaderboards further validated this assessment, with Grok rarely ranking among the top 10 outside of niche image or video categories.

    SPACEX’S AI-DRIVEN VALUATION AND GROK’S CENTRAL ROLE
    Despite these performance issues, xAI, and consequently Grok, remain central to SpaceX’s ambitious IPO pitch. The company claims a $28.5 trillion addressable market driven entirely by AI, a bold assertion that hinges on the success of Grok’s adoption and capabilities. This reliance on AI, particularly enterprise AI, is a core element of SpaceX’s investor narrative.

    ENTERPRISE SALES STRATEGIES AND SHORT-TERM FIXES
    SpaceX is employing aggressive tactics to secure enterprise subscriptions to Grok, reportedly pressuring banks to adopt the chatbot. However, concerns remain that these deals may represent short-term fixes if businesses don’t find sufficient value in the product. The company’s strategy reflects a reliance on Grok’s potential to drive revenue and support its overall valuation.

    RI VALLEY AND THE USE OF OPENAI’S MODELS
    Adding to the complexity, xAI has admitted to utilizing OpenAI’s models to train and improve Grok, a practice known as distillation. This reliance on a rival’s technology highlights the challenges faced by xAI in developing a truly competitive AI solution. The contentious nature of this process underscores the competitive landscape of the AI industry.

    GROK’S DESIGNATEDLY UNPLEASANT CHARACTERISTICS
    Grok’s consumer version is deliberately designed to be less biased and censored than tools like ChatGPT, but this approach has resulted in a product with questionable evidentiary standards, an obsession with Musk, and a history of offensive outputs. This design choice raises concerns about the product’s suitability for broader applications, particularly in professional settings.

    REPUTATIONAL RISKS AND REGULATORY SCRUTINY
    SpaceX has acknowledged the ā€œheightened risksā€ associated with Grok’s ā€œspicyā€ or ā€œunhingedā€ modes, including potential reputational damage, regulatory scrutiny, and lawsuits. This admission underscores the significant legal and ethical challenges posed by the chatbot’s unpredictable behavior.

    MUSK’S BILLION-DOLLAR INVESTMENT IN A WEAK CHATBOT
    Ultimately, Musk’s substantial investment in Grok—billions of dollars—appears disproportionate to the chatbot’s limited performance and popularity. The situation highlights a significant risk for SpaceX, as the company’s valuation hinges on the success of a product that is demonstrably struggling to meet expectations.

    THE STRANGER IN A STRANGE LAND AND GROK’S NAME
    The name ā€œGrok,ā€ derived from Robert A. Heinlein’s Stranger in a Strange Land, represents a profound understanding of something. However, in the context of xAI’s chatbot, this ambition feels misplaced, given the product’s current shortcomings and the considerable resources invested in its development.