AI's Shocking Echoes 🤖🤯: Is It Real?

July 01, 2026 |

AI

🎧 Audio Summaries
English flag
French flag
German flag
Japanese flag
Korean flag
Mandarin flag
Spanish flag
🛒 Shop on Amazon

🧠Quick Intel


  • ChatGPT, Claude, and Gemini consistently return the number 7 when prompted with “Give me a random number between 1 and 10,” demonstrating predictable behavior.
  • Flint, developed by Springboards, generated a response of 3.7916 when prompted with “Give me a random number between 1 and 10,” differing significantly from the mainstream models.
  • Mainstream models like ChatGPT and Claude frequently predict specific brands (Toyota, Honda) when asked about car types, while Flint responded with Ford F-150 and Tesla.
  • In a study of 25 LLMs, 1,250 responses to the prompt “Write a metaphor about time” were a version of “Time is a river” or “Time is a weaver,” highlighting a significant degree of homogeneity.
  • ChatGPT generated a list of 56 band names, with “Glass” at the top, while Springboards’ Flint and Harbor identified “Static Empire,” “Neon Hearts,” and “Velvet Echo.”
  • Springboards’ tool, utilizing Flint alongside ChatGPT and Claude, was tested by Zoe Scaman of 77X, who noted it provided “completely different directions” and “catapults” users.
  • Flint is built on Qwen 3, an open-source model from Alibaba, reflecting Springboards' constraints regarding training a foundation model.
  • 📝Summary


    Researchers have observed a striking consistency in responses from prominent chatbots. When prompted with simple requests, such as generating a random number, models like Claude, ChatGPT, and Gemini repeatedly returned the number seven. This tendency was demonstrated by Australian startup Springboards, who developed an LLM called Flint, trained to offer more diverse answers. Flint consistently produced different results, including 3.7916, contrasting with the predictable responses of mainstream models. Further investigation revealed biases, with many chatbots favoring specific brands or offering repetitive metaphors, such as “Time is a river.” Springboards’ tool, utilizing models like ChatGPT and Flint, is gaining traction within creative industries, offering diverse textual directions.

    💡Insights



    THE LIMITATIONS OF CURRENT LARGE LANGUAGE MODELS
    Large language models, despite their impressive capabilities, exhibit a significant limitation: a tendency towards predictable and often repetitive responses. This stems from their training methodologies and the vast datasets they’re exposed to, leading to a form of “groupthink” that hinders genuine creativity and exploration. As highlighted by Springboards cofounder Pip Bingemann, these models are “stuck in a rut,” prioritizing established patterns over novel solutions. This predictability is particularly problematic when dealing with open-ended prompts requiring imaginative thinking, such as brainstorming vacation destinations or developing creative campaign taglines. The reliance on established patterns, exemplified by the near-universal response of 7 from chatbots when prompted with a random number, underscores this core limitation.

    FLINT: A STRATEGY FOR BREAKING THE CYCLE
    Recognizing the shortcomings of mainstream LLMs, the Australian startup Springboards developed Flint, an LLM specifically designed to generate a wider range of responses to open-ended questions. Flint’s training methodology diverges from the typical approach, prioritizing variety over adherence to established patterns. This difference is dramatically illustrated in the comparison between Flint and models like ChatGPT and Claude, where Flint produced a markedly different tagline (“Built to last, run to win”) for New Balance running shoes, rather than the more predictable options offered by the other models. The team’s success in winning the best paper award at NeurIPS further validates their approach, demonstrating a significant degree of innovation in addressing the homogeneity of responses within the broader LLM landscape. Kieran Browne, cofounder and CTO at Springboards, emphasizes that the design of chat interfaces can inadvertently reinforce this limited thinking, creating the illusion of personalized conversation while users are essentially receiving the same information as everyone else.

    APPLICATIONS AND THE EVOLUTION OF LLM TOOLING
    The limitations of current LLMs are beginning to attract significant attention within the AI community, as evidenced by the research paper “Artificial Hivemind.” This research revealed a remarkable convergence in responses across various LLMs when presented with open-ended prompts, highlighting the need for more diverse and adaptable models. Springboards is leveraging this insight by developing tools that integrate Flint alongside established models like ChatGPT and Claude, allowing creative professionals to select the most appropriate model for specific tasks. Zoe Scaman, founder of Bodacious, has found Flint particularly useful for “catapulting” herself into new creative directions, demonstrating its potential as a catalyst for innovation. While Flint isn't without its limitations—occasionally “falling over” with complex prompts—it represents a crucial step towards unlocking the full potential of LLMs and moving beyond the constraints of predictable, homogenized responses.

    THE FLINT MODEL: A UNIQUE APPROACH TO LARGE LANGUAGE MODELS
    Springboards has developed Flint, a novel large language model built upon the open-source Qwen 3 model from Alibaba. Recognizing the prohibitive cost of training a foundational model themselves, the team strategically focused on refining Qwen 3 to achieve a specific creative output. This approach avoids the broad, often unpredictable, adjustments made by other LLMs, specifically addressing the issue of temperature settings that can lead to incoherent responses. The core innovation lies in Flint’s ability to dynamically introduce variability only at critical points within its output, enhancing creative potential without sacrificing coherence.

    TARGETED RANDOMNESS AND OUTPUT CONTROL
    Traditional methods of controlling randomness in large language models, such as adjusting the “temperature” parameter, often result in undesirable outcomes – including model incoherence. Springboards recognized this limitation and shifted their focus to a more precise control mechanism. Instead of globally increasing randomness, Flint is programmed to identify specific points in its output where introducing a degree of unpredictability would be beneficial. For instance, when generating travel recommendations, Flint strategically increases randomness only when suggesting a destination, avoiding the disruption of coherent sentence construction. This targeted approach represents a significant departure from the blanket adjustments employed by models like OpenAI’s, allowing for a more nuanced and effective utilization of creative potential.

    APPLICATIONS AND CAUTIONS: FLINT’S ROLE IN ADVERTISING AND THE LIMITS OF AI OUTPUT
    Currently, Flint is primarily targeted toward the advertising and marketing sectors, aligning with Springboards’s existing customer base. Maximilian Weigl, co-founder of Uncommon, highlights Flint’s value in sparking novel ideas, noting its ability to “throw an oddball in,” prompting wider thinking. However, Weigl also cautions against over-reliance on any LLM, emphasizing the importance of human critical thinking and original thought. He observes that “most people are fine with good enough,” and that pushing for extreme outputs with tools like Flint is often unnecessary. Furthermore, Springboards stresses the importance of users maintaining their own voice and judgment, discouraging the uncritical adoption of AI-generated content, a sentiment echoed by Bingemann and Browne who advocate for independent thought and human-driven ideation.