OpenAI's Secrets 🤫: Danger & Billions 💰
OpenAI's Human Baseline Project Unveiled
OpenAI is working to establish a human performance baseline for evaluating its next-generation AI models. The company recently enlisted third-party contractors to supply realistic tasks and assignments mirroring those they performed in previous workplaces, according to records from OpenAI and training data company Handshake AI that were obtained by WIRED. The project aligns with OpenAI's broader effort to measure AI performance against human professionals across diverse industries, a step in its pursuit of Artificial General Intelligence, meaning an AI system capable of exceeding human performance at most economically valuable tasks. "We have hired individuals across various occupations to collect real-world tasks modeled after those you've completed in your full-time jobs, enabling us to accurately measure AI model performance on these tasks," reads a confidential OpenAI document.
Realistic Tasks: A Contractor-Generated Data Set
The core of OpenAI's approach involves contracting individuals to produce authentic, work-related documents and materials. Instead of simply summarizing their previous jobs, contractors are asked to provide concrete examples (such as Word documents, PDFs, PowerPoint presentations, Excel spreadsheets, images, or code repositories) that demonstrate their skills and capabilities. Presentation notes illustrate realistic scenarios, and contractors may also submit fabricated work examples designed to test and demonstrate AI model responses. OpenAI and Handshake AI declined to comment directly on the initiative, leaving its full scope and objectives unclear.
Specific Example: The Luxury Yacht Trip Scenario
One concrete example, shown during a presentation, is a task assigned to a "Senior Lifestyle Manager at a luxury concierge company for ultra-high-net-worth individuals." The task involved preparing a short, 2-page PDF draft of a 7-day yacht trip overview to the Bahamas for a family traveling there for the first time. The "experienced human deliverable" presented, a genuine Bahamas itinerary created for a client, demonstrated the contractor's skills and provided valuable training data. OpenAI explicitly instructs contractors to delete all corporate intellectual property and personally identifiable information from uploaded files.
Risk Mitigation: Legal Concerns and Data Scrubbing
Legal experts such as Evan Brown, an intellectual property lawyer with Neal & McDevitt, warn of potential trade secret misappropriation claims. AI labs that receive confidential information from contractors on a large scale face substantial legal risks. Contractors who provide documents from their previous workplaces, even scrubbed ones, risk violating nondisclosure agreements or exposing trade secrets. "The AI lab is putting a lot of trust in its contractors to decide what is and isn't confidential," Brown explains. "And if they do let something slip through, are the AI labs truly taking the time to determine what constitutes a trade secret? It seems to me that the AI lab is placing itself at significant risk." Contractors are pointed to a ChatGPT tool, "Superstar Scrubbing," to remove sensitive information before files are submitted.
A Growing Industry and the Importance of Data Acquisition
AI labs increasingly rely on third-party contracting firms, such as Surge, Mercor, and Scale AI, to manage networks of data contractors, a trend driven by the need for higher-quality data to improve their models. This has spurred the growth of a lucrative sub-industry: Handshake was valued at $3.5 billion in 2022, and Surge was reportedly valued at $25 billion during fundraising talks last summer. These firms, along with OpenAI, Anthropic, and Google, are hiring large numbers of contractors to generate training data for AI agents designed to automate enterprise tasks.
OpenAI's Data Sourcing Strategy: Direct Outreach
OpenAI has also explored other ways to acquire real company data. A consultant who specializes in asset sales following business closures told WIRED that an OpenAI representative had approached several firms about acquiring their data. The consultant, who spoke on condition of anonymity to protect existing business relationships, said the representative sought access to documents, emails, and other internal communications, provided that all personally identifiable information would be removed.
Our editorial team uses AI tools to aggregate and synthesize global reporting. Data is cross-referenced with public records as of April 2026.