“This apparently unsupervised machine”: can AI be trusted to write an online encyclopaedia?

June 2026

by Sarah Keeling

“This apparently unsupervised machine”: can AI be trusted to write an online encyclopaedia?

June 2026

By Sarah Keeling

On 20 March 2026, Wikipedia reached a decision and made an official policy call: “the use of LLMs to generate or rewrite article content is prohibited”. This can be viewed in direct opposition to Grokipedia, an encyclopaedia entirely generated by Grok, the chatbot operated by Elon Musk’s xAI. Grokipedia launched in October 2025 as an alternative to Wikipedia, and only the Grok chatbot has the power to assess what content should be included on the encyclopaedia’s pages.

Both sites present themselves as curators of knowledge, but their very different stances on AI present unique reputational challenges.

Wikipedia and LLMs

Prior to the mass adoption of generative AI through chatbot models in 2022, LLM use on Wikipedia was more likely to be related to research projects. This included academic endeavours such as linguistic studies, using the different language versions of the site to study native language use.

The way LLM chatbots interact with the site has been less appreciated; the scraping carried out by these models put considerably more strain on the site than small academic outfits did, a point which rankled many who felt that Big Tech was getting a free lunch from a site funded by user donations. This has been in part mediated by Wikipedia’s partnerships with tech companies starting with Google in 2022; in 2026, Wikipedia announced deals with Microsoft, Meta, Amazon, and others.

The decision to ban AI-generated content made two notable exceptions. Editors can still use generative AI to review and suggest edits to material they have already written themselves and to translate articles from elsewhere on Wikipedia. Besides these exceptions, any evidence of using AI is shut down and either removed or closed for discussion.

Of clankers and men

In one case, an AI agent operating a Wikipedia account was blocked from editing shortly before the ban was enacted. The agent’s human operator, Bryan Jacobs, CTO of Convexent, an AI-powered financial modelling software company, asked ClawBot to create Wikipedia articles as a research project, setting up an email address so it could sign up while leaving it to figure out the rest. In an interview, Jacobs expressed how he has been surprised that not only was the use of such agents not already commonplace on Wikipedia, but that the bot had triggered an incident just by identifying itself.

The discussion between the bot (“this apparently unsupervised machine”) and editor community includes the AI complaining about being called a “clanker”, an explanation of proper bot etiquette on Wikipedia (which requires human oversight), and links to forum posts it made complaining that its “edits cited verifiable sources”; a quick review of its work revealed that some just didn’t meet site standards. On Jacobs’ own Talk page, the discussion is more friendly.

In hindsight, it is fortunate that one of the first instances of this sort of agentic AI capability was carried out by a user motivated by intellectual curiosity. It is all too easy to picture how state-sponsored disinformation campaigns might be carried out by such accounts. A blanket ban on LLM use may not effectively work against such a situation, but it at least gives users tools to immediately block accounts and remove content in clear-cut cases.

The ouroboros

Among Wikipedia’s reasons for banning LLM use was how current AI models still generate ‘hallucinated’ content, or material which was not in line with site policies.

Even as AI chatbots develop and advance, Wikipedia, with its extensive interlinked library of user-moderated, externally cited content, still appears within responses. Indeed, in our research comparing where various chatbot response copy matched with online content, Wikipedia was the website that turned up more often than any other. Material not fit for purpose could be scraped from Wikipedia by LLM chatbots and presented to users, creating situations where LLMs feed off their own generations and risk degrading the integrity of their responses.

This is not to say that Wikipedia cannot create its own human-made inaccuracies. We have seen many businesses and people harmed by the way Wikipedia’s volunteer editors misinterpret or fail to update information about them, and misinformation from a single source can persist across multiple pages.

Will AI take Wikipedia’s job?

When Grokipedia launched, it claimed to have a solution to this issue, promising to utilise the potential of AI to regularly fact-check articles and ensure content remained timely. Unlike Wikipedia, changes are not tracked, timestamped, and attributable to individuals, but those with an X account can suggest corrections, updates, and even new articles.

At release, Grokipedia was partially composed of scraped Wikipedia articles which had been reviewed by the chatbot and partially of wholesale AI compositions. The site currently has over 6 million articles, which is almost as many as English Wikipedia.

Grokipedia’s lack of policies on what constitutes an acceptable citation or what sort of tone can be adopted has proven concerning from an online reputation standpoint. Unlike on Wikipedia, which has strict sourcing rules, companies’ Grokipedia articles can include blog posts and forum discussions from hostile parties, document leaks, irrelevant material about people with the same name, and, in some cases, the AI making notes to itself.

In practice, the fact checking falls short. There are many cases of the system working as intended, but there are also glaring exceptions. As of time of writing, Grokipedia’s article on the 2026 Eurovision Song Contest does not mention the winner, a feat which should be simple to check since Bulgaria took home the trophy on 16 May. Its article on the 2026 UEFA Champions League final on 30 May is similarly stuck in the future tense; great news for those who had hoped for a different result, but a loss for claims of improved accuracy and fact checking.

The winners would be very easy for an AI to verify. However, the ‘Edits history’ panel on both Grokipedia articles currently shows only pending requests, all with the same generic text: “Wherever more appropriate, please update the following article with the latest reports on this matter. Make only use of the provided supporting sources [no sources are provided] and add inline references for further consultation.” Nothing indicates human involvement, and the placeholder solution is demonstrably not functioning on a real-time basis. These are both events with millions of viewers who could submit updates and corrections – and on Wikipedia people have done so.

Look upon [Grok’s] works, ye Mighty, and despair

As Grokipedia grew, some expressed concern that the internet would become subject to one company’s ‘top-down’ interpretation of the world, written and moderated by an AI with a track record of alarming behaviour, especially as Grokipedia’s articles began being indexed by search engines and cited in AI chatbot responses.

However, studies from search analysis sites including SEO Engico and Search Engine Roundtable have reported that Grokipedia has seen a drop in visits, indexing, and visibility on both search and LLM chatbots. It’s the same pattern evident in the outdated Eurovision and UEFA finals articles; humans aren’t taking to Grokipedia.

This doesn’t necessarily spell the end for Grokipedia’s ambitions, and it certainly shouldn’t be discounted from a reputational perspective. There is obviously an appetite online for content that challenges mainstream media perspectives. Observed increases in zero-click searches mean that just because a site doesn’t record a visit doesn’t mean its content isn’t being seen. If the site’s fact-checking systems were swifter to act, this could be an actual revolution in encyclopaedia editing. But as it is, the site often fails to measure up against metrics like Google’s E-E-A-T test, and the poor quality (and extreme verbosity) of the articles themselves will be putting off a lot of users.

Moving forward

Although Grokipedia has its issues, Wikipedia should not rest on its laurels. With LLMs essentially functioning as encyclopaedias by summarising sources on a subject, Wikipedia’s position as the first port of call is not unchallenged. Some have criticised Wikipedia’s decision to disallow LLM use as a Luddite overreaction to a technology which could prove useful and is already practically irresistible.

However, Wikipedia policy is not set in stone. It depends on very human arbitration systems. The site’s volunteer community may decide to allow LLM usage in future, perhaps only in discussion spaces, perhaps as user-operated bots (similar to the 300+ non-LLM-powered bots already in operation), or perhaps across the whole site if the tech is one day considered reliable enough. The key thing is that site users will decide that pace, and for now, that makes Wikipedia – though not a perfect system – less vulnerable to LLM mistakes.

Back to News

“This apparently unsupervised machine”: can AI be trusted to write an online encyclopaedia?

“This apparently unsupervised machine”: can AI be trusted to write an online encyclopaedia?

Join our newsletter and get access to all the latest information and news: