thoughts on the LLM-powered encyclopedia

it would be a competitor to Wikipedia.

instead of the encyclopedia anyone can edit, it is the encyclopedia that only the machine can edit. but, people can give the machine directions on how to edit.


There are about 6 million articles on Wikipedia. But the number of "interesting" articles is under 100k. The rest are in a bunch of "collections". 💡( the collections need managing, but are less work)

  • geographic locations
  • sports people/statistics
  • politics people/statistics
  • entertainment people/statistics
  • films, songs, books, etc.

... there are probably 10-20 more "collections" of a similar size.


There is also "news". Which is largely an intractable problem.

💬 News is the first rough draft of history. --Phil Graham

The barrier is less a "technical" one (although the acrobatics of writing accurate information based on incomplete-and-biased reporting is a technical challenge) and more of a "practical" one. If you have too much news, you cannibalize your sources. 🌎( and, you attract the attention of the people who make the news)

Fortunately, as we can keep editors out, it should be easier to not have those articles.


Some "historical" topics are more contentious than others. Initially, there can be "competing narratives".

💡 do we allow Tartaria or Fomenko-ism as a competing narrative? the answer must be the same as do we allow creationism as a competing narrative.

⚙️ there is the practical answer of this is how we keep it out of the main article. but, once again, as we can keep editors out, this is not a controlling requirement.


this leaves the question of who is evaluating the content and who is reading the content. 🔥( what if you build it and nobody comes?)

other than sell your soul to some ideological movement, i haven't come up with any plausible answers here. 💡( there are several new LLM-related ideological movements.) 📣( the Great LDS Encyclopedia sounds more palatable)


🔥 in the future the machine will know the answers. the encyclopedia will just be routing and caching.

⚙️ routing is "knowing which expert to ask about Sammy Sosa or John Culver". caching is "sometimes it does make sense to have a saved version on-hand, rather than asking the machine only after someone asks for the answer"


there is, also, the problem of the machine probably can't write a high-quality article about Pentecost just yet. but that is 2 years out, at most.

Quotes

"News is the first rough draft of history. --Phil Graham"