> Let's follow one example: Nigeria is the most populous country in Africa. In Abstract Wikipedia, this might be stored as: Z27243(Q1033, Q138758272, Q6256, Q15, Z27243K5)
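The quoted constructor call can be pictured as a typed frame plus per-language renderers. A minimal sketch in Python, with invented names — the real Wikifunctions Z27243 object and its key structure are not reproduced here, only the idea that Q-items fill slots in an abstract "superlative" frame:

```python
# Hypothetical sketch of an Abstract Wikipedia-style constructor.
# Field names and the renderer are illustrative, not real Wikifunctions APIs.
from dataclasses import dataclass

@dataclass
class Superlative:
    subject: str    # e.g. "Nigeria"  (Q1033)
    quality: str    # e.g. "populous" (Q138758272)
    category: str   # e.g. "country"  (Q6256)
    location: str   # e.g. "Africa"   (Q15)

def render_en(s: Superlative) -> str:
    # A language-specific renderer turns the abstract frame into text;
    # a render_fr, render_sw, etc. would share the same input frame.
    return f"{s.subject} is the most {s.quality} {s.category} in {s.location}."

fact = Superlative("Nigeria", "populous", "country", "Africa")
print(render_en(fact))  # → Nigeria is the most populous country in Africa.
```

The point of the design is that the frame is language-independent: one stored fact, many renderers.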
Haha that's like John Wilkins' "Real Character, and a Philosophical Language"
It's not that different from how LLM tokens work, only in a tree structure as opposed to a plain sequence. Having a tree structure makes it easier to formally define rewrite rules (which is key for interpretability), as opposed to learning them from data as LLMs do.
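To make "formally define rewrite rules" concrete, here is a toy sketch, with invented structures — not real Abstract Wikipedia code. A tree node is a `(label, children)` tuple, and each rule is an inspectable function keyed by node label, which is what makes the system interpretable by construction:

```python
# Toy bottom-up tree rewriting; labels and rules are illustrative only.
def rewrite(node, rules):
    """Rewrite children first, then apply the rule (if any) for this label."""
    if isinstance(node, str):          # leaves pass through unchanged
        return node
    label, children = node
    node = (label, [rewrite(c, rules) for c in children])
    return rules.get(label, lambda n: n)(node)

# One explicit, human-readable rule: realize a Superlative node as English.
rules = {
    "Superlative": lambda n: "{} is the most {} {} in {}.".format(*n[1]),
}

tree = ("Superlative", ["Nigeria", "populous", "country", "Africa"])
print(rewrite(tree, rules))  # → Nigeria is the most populous country in Africa.
```

In an LLM the equivalent transformation is distributed across learned weights; here it is a single rule you can read and edit.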
Also, tokens don't represent meaning in themselves; they are assigned points in a multidimensional space, and can only represent meaning in the network as a whole, combined with other tokens in context and order.
And the abstract concepts of Abstract Wikipedia are human-defined, top-down ways of carving the world into distinct categories which make some kind of logical sense, whereas LLMs work bottom-up and create overlapping, non-hierarchical, probabilistic networks of connections, with nearly no imposed structure except the principle that you shall know a token by the company it keeps.
But you can type them both out with keys on a keyboard so in that sense I guess they're not that different.
For context, this was proposed way back in 2013 (https://meta.wikimedia.org/wiki/Abstract_Wikipedia), when machine translation was just plain bad (and LLMs were known only in academic circles). Surprised that AWiki is now active though.
https://en.wikipedia.org/wiki/La_Ricerca_della_Lingua_Perfet... is a great intro to the weird and wonderful world of abstract/universal/ideal/a priori languages.