I also made a little decimal to Cistercian translator once. The numerals aren't in Unicode, so the font I use has to make use of private code points, which was kind of fun to get working properly. <https://cistercian.micahrl.com>
> It is not clear that there is any current need for Unicode support. Western Michigan University’s Medieval Institute (Cistercian and Monastic Studies) has a digitization project of Cistercian mss, but as of end 2020 they have not expressed any desire for computer encoding of the numerals. This document is therefore background information so that work is not duplicated if a need arises later.
Unrelated rant: The Unicode people have missed a lot more than that. There are an enormous amount of, for instance, musical accidentals that are missing, including ones that have been in use for at least a hundred years, such as the half flats or half sharps in e.g. Arabic or Turkish music, which are used by probably hundreds of millions of people in the Middle East. (They do have a symbol called "half sharp" and "half flat", but it isn't the same symbol.)
But at least they have the multiocular O, I guess.
If there's something missing from Unicode, you can propose it and there's a good chance they will add it to Unicode. Source: I got 7 characters added to Unicode.
The Bitcoin symbol is the most well known. Also the Group Mark from 1960s mainframes. The mask work symbol, like copyright for IC dies. And four half-stars.
Tengwar likely has to wait for the JRR Tolkien estate to release it to the public domain (CC0, for instance) or for the copyrights to expire on their own in a century or so.
(Same with Klingon glyphs and others.)
Such uses are probably fair use, but the Unicode consortium seems hesitant to test "probably" versus reality, especially since they would not see the brunt of the consequences but font authors and others might.
Unicode afaik focuses on taking already existing & supported character encodings, and incorporates those. Given there are 10,000 distinct glyphs, without some group of people already pushing for a specific encoding because it's what they use, Unicode probably doesn't have that strong a motivation to create their own (only to find out that it doesn't meet the needs of real users).
At least Unicode today doesn't need 10,000 codepoints to encode 10,000 distinct glyphs. Cistercian might make sense of ZWJ sequences of existing numerals plus a "cistercian mode selector". Authoring font ligatures to support that might not be fun, though, but also the cistercian glyphs are pretty simple concatenative structures almost perfectly designed for simple ligatures.
You're right, that does seem a better way to encode these—but that kind of encoding design I believe Unicode is wary of doing themselves. They seem to prefer sticking with a community-picked encoding instead.
Yes, I don't have an answer for that part and it makes sense of course, there's no reason to design an encoding that no one expects to use.
On the other hand, the above design idea (ZWJ sequences of arabic numerals with a "mode" marker) is offered free to anyone that does think they have a use for these numerals in Unicode and wants to turn it into a proper proposal.
Luckily Unicode makes exceptions for charsets/languages that are overlooked, also they take encouragement (via joining the Unicode Consortium) if you have a special symbol that you need to add for a private interest.
I ended up using that library to create a simple Cistercian clock screensaver. My hope was that I'd learn to read Cistercian numbers fluently though I'm still working on that.
D'ni numerals overlay two base-5 values in the same space, one horizontal & the other vertical, to represent 0–24. There's even a one-character numeral for 25, though it rarely comes up in-game. Different, of course, but I wouldn't be surprised if Cistercian was the inspiration.
Another number system, which I learned about recently on HN, are the Kaktovik numerals, which are even closer to D'ni. Interestingly, they were developed just a couple of years before Riven was released.
It's not a base-5 system exactly. It's base-20, where the rotated-and-superimposed digit is only one of the first three numerals. I guess you could call it a mixed-radix system, where the bases alternate like 4 5 ... 4 5 4 5 and where each group of 4 5 combines into a single symbol.
I feel like there's a missed symmetry here though and almost symmetrical things are like a wobbly tooth in my brain.
The right line appears at 6, but if you added that at 5 you could repeat the pattern you did for 1-4 for the horizontal lines, which it almost does. 7 and 8 are 6+1 and 6+2.
I'd make it binary-like, like another commenter said.
Change 3 to 2+1, 6 to 4+2.
8 takes the current 6, and 7 becomes 6+1, 9 becomes 8+1 (current 7). That also gives you hex digits to work with for free, each symbol could store up to FFFF.
It wouldn't change a lot, just removes the downstroke from 3.
Of course, 4 bits is an arbitrary choice, one could keep the current 5 bit/segments as well, but not much is needed to normalize it to 4 bits, which is interesting in itself.
The problems are that this is 1) hard to use for arithmetic, and 2) brittle.
"Brittle" because you're peacefully sitting there at 9999 coins or whatever, and you earn another one, and suddenly you can't represent it. Arabic (Hindu) numerals just slap another digit to the left, but Cistercian numerals break.
(You could fix that by using an Arabic notation - two Cistercian numbers, with the left one meaning "multiply this by 10,000". At that point, Cistercian numbers become a compressed Arabic notation. Still harder to use for arithmetic, but that's true of many compressed formats.)
Any fixed-number-of-digits (or bits, or whatever) system is brittle against overflow.
Any compressed format is hard(er) to use for arithmetic. Pretty much any grade-school kid can memorize the addition table, and can work it out on their fingers if they forget. But that table only has 100 entries. The addition tables for Cistercian numbers has 100,000,000 entries; nobody will memorize that.
But maybe I'm thinking wrong. Cistercian numbers really only have 0-9, in four possible orientations. So maybe you can just add them, going from units up through thousands, writing down each quadrant as you go. (Multiplying still looks like a mess, though...)
This looks a lot like how Chinese characters are composed of individual "radicals", arranged like tetris blocks inside the same square box. Some of them affect the meaning of the letter, while others only signify the sound it's supposed to make. The resulting writing system is not easy for foreigners to learn, but it makes quite an efficient use of space. This might have been more valuable at a time when writable surfaces were expensive.
Oh how I wish the glyphs started at the top right and moved anti clockwise one quadrant for each power of two, to be consistent with right handed coordinate systems
Glyphs have to be written in an orientation; bidirection (or higher symmetric) layouts would impose either writing the words left to right & right to left, which would cause smearing when writing with ink with a dominant hand, or mirroring the glyphs themselves, which would make reading & writing words much more difficult.
Well, another obvious extension would be to prolong the vertical line, since it's already ordered by rows (two of them).
Or, indeed, you could make an array out of them, making a super-glyph (kind of like a block matrix). Four blocks of four digits would get you to 10^4^4 = 10^16 so 100,00,00,00,00,00,00,00 (or 10000,0000,0000,0000, or 10,000,000,000,000,000 depending on your grouping), which is 10 Quadrillions in short- scale. Obviously grouping digits by blocks of 2 or 4 makes mor sense, since that's a row or a glyph.
Interesting discussion though, which is probably the goal here, rather than a good or bad reply: it allows you to see how the candidate thinks or approaches a problem.
It’s intentionally open-ended to engage in discussions about functional and quality attributes. Does the candidate ask questions or just jump into an answer?
I love it because it’s very open-ended and demonstrates problem solving. It’s a great conversation piece.
IMHO, Having something reposted every now and then is good. This gives the opportunity for people to see it if they missed it the first time.
Judging for myself, this is the first time I’ve seen Cistercian numbers and although I’m a frequent HN user, I’ve never seen them before.
If a topic is interesting it doesn’t hurt to repost it :-)
I also made a little decimal to Cistercian translator once. The numerals aren't in Unicode, so the font I use has to make use of private code points, which was kind of fun to get working properly. <https://cistercian.micahrl.com>