Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Every character, or to be more exact, every "grapheme", is assigned a Unicode code point.

Every character is assigned a Unicode code point. The Unicode consortium defines a grapheme as a "user perceived character", usually made up of one Unicode code point, but sometimes two or more. A base character can be followed by one or more non-spacing marks, together forming a "grapheme", the most common of which usually have a "canonical mapping" to a single character, but need not.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: