So, I’m looking into coding some binary data as text, using single graphical units, after using already all the available ones for Java and UTF16, I’d like to expand my "pool".
I recently discovered grapheme clusters, where you can combine different elements/chars in order to create a single graphical unit, or, as they are properly called, a grapheme cluster.
For example: "🤦🏼♂️", which holds:
- base character
- skintone modifier
- ligature character (zero width joiner)
- male sign
- variation selector
I wonder if there is a comprensive list of all the possibile combinations, or at least some specifications which state clearly which emoticons can be combined how and how long they can be at maximum
Well, the obvious starting point is unicode itself: https://unicode.org
It does not provide a single, comprehensive list, as far as I know, but several lists for groups of symbols, e.g.
Emojis without modifiers:
Emojis with modifiers: