Is there a list of all (or most of the) possible combinations of (extended) grapheme clusters?

So, I’m looking into coding some binary data as text, using single graphical units, after using already all the available ones for Java and UTF16, I’d like to expand my "pool".

I recently discovered grapheme clusters, where you can combine different elements/chars in order to create a single graphical unit, or, as they are properly called, a grapheme cluster.

For example: "🤦🏼‍♂️", which holds:

  • base character
  • skintone modifier
  • ligature character (zero width joiner)
  • male sign
  • variation selector

(more here or here)

I wonder if there is a comprensive list of all the possibile combinations, or at least some specifications which state clearly which emoticons can be combined how and how long they can be at maximum

>Solution :

Well, the obvious starting point is unicode itself: https://unicode.org

It does not provide a single, comprehensive list, as far as I know, but several lists for groups of symbols, e.g.

Emojis without modifiers:
https://unicode.org/emoji/charts/full-emoji-list.html

Emojis with modifiers:
https://unicode.org/emoji/charts/full-emoji-modifiers.html

Leave a Reply