Follow

Keep Up to Date with the Most Important News

By pressing the Subscribe button, you confirm that you have read and are agreeing to our Privacy Policy and Terms of Use
Contact

Is there a list of all (or most of the) possible combinations of (extended) grapheme clusters?

So, I’m looking into coding some binary data as text, using single graphical units, after using already all the available ones for Java and UTF16, I’d like to expand my "pool".

I recently discovered grapheme clusters, where you can combine different elements/chars in order to create a single graphical unit, or, as they are properly called, a grapheme cluster.

For example: "🤦🏼‍♂️", which holds:

MEDevel.com: Open-source for Healthcare and Education

Collecting and validating open-source software for healthcare, education, enterprise, development, medical imaging, medical records, and digital pathology.

Visit Medevel

  • base character
  • skintone modifier
  • ligature character (zero width joiner)
  • male sign
  • variation selector

(more here or here)

I wonder if there is a comprensive list of all the possibile combinations, or at least some specifications which state clearly which emoticons can be combined how and how long they can be at maximum

>Solution :

Well, the obvious starting point is unicode itself: https://unicode.org

It does not provide a single, comprehensive list, as far as I know, but several lists for groups of symbols, e.g.

Emojis without modifiers:
https://unicode.org/emoji/charts/full-emoji-list.html

Emojis with modifiers:
https://unicode.org/emoji/charts/full-emoji-modifiers.html

Add a comment

Leave a Reply

Keep Up to Date with the Most Important News

By pressing the Subscribe button, you confirm that you have read and are agreeing to our Privacy Policy and Terms of Use

Discover more from Dev solutions

Subscribe now to keep reading and get access to the full archive.

Continue reading