@neauoire If you haven't already, make sure to read about the difference between code points vs graphemes vs grapheme clusters. Thinking in terms of "characters" tends to lead to trouble, because the word "character" could refer to any of those things.
Depending on your level of commitment to international text support it might also be worth getting your head around e.g. the role of technologies like HarfBuzz in text rendering.
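To make the code point vs grapheme cluster distinction concrete, here is a small sketch in Python (my choice for illustration, not anything Left uses). The sample strings and the `describe` helper are made up for this example. Each sample would normally be drawn as a single visible unit, yet the code point and byte counts differ. Python's standard library has no grapheme cluster segmentation, so the "one visible unit" part is stated in comments rather than computed.

```python
import unicodedata

# Hypothetical sample strings: each is normally rendered as one visible unit.
samples = {
    "precomposed e-acute": "\u00e9",
    "e + combining acute accent": "e\u0301",
    "woman + zero-width joiner + girl": "\U0001F469\u200D\U0001F467",
}


def describe(text):
    """Return (number of code points, number of UTF-8 bytes) for text."""
    return len(text), len(text.encode("utf-8"))


for label, text in samples.items():
    code_points, utf8_bytes = describe(text)
    print(f"{label}: {code_points} code point(s), {utf8_bytes} UTF-8 byte(s)")

# NFC normalization composes "e" + U+0301 into the single code point U+00E9,
# but there is no precomposed form for the emoji sequence, so normalization
# alone does not turn every grapheme cluster into one code point.
composed = unicodedata.normalize("NFC", "e\u0301")
print(composed == "\u00e9")
```

So an editor that moves the cursor one code point at a time would need two or three key presses to cross the last two samples, which is the kind of trouble the word "character" tends to hide.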
@evan I doubt I will cover much more than what I need to do my work. My goal here was merely that if I encounter a multi-byte glyph, I can walk over it, erase it, and select it properly, instead of, say, having to walk over 4 spaces in memory for a 4-byte glyph. At least that part is well designed in UTF-8, and I haven't yet found exceptions that Left couldn't handle.
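For reference, the property being relied on here is that every UTF-8 continuation byte has the bit pattern `10xxxxxx`, so a cursor can find the start of the next or previous code point by skipping those bytes. Below is a minimal sketch in Python, assuming the text is held as raw UTF-8 bytes. The function names are hypothetical and this is not how Left is implemented. It steps by code point only, so a grapheme cluster built from several code points would still take several steps.

```python
def is_continuation(byte):
    """True for UTF-8 continuation bytes, which look like 0b10xxxxxx."""
    return (byte & 0xC0) == 0x80


def next_boundary(buf, pos):
    """Return the byte offset of the next code point start after pos."""
    pos = min(pos + 1, len(buf))
    while pos < len(buf) and is_continuation(buf[pos]):
        pos += 1
    return pos


def prev_boundary(buf, pos):
    """Return the byte offset of the previous code point start before pos."""
    pos = max(pos - 1, 0)
    while pos > 0 and is_continuation(buf[pos]):
        pos -= 1
    return pos


# "a", a 4-byte emoji, then "b": 6 bytes in total.
buf = "a\U0001F600b".encode("utf-8")
cursor = next_boundary(buf, 1)       # steps over all 4 bytes of the emoji
print(cursor)
print(prev_boundary(buf, cursor))    # steps back to the start of the emoji
```

Erasing or selecting the glyph then amounts to operating on the byte range between two adjacent boundaries, for example `buf[1:5]` for the emoji above.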