@johannes @jkt @simevidas I thought it would be the...

@johannes @jkt @simevidas I thought it would be the other way around. The same grouping of bytes could represent different codepoints, based on the encoding.

Like 24 Feb 2023 at 21:05 | Wall-to-wall | Open on phpc.social

1 comment

Johannes ✔️

@ramsey @jkt @simevidas yes, but working on bytes means that the encoding has to be carried thorough the different layers and might cut utf-8 sequences apart (assuming utf-8 being the default encoding)

With either codepoints or grapheme clusters you at least get some valid (while not always sensible) result.

24 Feb 2023 at 21:09 | Open on det.social

Go Up