@hongminhee I still think it's stupid that Unicode hasn't separated the different-looking CJK glyphs into separate codepoints. If we can have A, Α, А and A as separate characters, why couldn't that have been done for CJK?
Top-level
@hongminhee I still think it's stupid that Unicode hasn't separated the different-looking CJK glyphs into separate codepoints. If we can have A, Α, А and A as separate characters, why couldn't that have been done for CJK? 3 comments
@hongminhee But why? If the character looks different, why wouldn't it be represented by a different codepoint? @jernej__s Because they are the same characters, even though they look slightly different. “Unicode encodes characters, not glyphs.” —Unicode FAQ. It's like Arabic numeral 7 is encoded as a single codepoint whether it has an extra horizontal line drawn across it or not. https://upload.wikimedia.org/wikipedia/commons/5/5c/Hand_Written_7.svg |
@jernej__s I'm in favor of Han unification though. See also this:
https://fosstodon.org/@hongminhee/113039545387576150