From 6c1fd9608e8134a90a765a030c1933d255aa8ca0 Mon Sep 17 00:00:00 2001 From: Addison Phillips Date: Fri, 18 Oct 2024 14:36:08 -0700 Subject: [PATCH 1/9] Add a note about EGCs - Fixes #1 - Minor cleanup to examples - Links some terms to the glossary --- index.html | 15 ++++++++------- 1 file changed, 8 insertions(+), 7 deletions(-) diff --git a/index.html b/index.html index c4e5c46..830cdc2 100644 --- a/index.html +++ b/index.html @@ -1237,10 +1237,13 @@

Characters

At their simplest, user-perceived characters are a single shape that can be tied one-to-one to the underlying computing representation. But a user-perceived character can be formed, in some scripts, from more than one character. And a given logical character can take many different shapes due to such influences as font selection, style, or the surrounding context (such as adjacent characters). In some cases, a single user-perceived character might be formed from a long sequence of logical characters. And some logical characters (so-called "combining marks") are always used in conjunction with another character.

-

When user-perceived characters are represented visibly (on screen or in print), they are represented by individual rendering units. This visual unit is called a grapheme (the word glyph is also used). Graphemes are the visual units found in fonts and rendering software.

+

When user-perceived characters are represented visibly (on screen or in print), they are represented by individual rendering units. This visual unit is called a [=grapheme cluster=] (or [=grapheme=] for short; the word [=glyph=] is also sometimes used).

+ + -