Prev | Current Page 360 | Next

Brad Ediger

"Advanced Rails"

/
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ DEL
238 | Chapter 8: i18n and L10n
Extended ASCII
Although ASCII defines 128 characters and a 7-bit encoding, most computers process
data in 8-bit bytes. This leaves room for 128 more characters. Of course, computer
vendors each chose their own way to deal with this situation. This led to the
development of numerous extended-ASCII character sets, each of which used a different
interpretation for the upper octets (80 through FF).
The most widely adopted extended-ASCII standard is ISO 8859. This standard
adopts the ASCII values for the first 128 characters, and provides 15 different ???parts???
that each provide a definition for the last 128 characters. In effect, ISO 8859 defines
15 separate character sets.
The most used of these character sets is ISO-8859-1 (Latin-1). This provides nearly
complete coverage for most Western European languages. In fact, the 256 characters
defined by ISO-8859-1 correspond to the first 256 code points of Unicode. ISO-
8859-1 is still in widespread use among languages that use the Latin alphabet.
Problems with ASCII
Though the extended ASCII character encodings were widely successful for years,
they only provided a temporary fix. With so many encodings floating around, it is
difficult for people to communicate.


Pages:
348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372