Latin1的所有字符编码
ISO-8859-1 (ISO Latin 1) Character Encoding
Contents
- The characters at a glance
- Character codes and names
- Notes for html documents
- Other notes
- Additional references
The characters at a glance
Here are all the printable characters, in collating order:
- ! " # $ % & \' ( ) * + , - . /
- 0 1 2 3 4 5 6 7 8 9 : ; < = > ? @
- A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
- [ \ ] ^ _ `
- a b c d e f g h i j k l m n o p q r s t u v w x y z
- { | } ~
- ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯
- ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
- À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö
- ×
- Ø Ù Ú Û Ü Ý Þ
- ß
- à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö
- ÷
- ø ù ú û ü ý þ
- ÿ
The first six rows are the ASCII character set.
Note the ordinary ASCII space (before `!\’) and the ISO Latin-1 non-breaking space (before `¡\’)
Character codes and names
The columns show, in order:
- HTML: the HTML notation (decimal);
- OCTL: the C/Modula-3 octal notation;
- HEX: the hexadecimal code, as used e.g. in MIME quoted-printable encoding;
- CMP: the Sun/X11 “Compose” key combinations;
- CHR: the charater itself, when HTML-printable;
-
MEANING: the ISO-8859-1 (and ASCII) definition.
- HTML OCTL HEX CMP CHR MEANING
- ------ + ---- + --- + --- + --- + ------------------------------
- � | \000 | =00 | | | Invalid (ASCII NUL)
-  | \001 | =01 | | | Unused (ASCII SOH)
-  | \002 | =02 | | | Unused (ASCII STX)
-  | \003 | =03 | | | Unused (ASCII ETX)
-  | \004 | =04 | | | Unused (ASCII EOT)
-  | \005 | =05 | | | Unused (ASCII ENQ)
-  | \006 | =06 | | | Unused (ASCII ACK)
-  | \007 | =07 | | | Unused (ASCII BEL, audible bell)
-  | \010 | =08 | | | Unused (ASCII BS, backspace)
- | \011 | =09 | | | Horizontal tab (ASCII HT)
- | \012 | =0A | | | Line feed (ASCII NL, newline)
-  | \013 | =0B | | | Unused (ASCII VT, vertical tab)
-  | \014 | =0C | | | Unused (ASCII NP, new page)
- | \015 | =0D | | | Carriage Return (ASCII CR)
-  | \016 | =0E | | | Unused (ASCII SO)
-  | \017 | =0F | | | Unused (ASCII SI)
-  | \020 | =10 | | | Unused (ASCII DLE)
-  | \021 | =11 | | | Unused (ASCII DC1)
-  | \022 | =12 | | | Unused (ASCII DC2)
-  | \023 | =13 | | | Unused (ASCII DC3)
-  | \024 | =14 | | | Unused (ASCII DC4)
-  | \025 | =15 | | | Unused (ASCII NAK)
-  | \026 | =16 | | | Unused (ASCII SYN)
-  | \027 | =17 | | | Unused (ASCII ETB)
-  | \030 | =18 | | | Unused (ASCII CAN)
-  | \031 | =19 | | | Unused (ASCII EM)
-  | \032 | =1A | | | Unused (ASCII SUB)
-  | \033 | =1B | | | Unused (ASCII ESC, escape)
-  | \034 | =1C | | | Unused (ASCII FS)
-  | \035 | =1D | | | Unused (ASCII GS)
-  | \036 | =1E | | | Unused (ASCII RS)
-  | \037 | =1F | | | Unused (ASCII US)
- | \040 | =20 | | ( ) | Space (ASCII SP)
- ! | \041 | =21 | | (!) | Exclamation mark
- " | \042 | =22 | | (") | Quotation mark (")
- # | \043 | =23 | | (#) | Number sign
- $ | \044 | =24 | | ($) | Dollar sign
- % | \045 | =25 | | (%) | Percent sign
- & | \046 | =26 | | (&) | Ampersand (&)
- ' | \047 | =27 | | (\') | Apostrophe (right single quote)
- ( | \050 | =28 | | (() | Left parenthesis
- ) | \051 | =29 | | ()) | Right parenthesis
- * | \052 | =2A | | (*) | Asterisk
- + | \053 | =2B | | (+) | Plus sign
- , | \054 | =2C | | (,) | Comma
- - | \055 | =2D | | (-) | Hyphen
- . | \056 | =2E | | (.) | Period (fullstop)
- / | \057 | =2F | | (/) | Solidus (slash)
- 0 | \060 | =30 | | (0) | Digit 0
- . . .
- 9 | \071 | =39 | | (9) | Digit 9
- : | \072 | =3A | | (:) | Colon
- ; | \073 | =3B | | (;) | Semi-colon
- < | \074 | =3C | | (<) | Less than (<)
- = | \075 | =3D | | (=) | Equals sign
- > | \076 | =3E | | (>) | Greater than (>)
- ? | \077 | =3F | | (?) | Question mark
- @ | \100 | =40 | | (@) | Commercial at-sign
- A | \101 | =41 | | (A) | Uppercase letter A
- . . .
- Z | \132 | =5A | | (Z) | Uppercase letter Z
- [ | \133 | =5B | | ([) | Left square bracket
- \ | \134 | =5C | | (\) | Reverse solidus (backslash)
- ] | \135 | =5D | | (]) | Right square bracket
- ^ | \136 | =5E | | (^) | Caret
- _ | \137 | =5F | | (_) | Horizontal bar (underscore)
- ` | \140 | =60 | | (`) | Reverse apostrophe (left single quote)
- a | \141 | =61 | | (a) | Lowercase letter a
- . . .
- z | \172 | =7A | | (z) | Lowercase letter z
- { | \173 | =7B | | ({) | Left curly brace
- | | \174 | =7C | | (|) | Vertical bar
- } | \175 | =7D | | (}) | Right curly brace
- ~ | \176 | =7E | | (~) | Tilde
- | \177 | =7F | | | Unused (ASCII DEL)
- € | \200 | =80 | | | Unused
- | \201 | =81 | | | Unused
- ‚ | \202 | =82 | | | Unused
- ƒ | \203 | =83 | | | Unused
- „ | \204 | =84 | | | Unused
- … | \205 | =85 | | | Unused
- † | \206 | =86 | | | Unused
- ‡ | \207 | =87 | | | Unused
- ˆ | \210 | =88 | | | Unused
- ‰ | \211 | =89 | | | Unused
- Š | \212 | =8A | | | Unused
- ‹ | \213 | =8B | | | Unused
- Œ | \214 | =8C | | | Unused
- | \215 | =8D | | | Unused
- Ž | \216 | =8E | | | Unused
- | \217 | =8F | | | Unused
- | \220 | =90 | | | Unused
- ‘ | \221 | =91 | | | Unused
- ’ | \222 | =92 | | | Unused
- “ | \223 | =93 | | | Unused
- ” | \224 | =94 | | | Unused
- • | \225 | =95 | | | Unused
- – | \226 | =96 | | | Unused
- — | \227 | =97 | | | Unused
- ˜ | \230 | =98 | | | Unused
- ™ | \231 | =99 | | | Unused
- š | \232 | =9A | | | Unused
- › | \233 | =9B | | | Unused
- œ | \234 | =9C | | | Unused
- | \235 | =9D | | | Unused
- ž | \236 | =9E | | | Unused
- Ÿ | \237 | =9F | | | Unused
- | \240 | =A0 | | ( ) | Non-breaking space ( )
- ¡ | \241 | =A1 | ! ! | (¡) | Inverted exclamation
- ¢ | \242 | =A2 | c / | (¢) | Cent sign
- £ | \243 | =A3 | l - | (£) | Pound sterling
- ¤ | \244 | =A4 | o x | (¤) | General currency sign
- ¥ | \245 | =A5 | y - | (¥) | Yen sign
- ¦ | \246 | =A6 | | | | (¦) | Broken vertical bar
- § | \247 | =A7 | s o | (§) | Section sign
- ¨ | \250 | =A8 | " " | (¨) | Umlaut (dieresis)
- © | \251 | =A9 | c o | (©) | Copyright
- ª | \252 | =AA | - a | (ª) | Feminine ordinal
- « | \253 | =AB | < < | («) | Left angle quote, guillemotleft
- ¬ | \254 | =AC | - , | (¬) | Not sign
- | \255 | =AD | - - | () | Soft hyphen
- ® | \256 | =AE | r o | (®) | Registered trademark
- ¯ | \257 | =AF | ^ - | (¯) | Macron accent
- ° | \260 | =B0 | ^ * | (°) | Degree sign
- ± | \261 | =B1 | + - | (±) | Plus or minus
- ² | \262 | =B2 | ^ 2 | (²) | Superscript two
- ³ | \263 | =B3 | ^ 3 | (³) | Superscript three
- ´ | \264 | =B4 | \ \ | (´) | Acute accent
- µ | \265 | =B5 | / u | (µ) | Micro sign
- ¶ | \266 | =B6 | P ! | (¶) | Paragraph sign
- · | \267 | =B7 | ^ . | (·) | Middle dot
- ¸ | \270 | =B8 | , , | (¸) | Cedilla
- ¹ | \271 | =B9 | ^ 1 | (¹) | Superscript one
- º | \272 | =BA | _ o | (º) | Masculine ordinal
- » | \273 | =BB | > > | (») | Right angle quote, guillemotright
- ¼ | \274 | =BC | 1 4 | (¼) | Fraction one-fourth
- ½ | \275 | =BD | 1 2 | (½) | Fraction one-half
- ¾ | \276 | =BE | 3 4 | (¾) | Fraction three-fourths
- ¿ | \277 | =BF | ? ? | (¿) | Inverted question mark
- À | \300 | =C0 | A ` | (À) | Capital A, grave accent
- Á | \301 | =C1 | A \' | (Á) | Capital A, acute accent
- Â | \302 | =C2 | A ^ | (Â) | Capital A, circumflex accent
- Ã | \303 | =C3 | A ~ | (Ã) | Capital A, tilde
- Ä | \304 | =C4 | A " | (Ä) | Capital A, dieresis or umlaut mark
- Å | \305 | =C5 | A * | (Å) | Capital A, ring
- Æ | \306 | =C6 | A E | (Æ) | Capital AE dipthong (ligature)
- Ç | \307 | =C7 | C , | (Ç) | Capital C, cedilla
- È | \310 | =C8 | E ` | (È) | Capital E, grave accent
- É | \311 | =C9 | E \' | (É) | Capital E, acute accent
- Ê | \312 | =CA | E ^ | (Ê) | Capital E, circumflex accent
- Ë | \313 | =CB | E " | (Ë) | Capital E, dieresis or umlaut mark
- Ì | \314 | =CC | I ` | (Ì) | Capital I, grave accent
- Í | \315 | =CD | I \' | (Í) | Capital I, acute accent
- Î | \316 | =CE | I ^ | (Î) | Capital I, circumflex accent
- Ï | \317 | =CF | I " | (Ï) | Capital I, dieresis or umlaut mark
- Ð | \320 | =D0 | D - | (Ð) | Capital Eth, Icelandic
- Ñ | \321 | =D1 | N ~ | (Ñ) | Capital N, tilde
- Ò | \322 | =D2 | O ` | (Ò) | Capital O, grave accent
- Ó | \323 | =D3 | O \' | (Ó) | Capital O, acute accent
- Ô | \324 | =D4 | O ^ | (Ô) | Capital O, circumflex accent
- Õ | \325 | =D5 | O ~ | (Õ) | Capital O, tilde
- Ö | \326 | =D6 | O " | (Ö) | Capital O, dieresis or umlaut mark
- × | \327 | =D7 | x x | (×) | Multiply sign
- Ø | \330 | =D8 | O / | (Ø) | Capital O, slash
- Ù | \331 | =D9 | U ` | (Ù) | Capital U, grave accent
- Ú | \332 | =DA | U \' | (Ú) | Capital U, acute accent
- Û | \333 | =DB | U ^ | (Û) | Capital U, circumflex accent
- Ü | \334 | =DC | U " | (Ü) | Capital U, dieresis or umlaut mark
- Ý | \335 | =DD | Y \' | (Ý) | Capital Y, acute accent
- Þ | \336 | =DE | P | | (Þ) | Capital THORN, Icelandic
- ß | \337 | =DF | s s | (ß) | Small sharp s, German (sz ligature)
- à | \340 | =E0 | a ` | (à) | Small a, grave accent
- á | \341 | =E1 | a \' | (á) | Small a, acute accent
- â | \342 | =E2 | a ^ | (â) | Small a, circumflex accent
- ã | \343 | =E3 | a ~ | (ã) | Small a, tilde
- ä | \344 | =E4 | a " | (ä) | Small a, dieresis or umlaut mark
- å | \345 | =E5 | a * | (å) | Small a, ring
- æ | \346 | =E6 | a e | (æ) | Small ae dipthong (ligature)
- ç | \347 | =E7 | c , | (ç) | Small c, cedilla
- è | \350 | =E8 | e ` | (è) | Small e, grave accent
- é | \351 | =E9 | e \' | (é) | Small e, acute accent
- ê | \352 | =EA | e ^ | (ê) | Small e, circumflex accent
- ë | \353 | =EB | e " | (ë) | Small e, dieresis or umlaut mark
- ì | \354 | =EC | i ` | (ì) | Small i, grave accent
- í | \355 | =ED | i \' | (í) | Small i, acute accent
- î | \356 | =EE | i ^ | (î) | Small i, circumflex accent
- ï | \357 | =EF | i " | (ï) | Small i, dieresis or umlaut mark
- ð | \360 | =F0 | d - | (ð) | Small eth, Icelandic
- ñ | \361 | =F1 | n ~ | (ñ) | Small n, tilde
- ò | \362 | =F2 | o ` | (ò) | Small o, grave accent
- ó | \363 | =F3 | o \' | (ó) | Small o, acute accent
- ô | \364 | =F4 | o ^ | (ô) | Small o, circumflex accent
- õ | \365 | =F5 | o ~ | (õ) | Small o, tilde
- ö | \366 | =F6 | o " | (ö) | Small o, dieresis or umlaut mark
- ÷ | \367 | =F7 | - : | (÷) | Division sign
- ø | \370 | =F8 | o / | (ø) | Small o, slash
- ù | \371 | =F9 | u ` | (ù) | Small u, grave accent
- ú | \372 | =FA | u \' | (ú) | Small u, acute accent
- û | \373 | =FB | u ^ | (û) | Small u, circumflex accent
- ü | \374 | =FC | u " | (ü) | Small u, dieresis or umlaut mark
- ý | \375 | =FD | y \' | (ý) | Small y, acute accent
- þ | \376 | =FE | p | | (þ) | Small thorn, Icelandic
- ÿ | \377 | =FF | y " | (ÿ) | Small y, dieresis or umlaut mark
Notes for HTML documents
-
HTML entity names are given in the “MEANING” column only for ampersand, quote, less than, and greater than, which are significant in HTML syntax; and for the non-breaking space, which may be confused with ordinary space. HTML entity names exist for many other characters, but they are superfluous: the ISO-8859-1 eight-bit codes will work, by definition, on any browser.
-
The characters carriage return (ASCII CR) and line feed (ASCII NL, newline) are equivalent; they are treated as whitespace, except in <pre> contexts, where they force a line break. (However, a line feed is ignored if it immediately follows a carriage return.)
-
The horizontal tab character (ASCII HT) skips to the next tabbing column in <pre> contexts, and is treated as whitespace elsewhere.
-
The non-breaking space ( ) is honored even in non-<pre> contexts, and can be used to insert extra space between words, images, etc., like this: | |.
Other notes
-
Alternative Sun/X11 “Compose” sequences for the Icelandic “thorn” are “t h” (þ, lowercase) and “T H” (Þ, uppercase).
-
Note that the Sun/X11 “Compose” sequence for masculine ordinal (º) uses an underscore, while the feminine ordinal (ª) uses a minus sign. It takes a lot of imagination to come up with such ideas…
Additional references
- Martin Ramsch\’s iso8859-1 table.
- The HTML 2.0 Standard [Character Entity Sets] [HTML Coded Character Set]
- The HTML 3.0 specification [Latin-1 Character Entities]
- The HTML+ Discussion Document [Appendix II]
- An exhaustive entity table including HTML 2.0, HTML3.0, HTML+, with Postscript equivalents.
Composed by J. Stolfi from several sources found throughout the net.
http://www.ic.unicamp.br/~stolfi/EXPORT/www/ISO-8859-1-Encoding.html