<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<!-- X-URL: http://www.ramsch.org/martin/uni/fmi-hp/iso8859-1.html -->
<!-- Date: Tue, 28 Dec 2004 20:24:09 GMT -->
<!-- Last-Modified: Mon, 15 May 2000 09:37:37 GMT -->
<HTML>
<HEAD>
<TITLE>Martin Ramsch - iso8859-1 table</TITLE>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<BASE HREF="http://www.ramsch.org/martin/uni/fmi-hp/iso8859-1.html">
</HEAD>

<BODY> 

<H1 ALIGN=center>iso8859-1 table, with cp-1252</H1> 

<PRE>
Description                               Code            Entity name   
===================================       ============    ==============
quotation mark                            &amp;#34;  --> &#34;    &amp;quot;   --> &quot;
ampersand                                 &amp;#38;  --> &#38;    &amp;amp;    --> &amp;
less-than sign                            &amp;#60;  --> &#60;    &amp;lt;     --> &lt;
greater-than sign                         &amp;#62;  --> &#62;    &amp;gt;     --> &gt;

Description                          Char Code            Entity name   
===================================  ==== ============    ==============
euro sign                              €    &amp;#128; --> &#128;
undefined                              �    &amp;#129; --> &#129;
single low-9 quotation mark            ‚    &amp;#130; --> &#130;
latin small letter f with hook         ƒ    &amp;#131; --> &#131;
double low-9 quotation mark            „    &amp;#132; --> &#132;
horizontal ellipsis                    …    &amp;#133; --> &#133;
dagger                                 †    &amp;#134; --> &#134;
double dagger                          ‡    &amp;#135; --> &#135;
modifier letter circumflex accent      ˆ    &amp;#136; --> &#136;
per mille sign                         ‰    &amp;#137; --> &#137;
latin capital letter s with caron      Š    &amp;#138; --> &#138;
single left-pointing angle quote mark  ‹    &amp;#139; --> &#139;
latin capital ligature oe              Œ    &amp;#140; --> &#140;
undefined                              �    &amp;#141; --> &#141;
latin capital letter z with caron      Ž    &amp;#142; --> &#142;
undefined                              �    &amp;#143; --> &#143;

undefined                              �    &amp;#144; --> &#144;
left single quotation mark             ‘    &amp;#145; --> &#145;
right single quotation mark            ’    &amp;#146; --> &#146;
left double quotation mark             “    &amp;#147; --> &#147;
right double quotation mark            ”    &amp;#148; --> &#148;
bullet                                 •    &amp;#149; --> &#149;
en dash                                –    &amp;#150; --> &#150;
em dash                                —    &amp;#151; --> &#151;
small tilde                            ˜    &amp;#152; --> &#152;
trade mark sign                        ™    &amp;#153; --> &#153;
latin small letter s with caron        š    &amp;#154; --> &#154;
single right-pointing angle quote mark ›    &amp;#155; --> &#155;
latin small ligature oe                œ    &amp;#156; --> &#156;
undefined                              �    &amp;#157; --> &#157;
latin small letter z with caron        ž    &amp;#158; --> &#158;
latin capital letter y with diaeresis  Ÿ    &amp;#159; --> &#159;

non-breaking space                        &amp;#160; --> &#160;    &amp;nbsp;   --> &nbsp;
inverted exclamation                 ¡    &amp;#161; --> &#161;    &amp;iexcl;  --> &iexcl;
cent sign                            ¢    &amp;#162; --> &#162;    &amp;cent;   --> &cent;
pound sterling                       £    &amp;#163; --> &#163;    &amp;pound;  --> &pound;
general currency sign                ¤    &amp;#164; --> &#164;    &amp;curren; --> &curren;
yen sign                             ¥    &amp;#165; --> &#165;    &amp;yen;    --> &yen;
broken vertical bar                  ¦    &amp;#166; --> &#166;    &amp;brvbar; --> &brvbar;
                                             Non-standard &amp;brkbar; --> &brkbar;
section sign                         §    &amp;#167; --> &#167;    &amp;sect;   --> &sect;
umlaut (dieresis)                    ¨    &amp;#168; --> &#168;    &amp;uml;    --> &uml;
                                             Non-standard &amp;die;    --> &die;
copyright                            ©    &amp;#169; --> &#169;    &amp;copy;   --> &copy;
feminine ordinal                     ª    &amp;#170; --> &#170;    &amp;ordf;   --> &ordf;
left angle quote, guillemotleft      «    &amp;#171; --> &#171;    &amp;laquo;  --> &laquo;
not sign                             ¬    &amp;#172; --> &#172;    &amp;not;    --> &not;
soft hyphen                          ­    &amp;#173; --> &#173;    &amp;shy;    --> &shy;
registered trademark                 ®    &amp;#174; --> &#174;    &amp;reg;    --> &reg;
macron accent                        ¯    &amp;#175; --> &#175;    &amp;macr;   --> &macr;
                                             Non-standard &amp;hibar;  --> &hibar;
degree sign                          °    &amp;#176; --> &#176;    &amp;deg;    --> &deg;
plus or minus                        ±    &amp;#177; --> &#177;    &amp;plusmn; --> &plusmn;
superscript two                      ²    &amp;#178; --> &#178;    &amp;sup2;   --> &sup2;
superscript three                    ³    &amp;#179; --> &#179;    &amp;sup3;   --> &sup3;
acute accent                         ´    &amp;#180; --> &#180;    &amp;acute;  --> &acute;
micro sign                           µ    &amp;#181; --> &#181;    &amp;micro;  --> &micro;
paragraph sign                       ¶    &amp;#182; --> &#182;    &amp;para;   --> &para;
middle dot                           ·    &amp;#183; --> &#183;    &amp;middot; --> &middot;
cedilla                              ¸    &amp;#184; --> &#184;    &amp;cedil;  --> &cedil;
superscript one                      ¹    &amp;#185; --> &#185;    &amp;sup1;   --> &sup1;
masculine ordinal                    º    &amp;#186; --> &#186;    &amp;ordm;   --> &ordm;
right angle quote, guillemotright    »    &amp;#187; --> &#187;    &amp;raquo;  --> &raquo;
fraction one-fourth                  ¼    &amp;#188; --> &#188;    &amp;frac14; --> &frac14;
fraction one-half                    ½    &amp;#189; --> &#189;    &amp;frac12; --> &frac12;
fraction three-fourths               ¾    &amp;#190; --> &#190;    &amp;frac34; --> &frac34;
inverted question mark               ¿    &amp;#191; --> &#191;    &amp;iquest; --> &iquest;
capital A, grave accent              À    &amp;#192; --> &#192;    &amp;Agrave; --> &Agrave;
capital A, acute accent              Á    &amp;#193; --> &#193;    &amp;Aacute; --> &Aacute;
capital A, circumflex accent         Â    &amp;#194; --> &#194;    &amp;Acirc;  --> &Acirc;
capital A, tilde                     Ã    &amp;#195; --> &#195;    &amp;Atilde; --> &Atilde;
capital A, dieresis or umlaut mark   Ä    &amp;#196; --> &#196;    &amp;Auml;   --> &Auml;
capital A, ring                      Å    &amp;#197; --> &#197;    &amp;Aring;  --> &Aring;
capital AE diphthong (ligature)      Æ    &amp;#198; --> &#198;    &amp;AElig;  --> &AElig;
capital C, cedilla                   Ç    &amp;#199; --> &#199;    &amp;Ccedil; --> &Ccedil;
capital E, grave accent              È    &amp;#200; --> &#200;    &amp;Egrave; --> &Egrave;
capital E, acute accent              É    &amp;#201; --> &#201;    &amp;Eacute; --> &Eacute;
capital E, circumflex accent         Ê    &amp;#202; --> &#202;    &amp;Ecirc;  --> &Ecirc;
capital E, dieresis or umlaut mark   Ë    &amp;#203; --> &#203;    &amp;Euml;   --> &Euml;
capital I, grave accent              Ì    &amp;#204; --> &#204;    &amp;Igrave; --> &Igrave;
capital I, acute accent              Í    &amp;#205; --> &#205;    &amp;Iacute; --> &Iacute;
capital I, circumflex accent         Î    &amp;#206; --> &#206;    &amp;Icirc;  --> &Icirc;
capital I, dieresis or umlaut mark   Ï    &amp;#207; --> &#207;    &amp;Iuml;   --> &Iuml;
capital Eth, Icelandic               Ð    &amp;#208; --> &#208;    &amp;ETH;    --> &ETH;
                                             Non-standard &amp;Dstrok; --> &Dstrok;
capital N, tilde                     Ñ    &amp;#209; --> &#209;    &amp;Ntilde; --> &Ntilde;
capital O, grave accent              Ò    &amp;#210; --> &#210;    &amp;Ograve; --> &Ograve;
capital O, acute accent              Ó    &amp;#211; --> &#211;    &amp;Oacute; --> &Oacute;
capital O, circumflex accent         Ô    &amp;#212; --> &#212;    &amp;Ocirc;  --> &Ocirc;
capital O, tilde                     Õ    &amp;#213; --> &#213;    &amp;Otilde; --> &Otilde;
capital O, dieresis or umlaut mark   Ö    &amp;#214; --> &#214;    &amp;Ouml;   --> &Ouml;
multiply sign                        ×    &amp;#215; --> &#215;    &amp;times;  --> &times;
capital O, slash                     Ø    &amp;#216; --> &#216;    &amp;Oslash; --> &Oslash;
capital U, grave accent              Ù    &amp;#217; --> &#217;    &amp;Ugrave; --> &Ugrave;
capital U, acute accent              Ú    &amp;#218; --> &#218;    &amp;Uacute; --> &Uacute;
capital U, circumflex accent         Û    &amp;#219; --> &#219;    &amp;Ucirc;  --> &Ucirc;
capital U, dieresis or umlaut mark   Ü    &amp;#220; --> &#220;    &amp;Uuml;   --> &Uuml;
capital Y, acute accent              Ý    &amp;#221; --> &#221;    &amp;Yacute; --> &Yacute;
capital THORN, Icelandic             Þ    &amp;#222; --> &#222;    &amp;THORN;  --> &THORN;
small sharp s, German (sz ligature)  ß    &amp;#223; --> &#223;    &amp;szlig;  --> &szlig;
small a, grave accent                à    &amp;#224; --> &#224;    &amp;agrave; --> &agrave;
small a, acute accent                á    &amp;#225; --> &#225;    &amp;aacute; --> &aacute;
small a, circumflex accent           â    &amp;#226; --> &#226;    &amp;acirc;  --> &acirc;
small a, tilde                       ã    &amp;#227; --> &#227;    &amp;atilde; --> &atilde;
small a, dieresis or umlaut mark     ä    &amp;#228; --> &#228;    &amp;auml;   --> &auml;
small a, ring                        å    &amp;#229; --> &#229;    &amp;aring;  --> &aring;
small ae diphthong (ligature)        æ    &amp;#230; --> &#230;    &amp;aelig;  --> &aelig;
small c, cedilla                     ç    &amp;#231; --> &#231;    &amp;ccedil; --> &ccedil;
small e, grave accent                è    &amp;#232; --> &#232;    &amp;egrave; --> &egrave;
small e, acute accent                é    &amp;#233; --> &#233;    &amp;eacute; --> &eacute;
small e, circumflex accent           ê    &amp;#234; --> &#234;    &amp;ecirc;  --> &ecirc;
small e, dieresis or umlaut mark     ë    &amp;#235; --> &#235;    &amp;euml;   --> &euml;
small i, grave accent                ì    &amp;#236; --> &#236;    &amp;igrave; --> &igrave;
small i, acute accent                í    &amp;#237; --> &#237;    &amp;iacute; --> &iacute;
small i, circumflex accent           î    &amp;#238; --> &#238;    &amp;icirc;  --> &icirc;
small i, dieresis or umlaut mark     ï    &amp;#239; --> &#239;    &amp;iuml;   --> &iuml;
small eth, Icelandic                 ð    &amp;#240; --> &#240;    &amp;eth;    --> &eth;
small n, tilde                       ñ    &amp;#241; --> &#241;    &amp;ntilde; --> &ntilde;
small o, grave accent                ò    &amp;#242; --> &#242;    &amp;ograve; --> &ograve;
small o, acute accent                ó    &amp;#243; --> &#243;    &amp;oacute; --> &oacute;
small o, circumflex accent           ô    &amp;#244; --> &#244;    &amp;ocirc;  --> &ocirc;
small o, tilde                       õ    &amp;#245; --> &#245;    &amp;otilde; --> &otilde;
small o, dieresis or umlaut mark     ö    &amp;#246; --> &#246;    &amp;ouml;   --> &ouml;
division sign                        ÷    &amp;#247; --> &#247;    &amp;divide; --> &divide;
small o, slash                       ø    &amp;#248; --> &#248;    &amp;oslash; --> &oslash;
small u, grave accent                ù    &amp;#249; --> &#249;    &amp;ugrave; --> &ugrave;
small u, acute accent                ú    &amp;#250; --> &#250;    &amp;uacute; --> &uacute;
small u, circumflex accent           û    &amp;#251; --> &#251;    &amp;ucirc;  --> &ucirc;
small u, dieresis or umlaut mark     ü    &amp;#252; --> &#252;    &amp;uuml;   --> &uuml;
small y, acute accent                ý    &amp;#253; --> &#253;    &amp;yacute; --> &yacute;
small thorn, Icelandic               þ    &amp;#254; --> &#254;    &amp;thorn;  --> &thorn;
small y, dieresis or umlaut mark     ÿ    &amp;#255; --> &#255;    &amp;yuml;   --> &yuml;
</PRE>
<!-- removed: second /PRE, a hack for HotJava 1.0 preBeta 1 -->
<HR>

<STRONG>How to read</STRONG> this table.  The columns are:
<DL COMPACT>
<DT>1st:<DD>textual <EM>description</EM> of the character
<DT>2nd:<DD>character inserted directly into the HTML page as <EM>one
            byte</EM>
<DT>3rd:<DD>character written as <EM>numeric HTML entity</EM>, in the
            format:<BR>"how it looks literally" <CODE>--&gt;</CODE>
            "what your browser does with it"
<DT>4th:<DD>character written as <EM>symbolic HTML entity</EM>, in the
            format:<BR>"how it looks literally" <CODE>--&gt;</CODE>
            "what your browser does with it"
</DL>

So for example, if you see something like "<CODE>&amp;divide; --&gt;
&amp;divide;</CODE>" in the 4th column, this means your browser
doesn't know the entity name "divide" and just displays it
literally.

<P>
<STRONG>This table</STRONG> grew out of an overview of the "ISO
Latin-1 Character Set" related to the Hyper-G Text Format
(<A HREF="http://www.hyperwave.de/HTFdoc">HTF</A>).

The entity names <CODE>&amp;brkbar;</CODE> and <CODE>&amp;Dstrok;</CODE>
seem to be unique to HTF.

The entity name <CODE>&amp;hibar;</CODE> was supported by X Mosaic
but seems to have been replaced by <CODE>&amp;macr;</CODE>.

The entity names <CODE>&amp;uml;</CODE> and <CODE>&amp;die;</CODE> should
be equivalent.
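
<P>
As a rough illustration of the difference between standard and
non-standard names, the following Python sketch looks entity names up in
the standard library table <CODE>html.entities.name2codepoint</CODE>,
which covers the HTML 4.01 entity set. The helper names
<CODE>lookup</CODE> and <CODE>describe</CODE> and the two name lists are
chosen here for illustration only, and a real browser may know more or
fewer names than this table does.
<PRE>
```python
from html.entities import name2codepoint

# Names taken from the 4th column of the table above.
STANDARD = ["brvbar", "uml", "macr", "ETH", "divide"]
NON_STANDARD = ["brkbar", "die", "hibar", "Dstrok"]


def lookup(name):
    """Return the code point for an HTML 4.01 entity name, or None."""
    return name2codepoint.get(name)


def describe(name):
    """Say what a reader limited to the HTML 4.01 names would do."""
    code = lookup(name)
    if code is None:
        return name + ": unknown name, would be shown literally"
    return name + ": code " + str(code) + ", character " + repr(chr(code))


if __name__ == "__main__":
    for entity_name in STANDARD + NON_STANDARD:
        print(describe(entity_name))
```
</PRE>
Under this assumption the five standard names resolve to the codes listed
in the 3rd column, while the four names marked "Non-standard" are
reported as unknown.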

<P><STRONG>The standards stuff:</STRONG>
The 
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/">HTML 2.0 Standard</A>
includes a section on
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/html-spec_9.html#SEC99">Character Entity Sets</A>
and an overview of the
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html-spec/html-spec_13.html#SEC106">HTML Coded Character Set</A>
(The entity names are derived from <A HREF="http://www.ucc.ie/info/net/isolat1.html">ISO 8879</A>).
<BR>

Or have a look at the
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html3/latin1.html">Latin-1 Character Entities</A>
as listed in a draft for the
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/html3/CoverPage.html">HTML 3.0 specification</A>.
<BR>

The
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_59.html">Appendix II</A>
of CERN's
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_1.html">HTML+ Discussion Document</A>
contains a
<A HREF="http://www.w3.org/hypertext/WWW/MarkUp/HTMLPlus/htmlplus_table.ps">table</A>
(in PostScript format) of the proposed character entities for HTML+ and their
corresponding character codes for Unicode and the Adobe Latin-1 &amp; Symbol
character sets.
<P>

<STRONG>Please note</STRONG> that there is nothing wrong with using
characters of ISO Latin-1 above 127: the normal transmission protocol
for the WWW,
<A HREF="http://www.w3.org/pub/WWW/Protocols/rfc1945/rfc1945">HTTP/1.0</A>,
uses 8-bit ISO Latin-1 as its default encoding.
(Thanks to Roman 
Czyborra for pointing this out!)
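<P>
The "one byte per character" property, and the special status of the
codes 128 to 159, can be checked with a small Python sketch. It relies
only on the standard codecs <CODE>iso-8859-1</CODE> and
<CODE>cp1252</CODE>; the function names <CODE>to_latin1</CODE> and
<CODE>decode_both</CODE> are made up for this example, and the treatment
of unassigned cp-1252 bytes shown here is that of Python's codec, which
other software may handle differently.
<PRE>
```python
def to_latin1(text):
    """Encode text as ISO 8859-1: exactly one byte per character."""
    return text.encode("iso-8859-1")


def decode_both(byte_value):
    """Decode one byte as ISO 8859-1 and, where defined, as cp-1252."""
    raw = bytes([byte_value])
    latin1 = raw.decode("iso-8859-1")  # defined for all 256 byte values
    try:
        cp1252 = raw.decode("cp1252")
    except UnicodeDecodeError:
        cp1252 = None  # byte has no assignment in Python's cp1252 codec
    return latin1, cp1252


if __name__ == "__main__":
    # e with acute accent, division sign, fraction one-half
    sample = "\u00e9\u00f7\u00bd"
    print(list(to_latin1(sample)))
    for value in (128, 129, 233):
        print(value, decode_both(value))
```
</PRE>
For codes 160 to 255 both decodings agree. For 128 to 159, ISO 8859-1
yields a control character, while cp-1252 yields either a printable
character such as the euro sign or nothing at all.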
<P>

<STRONG>Other information:</STRONG>
<UL>

<LI><STRONG>Kevin J. Brewer</STRONG> has done two very good pages on the subject:
  <UL>
   <LI><A HREF="http://www.bbsinc.com/iso8859.html">ASCII - ISO 8859-1 (Latin-1) with HTML 3.0 Entities Table</A> and
   <LI><A HREF="http://www.bbsinc.com/iso8879.html">ISO 8879 Entities Gopher Menu</A>
  </UL>

<LI>The excellent overview of the series of
    <A HREF="http://czyborra.com/charsets/iso8859.html">ISO 8859
    character sets</A> compiled by Roman Czyborra.

<LI>Also have a look at Alan Flavell's page of
    <A HREF="http://ppewww.ph.gla.ac.uk/%7Eflavell/iso8859/iso8859-pointers.html">pointers
    to information about ISO8859</A>. It's very well written!

<LI>Maybe also of interest to you is the
    <A HREF="ftp://ftp.vlsivie.tuwien.ac.at/pub/8bit/FAQ-ISO-8859-1">ISO 
     8859-1 FAQ</A> by Michael Gschwind
    (<A HREF="mailto:mike@vlsivie.tuwien.ac.at">mike@vlsivie.tuwien.ac.at</A>),
    part of his page on
    <A HREF="http://www.vlsivie.tuwien.ac.at/mike/i18n.html">Internationalization</A>.

<LI>For users of X11R5 on SunOS systems: the
    <A HREF="Compose.txt">table of the compose combinations</A>
    (also coded <A HREF="Compose.html">with entities</A> where possible).
     It's taken from the MIT X sources in
     <CODE>server/ddx/sun/Compose.list</CODE>.

<LI>Finally you could have a look at
    <A HREF="ftp://ds.internic.net/rfc/rfc1345.txt">RFC 1345: 
     Character Mnemonics &amp; Character Sets</A>
     by K. Simonsen (06/11/92, 103 pages, approx. 240 kbyte).

</UL>


<HR>

<ADDRESS><A HREF="http://ramsch.home.pages.de/">Martin Ramsch</A>, 16.02.1994, 07.01.1996, 01.07.1996, 1998-10-09, 2000-05-15</ADDRESS>

</BODY>
</HTML>