Universiteit Leiden

Unicode


Unicode is a widely adopted character encoding standard that assigns a unique number (a code point) to each character. It is maintained by the Unicode Consortium. Code values can be found in the Unicode Character Code Charts or through the Character Name Index on the Unicode website.

In XML documents, the hexadecimal codes found in the Unicode Character Code Charts need to be preceded by &#x and followed by a semicolon (';'). The code E9 for the character é, for example, is written as &#xE9;.
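The rule for building such a character reference can be sketched in a few lines of Python. This is only an illustration: the helper name to_char_ref is hypothetical, and the element name "word" is made up for the example.

```python
import xml.etree.ElementTree as ET


def to_char_ref(ch: str) -> str:
    """Return the hexadecimal XML character reference for one character."""
    # ord() gives the Unicode code point; the X format prints it as uppercase hex.
    return f"&#x{ord(ch):X};"


print(to_char_ref("é"))  # &#xE9;
print(to_char_ref("Š"))  # &#x160;

# An XML parser resolves the reference back to the literal character.
element = ET.fromstring("<word>caf&#xE9;</word>")
print(element.text == "café")  # True
```

The same function works for any single character, including those outside the list on this page, because the reference is derived directly from the code point.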

In the XML language, so-called XML entities have been defined for the characters that are used in XML markup, such as the ampersand, the "less than" and "greater than" characters, and the single and the double quotes. When these literal characters are part of the actual text that needs to be encoded, they can be represented using the following XML entities: &amp;, &lt;, &gt;, &apos; and &quot;.
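In practice, the replacement of markup characters by XML entities is usually left to a library. The sketch below uses escape and unescape from Python's standard module xml.sax.saxutils; the wrapper names escape_text and unescape_text and the sample sentence are assumptions made for this example.

```python
from xml.sax.saxutils import escape, unescape

# escape() replaces &, < and > by default; the two quote characters
# have to be supplied as extra replacements.
QUOTE_ENTITIES = {"'": "&apos;", '"': "&quot;"}
QUOTE_CHARS = {"&apos;": "'", "&quot;": '"'}


def escape_text(text: str) -> str:
    """Replace the five XML markup characters by their entities."""
    return escape(text, QUOTE_ENTITIES)


def unescape_text(text: str) -> str:
    """Turn the five XML entities back into literal characters."""
    return unescape(text, QUOTE_CHARS)


sample = 'R&D says "x < y"'
encoded = escape_text(sample)
print(encoded)                           # R&amp;D says &quot;x &lt; y&quot;
print(unescape_text(encoded) == sample)  # True
```

The ampersand is escaped first, so that the ampersands introduced by the other entities are not escaped a second time.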

The list below gives the XML entities and the Unicode character references for some commonly used characters.

& &amp;
< &lt;
> &gt;
' &apos;
" &quot;

© &#xA9;
ß &#xDF;
£ &#xA3;
ƒ &#x192;

ä &#xE4;
ö &#xF6;
ü &#xFC;
ë &#xEB;
ï &#xEF;

Ä &#xC4;
Ö &#xD6;
Ü &#xDC;
Ë &#xCB;
Ï &#xCF;

à &#xE0;
ò &#xF2;
ù &#xF9;
è &#xE8;
ì &#xEC;

À &#xC0;
Ò &#xD2;
Ù &#xD9;
È &#xC8;
Ì &#xCC;

Á &#xC1;
Ó &#xD3;
Ú &#xDA;
É &#xC9;
Í &#xCD;

á &#xE1;
ó &#xF3;
ú &#xFA;
é &#xE9;
í &#xED;

 &#xC2;
Ô &#xD4;
Û &#xDB;
Ê &#xCA;
Î &#xCE;

â &#xE2;
ô &#xF4;
û &#xFB;
ê &#xEA;
î &#xEE;

à &#xC3;
ã &#xE3;
Õ &#xD5;
õ &#xF5;

± &#xB1;
º &#xBA;
Ç &#xC7;
ç &#xE7;
Š &#x160;
š &#x161;
> &#x3E;
½ &#xBD;