Friday 28 March 2014

HTML Entities


कुछ शब्द  HTML  मे reserved होते है उन्हें हम प्रयोग नही कर सकते है। जैसे कि less than (<) या greater than (>) चिन्ह का प्रयोग नही किया जा सकता है, because the क्योकि browser उन्हें tags के साथ मिला देगा।
तो वास्तव मे reserved characters(शब्दों) को देखने के लिये, हमें HTML source code में character entities का प्रयोग करना होगा । character entity कुछ इस तरह दिखाई देती है:

&entity_name;
या
&#entity_number;
less than चिन्ह लिखने के लिये हमें: &lt; या &#60; लिखना होगा
नोट : एक number के स्थान पर entity name लिखने का फ़ायदा यह है कि नाम याद रखना आसान होता है, जबकि, नुकसान यह है कि browsers सभी entity names को support नही करते है (जबकि वे entity numbers को बहुत अच्छी तरह support करते).
नोट : Entity names case sensitive होते है !




Result
Description
Entity Name
Entity Number

non-breaking space
&nbsp;
&#160;
less than
&lt;
&#60;
greater than
&gt;
&#62;
&
ampersand
&amp;
&#38;
¢
cent
&cent;
&#162;
£
pound
&pound;
&#163;
¥
yen
&yen;
&#165;
euro
&euro;
&#8364;
§
section
&sect;
&#167;
©
copyright
&copy;
&#169;
®
registered trademark
&reg;
&#174;
trademark
&trade;
&#8482;



Non-breaking Space

HTML मे सामान्यत: प्रयुक्त होने वाला character entity है non-breaking space (&nbsp;).
Browsers हमेशा HTML पेजों मे कुछ spaces घटा (truncate) देता है। यदि आपने अपने वाक्य मे 10 spaces दिये है तो पेज प्रदर्शित होने से पूर्व browser उनमें से 9 घटा देता । तो अपने  वाक्यों में spaces जोडने के लिये आपको &nbsp; character entity का प्रयोग करना होगा ।



ASCII Character
HTML Entity Code
Description

&#32;
space
!
&#33;
exclamation mark
"
&#34;
quotation mark
#
&#35;
number sign
$
&#36;
dollar sign
%
&#37;
percent sign
&
&#38;
ampersand
'
&#39;
apostrophe
(
&#40;
left parenthesis
)
&#41;
right parenthesis
*
&#42;
asterisk
+
&#43;
plus sign
,
&#44;
comma
-
&#45;
hyphen
.
&#46;
period
/
&#47;
slash
0
&#48;
digit 0
1
&#49;
digit 1
2
&#50;
digit 2
3
&#51;
digit 3
4
&#52;
digit 4
5
&#53;
digit 5
6
&#54;
digit 6
7
&#55;
digit 7
8
&#56;
digit 8
9
&#57;
digit 9
:
&#58;
colon
;
&#59;
semicolon
&#60;
less-than
=
&#61;
equals-to
&#62;
greater-than
?
&#63;
question mark
@
&#64;
at sign
A
&#65;
uppercase A
B
&#66;
uppercase B
C
&#67;
uppercase C
D
&#68;
uppercase D
E
&#69;
uppercase E
F
&#70;
uppercase F
G
&#71;
uppercase G
H
&#72;
uppercase H
I
&#73;
uppercase I
J
&#74;
uppercase J
K
&#75;
uppercase K
L
&#76;
uppercase L
M
&#77;
uppercase M
N
&#78;
uppercase N
O
&#79;
uppercase O
P
&#80;
uppercase P
Q
&#81;
uppercase Q
R
&#82;
uppercase R
S
&#83;
uppercase S
T
&#84;
uppercase T
U
&#85;
uppercase U
V
&#86;
uppercase V
W
&#87;
uppercase W
X
&#88;
uppercase X
Y
&#89;
uppercase Y
Z
&#90;
uppercase Z
[
&#91;
left square bracket
\
&#92;
backslash
]
&#93;
right square bracket
^
&#94;
caret
_
&#95;
underscore
`
&#96;
grave accent
a
&#97;
lowercase a
b
&#98;
lowercase b
c
&#99;
lowercase c
d
&#100;
lowercase d
e
&#101;
lowercase e
f
&#102;
lowercase f
g
&#103;
lowercase g
h
&#104;
lowercase h
i
&#105;
lowercase i
j
&#106;
lowercase j
k
&#107;
lowercase k
l
&#108;
lowercase l
m
&#109;
lowercase m
n
&#110;
lowercase n
o
&#111;
lowercase o
p
&#112;
lowercase p
q
&#113;
lowercase q
r
&#114;
lowercase r
s
&#115;
lowercase s
t
&#116;
lowercase t
u
&#117;
lowercase u
v
&#118;
lowercase v
w
&#119;
lowercase w
x
&#120;
lowercase x
y
&#121;
lowercase y
z
&#122;
lowercase z
{
&#123;
left curly brace
|
&#124;
vertical bar
}
&#125;
right curly brace
~
&#126;
tilde



ASCII Device Control Characters


ASCII Character
HTML Entity Code
Description
NUL
&#00;
null character
SOH
&#01;
start of header
STX
&#02;
start of text
ETX
&#03;
end of text
EOT
&#04;
end of transmission
ENQ
&#05;
enquiry
ACK
&#06;
acknowledge
BEL
&#07;
bell (ring)
BS
&#08;
backspace
HT
&#09;
horizontal tab
LF
&#10;
line feed
VT
&#11;
vertical tab
FF
&#12;
form feed
CR
&#13;
carriage return
SO
&#14;
shift out
SI
&#15;
shift in
DLE
&#16;
data link escape
DC1
&#17;
device control 1
DC2
&#18;
device control 2
DC3
&#19;
device control 3
DC4
&#20;
device control 4
NAK
&#21;
negative acknowledge
SYN
&#22;
synchronize
ETB
&#23;
end transmission block
CAN
&#24;
cancel
EM
&#25;
end of medium
SUB
&#26;
substitute
ESC
&#27;
escape
FS
&#28;
file separator
GS
&#29;
group separator
RS
&#30;
record separator
US
&#31;
unit separator



DEL
&#127;
delete (rubout)

ISO Character Sets


Character set
Description
Covers
ISO-8859-1
Latin alphabet part 1
North America, Western Europe, Latin America, the Caribbean, Canada, Africa
ISO-8859-2
Latin alphabet part 2
Eastern Europe
ISO-8859-3
Latin alphabet part 3
SE Europe, Esperanto, miscellaneous others
ISO-8859-4
Latin alphabet part 4
Scandinavia/Baltics (and others not in ISO-8859-1)
ISO-8859-5
Latin/Cyrillic part 5
The languages that are using a Cyrillic alphabet such as Bulgarian, Belarusian, Russian and Macedonian
ISO-8859-6
Latin/Arabic part 6
The languages that are using the Arabic alphabet
ISO-8859-7
Latin/Greek part 7
The modern Greek language as well as mathematical symbols derived from the Greek
ISO-8859-8
Latin/Hebrew part 8
The languages that are using the Hebrew alphabet
ISO-8859-9
Latin 5 part 9
The Turkish language. Same as ISO-8859-1 except Turkish characters replace Icelandic ones
ISO-8859-10
Latin 6 Lappish, Nordic, Eskimo
The Nordic languages
ISO-8859-15
Latin 9 (aka Latin 0)
Similar to ISO 8859-1 but replaces some less common symbols with the euro sign and some other missing characters
ISO-2022-JP
Latin/Japanese part 1
The Japanese language
ISO-2022-JP-2
Latin/Japanese part 2
The Japanese language
ISO-2022-KR
Latin/Korean part 1
The Korean language



The Unicode Standard


Character-set
Description
UTF-8
A character in UTF8 can be from 1 to 4 bytes long. UTF-8 can represent any character in the Unicode standard. UTF-8 is backwards compatible with ASCII. UTF-8 is the preferred encoding for e-mail and web pages
UTF-16
16-bit Unicode Transformation Format is a variable-length character encoding for Unicode, capable of encoding the entire Unicode repertoire. UTF-16 is used in major operating systems and environments, like Microsoft Windows 2000/XP/2003/Vista/CE and the Java and .NET byte code environments
Tip: The first 256 characters of Unicode character-sets correspond to the 256 characters of ISO-8859-1.
Tip: All HTML 4 processors already support UTF-8, and all XHTML and XML processors support UTF-8 and UTF-16!



Math Symbols Supported by HTML
Character
Entity Number
Entity Name
Description
&#8704;
&forall;
for all
&#8706;
&part;
part
&#8707;
&exist;
exists
&#8709;
&empty;
empty
&#8711;
&nabla;
nabla
&#8712;
&isin;
isin
&#8713;
&notin;
notin
&#8715;
&ni;
ni
&#8719;
&prod;
prod
&#8721;
&sum;
sum
&#8722;
&minus;
minus
&#8727;
&lowast;
lowast
&#8730;
&radic;
square root
&#8733;
&prop;
proportional to
&#8734;
&infin;
infinity
&#8736;
&ang;
angle
&#8743;
&and;
and
&#8744;
&or;
or
&#8745;
&cap;
cap
&#8746;
&cup;
cup
&#8747;
&int;
integral
&#8756;
&there4;
therefore
&#8764;
&sim;
similar to
&#8773;
&cong;
congruent to
&#8776;
&asymp;
almost equal
&#8800;
&ne;
not equal
&#8801;
&equiv;
equivalent
&#8804;
&le;
less or equal
&#8805;
&ge;
greater or equal
&#8834;
&sub;
subset of
&#8835;
&sup;
superset of
&#8836;
&nsub;
not subset of
&#8838;
&sube;
subset or equal
&#8839;
&supe;
superset or equal
&#8853;
&oplus;
circled plus
&#8855;
&otimes;
circled times
&#8869;
&perp;
perpendicular
&#8901;
&sdot;
dot operator

Greek Letters Supported by HTML

Character
Entity Number
Entity Name
Description
Α
&#913;
&Alpha;
Alpha
Β
&#914;
&Beta;
Beta
Γ
&#915;
&Gamma;
Gamma
Δ
&#916;
&Delta;
Delta
Ε
&#917;
&Epsilon;
Epsilon
Ζ
&#918;
&Zeta;
Zeta
Η
&#919;
&Eta;
Eta
Θ
&#920;
&Theta;
Theta
Ι
&#921;
&Iota;
Iota
Κ
&#922;
&Kappa;
Kappa
Λ
&#923;
&Lambda;
Lambda
Μ
&#924;
&Mu;
Mu
Ν
&#925;
&Nu;
Nu
Ξ
&#926;
&Xi;
Xi
Ο
&#927;
&Omicron;
Omicron
Π
&#928;
&Pi;
Pi
Ρ
&#929;
&Rho;
Rho

undefined

Sigmaf
Σ
&#931;
&Sigma;
Sigma
Τ
&#932;
&Tau;
Tau
Υ
&#933;
&Upsilon;
Upsilon
Φ
&#934;
&Phi;
Phi
Χ
&#935;
&Chi;
Chi
Ψ
&#936;
&Psi;
Psi
Ω
&#937;
&Omega;
Omega




α
&#945;
&alpha;
alpha
β
&#946;
&beta;
beta
γ
&#947;
&gamma;
gamma
δ
&#948;
&delta;
delta
ε
&#949;
&epsilon;
epsilon
ζ
&#950;
&zeta;
zeta
η
&#951;
&eta;
eta
θ
&#952;
&theta;
theta
ι
&#953;
&iota;
iota
κ
&#954;
&kappa;
kappa
λ
&#955;
&lambda;
lambda
μ
&#956;
&mu;
mu
ν
&#957;
&nu;
nu
ξ
&#958;
&xi;
xi
ο
&#959;
&omicron;
omicron
π
&#960;
&pi;
pi
ρ
&#961;
&rho;
rho
ς
&#962;
&sigmaf;
sigmaf
σ
&#963;
&sigma;
sigma
τ
&#964;
&tau;
tau
υ
&#965;
&upsilon;
upsilon
φ
&#966;
&phi;
phi
χ
&#967;
&chi;
chi
ψ
&#968;
&psi;
psi
ω
&#969;
&omega;
omega




ϑ
&#977;
&thetasym;
theta symbol
ϒ
&#978;
&upsih;
upsilon symbol
ϖ
&#982;
&piv;
pi symbol

Other Entities Supported by HTML

Character
Entity Number
Entity Name
Description
Œ
&#338;
&OElig;
capital ligature OE
œ
&#339;
&oelig;
small ligature oe
Š
&#352;
&Scaron;
capital S with caron
š
&#353;
&scaron;
small S with caron
Ÿ
&#376;
&Yuml;
capital Y with diaeres
ƒ
&#402;
&fnof;
f with hook
ˆ
&#710;
&circ;
modifier letter circumflex accent
˜
&#732;
&tilde;
small tilde
&#8194;
&ensp;
en space
&#8195;
&emsp;
em space
&#8201;
&thinsp;
thin space
&#8204;
&zwnj;
zero width non-joiner
&#8205;
&zwj;
zero width joiner
&#8206;
&lrm;
left-to-right mark
&#8207;
&rlm;
right-to-left mark
&#8211;
&ndash;
en dash
&#8212;
&mdash;
em dash
&#8216;
&lsquo;
left single quotation mark
&#8217;
&rsquo;
right single quotation mark
&#8218;
&sbquo;
single low-9 quotation mark
&#8220;
&ldquo;
left double quotation mark
&#8221;
&rdquo;
right double quotation mark
&#8222;
&bdquo;
double low-9 quotation mark
&#8224;
&dagger;
dagger
&#8225;
&Dagger;
double dagger
&#8226;
&bull;
bullet
&#8230;
&hellip;
horizontal ellipsis
&#8240;
&permil;
per mille 
&#8242;
&prime;
minutes
&#8243;
&Prime;
seconds
&#8249;
&lsaquo;
single left angle quotation
&#8250;
&rsaquo;
single right angle quotation
&#8254;
&oline;
overline
&#8364;
&euro;
euro
&#8482; or &#153;
&trade;
trademark
&#8592;
&larr;
left arrow
&#8593;
&uarr;
up arrow
&#8594;
&rarr;
right arrow
&#8595;
&darr;
down arrow
&#8596;
&harr;
left right arrow
&#8629;
&crarr;
carriage return arrow
&#8968;
&lceil;
left ceiling
&#8969;
&rceil;
right ceiling
&#8970;
&lfloor;
left floor
&#8971;
&rfloor;
right floor
&#9674;
&loz;
lozenge
&#9824;
&spades;
spade
&#9827;
&clubs;
club
&#9829;
&hearts;
heart
&#9830;
&diams;
diamond


Reserved Characters in HTML


Character
Entity Number
Entity Name
Description
"
&#34;
&quot;
quotation mark
'
&#39;
&apos;
apostrophe 
&
&#38;
&amp;
ampersand
&#60;
&lt;
less-than
&#62;
&gt;
greater-than
Note: Entity names are case sensitive!


ISO 8859-1 Symbols

Character
Entity Number
Entity Name
Description

&#160;
&nbsp;
non-breaking space
¡
&#161;
&iexcl;
inverted exclamation mark
¢
&#162;
&cent;
cent
£
&#163;
&pound;
pound
¤
&#164;
&curren;
currency
¥
&#165;
&yen;
yen
¦
&#166;
&brvbar;
broken vertical bar
§
&#167;
&sect;
section
¨
&#168;
&uml;
spacing diaeresis
©
&#169;
&copy;
copyright
ª
&#170;
&ordf;
feminine ordinal indicator
«
&#171;
&laquo;
angle quotation mark (left)
¬
&#172;
&not;
negation
­­
&#173;
&shy;
soft hyphen
®
&#174;
&reg;
registered trademark
¯
&#175;
&macr;
spacing macron
°
&#176;
&deg;
degree
±
&#177;
&plusmn;
plus-or-minus 
²
&#178;
&sup2;
superscript 2
³
&#179;
&sup3;
superscript 3
´
&#180;
&acute;
spacing acute
µ
&#181;
&micro;
micro
&#182;
&para;
paragraph
·
&#183;
&middot;
middle dot
¸
&#184;
&cedil;
spacing cedilla
¹
&#185;
&sup1;
superscript 1
º
&#186;
&ordm;
masculine ordinal indicator
»
&#187;
&raquo;
angle quotation mark (right)
¼
&#188;
&frac14;
fraction 1/4
½
&#189;
&frac12;
fraction 1/2
¾
&#190;
&frac34;
fraction 3/4
¿
&#191;
&iquest;
inverted question mark
×
&#215;
&times;
multiplication
÷
&#247;
&divide;
division

ISO 8859-1 Characters

Character
Entity Number
Entity Name
Description
À
&#192;
&Agrave;
capital a, grave accent
Á
&#193;
&Aacute;
capital a, acute accent
Â
&#194;
&Acirc;
capital a, circumflex accent
Ã
&#195;
&Atilde;
capital a, tilde
Ä
&#196;
&Auml;
capital a, umlaut mark
Å
&#197;
&Aring;
capital a, ring
Æ
&#198;
&AElig;
capital ae
Ç
&#199;
&Ccedil;
capital c, cedilla
È
&#200;
&Egrave;
capital e, grave accent
É
&#201;
&Eacute;
capital e, acute accent
Ê
&#202;
&Ecirc;
capital e, circumflex accent
Ë
&#203;
&Euml;
capital e, umlaut mark
Ì
&#204;
&Igrave;
capital i, grave accent
Í
&#205;
&Iacute;
capital i, acute accent
Î
&#206;
&Icirc;
capital i, circumflex accent
Ï
&#207;
&Iuml;
capital i, umlaut mark
Ð
&#208;
&ETH;
capital eth, Icelandic
Ñ
&#209;
&Ntilde;
capital n, tilde
Ò
&#210;
&Ograve;
capital o, grave accent
Ó
&#211;
&Oacute;
capital o, acute accent
Ô
&#212;
&Ocirc;
capital o, circumflex accent
Õ
&#213;
&Otilde;
capital o, tilde
Ö
&#214;
&Ouml;
capital o, umlaut mark
Ø
&#216;
&Oslash;
capital o, slash
Ù
&#217;
&Ugrave;
capital u, grave accent
Ú
&#218;
&Uacute;
capital u, acute accent
Û
&#219;
&Ucirc;
capital u, circumflex accent
Ü
&#220;
&Uuml;
capital u, umlaut mark
Ý
&#221;
&Yacute;
capital y, acute accent
Þ
&#222;
&THORN;
capital THORN, Icelandic
ß
&#223;
&szlig;
small sharp s, German
à
&#224;
&agrave;
small a, grave accent
á
&#225;
&aacute;
small a, acute accent
â
&#226;
&acirc;
small a, circumflex accent
ã
&#227;
&atilde;
small a, tilde
ä
&#228;
&auml;
small a, umlaut mark
å
&#229;
&aring;
small a, ring
æ
&#230;
&aelig;
small ae
ç
&#231;
&ccedil;
small c, cedilla
è
&#232;
&egrave;
small e, grave accent
é
&#233;
&eacute;
small e, acute accent
ê
&#234;
&ecirc;
small e, circumflex accent
ë
&#235;
&euml;
small e, umlaut mark
ì
&#236;
&igrave;
small i, grave accent
í
&#237;
&iacute;
small i, acute accent
î
&#238;
&icirc;
small i, circumflex accent
ï
&#239;
&iuml;
small i, umlaut mark
ð
&#240;
&eth;
small eth, Icelandic
ñ
&#241;
&ntilde;
small n, tilde
ò
&#242;
&ograve;
small o, grave accent
ó
&#243;
&oacute;
small o, acute accent
ô
&#244;
&ocirc;
small o, circumflex accent
õ
&#245;
&otilde;
small o, tilde
ö
&#246;
&ouml;
small o, umlaut mark
ø
&#248;
&oslash;
small o, slash
ù
&#249;
&ugrave;
small u, grave accent
ú
&#250;
&uacute;
small u, acute accent
û
&#251;
&ucirc;
small u, circumflex accent
ü
&#252;
&uuml;
small u, umlaut mark
ý
&#253;
&yacute;
small y, acute accent
þ
&#254;
&thorn;
small thorn, Icelandic
ÿ
&#255;
&yuml;
small y, umlaut mark

0 comments:

Post a Comment