कुछ शब्द HTML मे reserved होते है उन्हें हम प्रयोग नही कर सकते है। जैसे कि less than (<) या greater than (>) चिन्ह का प्रयोग नही किया जा सकता है, because the क्योकि browser उन्हें tags के साथ मिला देगा।
तो वास्तव मे reserved characters(शब्दों) को देखने के लिये, हमें HTML source code में character entities का प्रयोग करना होगा । character entity कुछ इस तरह दिखाई देती है:
&entity_name;
या
&#entity_number;
less than चिन्ह लिखने के लिये हमें: <
या < लिखना होगानोट : एक number के स्थान पर entity name लिखने का फ़ायदा यह है कि नाम याद रखना आसान होता है, जबकि, नुकसान यह है कि browsers सभी entity names को support नही करते है (जबकि वे entity numbers को बहुत अच्छी तरह support करते).
नोट : Entity names case sensitive होते है !
Result
|
Description
|
Entity Name
|
Entity Number
|
non-breaking space
|
|
 
|
|
<
|
less than
|
<
|
<
|
>
|
greater than
|
>
|
>
|
&
|
ampersand
|
&
|
&
|
¢
|
cent
|
¢
|
¢
|
£
|
pound
|
£
|
£
|
¥
|
yen
|
¥
|
¥
|
€
|
euro
|
€
|
€
|
§
|
section
|
§
|
§
|
©
|
copyright
|
©
|
©
|
®
|
registered trademark
|
®
|
®
|
™
|
trademark
|
™
|
™
|
Non-breaking Space
HTML मे सामान्यत: प्रयुक्त होने वाला character entity है non-breaking space ( ).Browsers हमेशा HTML पेजों मे कुछ spaces घटा (truncate) देता है। यदि आपने अपने वाक्य मे 10 spaces दिये है तो पेज प्रदर्शित होने से पूर्व browser उनमें से 9 घटा देता । तो अपने वाक्यों में spaces जोडने के लिये आपको character entity का प्रयोग करना होगा ।
ASCII Character
|
HTML Entity Code
|
Description
|
 
|
space
|
|
!
|
!
|
exclamation mark
|
"
|
"
|
quotation mark
|
#
|
#
|
number sign
|
$
|
$
|
dollar sign
|
%
|
%
|
percent sign
|
&
|
&
|
ampersand
|
'
|
'
|
apostrophe
|
(
|
(
|
left parenthesis
|
)
|
)
|
right parenthesis
|
*
|
*
|
asterisk
|
+
|
+
|
plus sign
|
,
|
,
|
comma
|
-
|
-
|
hyphen
|
.
|
.
|
period
|
/
|
/
|
slash
|
0
|
0
|
digit 0
|
1
|
1
|
digit 1
|
2
|
2
|
digit 2
|
3
|
3
|
digit 3
|
4
|
4
|
digit 4
|
5
|
5
|
digit 5
|
6
|
6
|
digit 6
|
7
|
7
|
digit 7
|
8
|
8
|
digit 8
|
9
|
9
|
digit 9
|
:
|
:
|
colon
|
;
|
;
|
semicolon
|
<
|
<
|
less-than
|
=
|
=
|
equals-to
|
>
|
>
|
greater-than
|
?
|
?
|
question mark
|
@
|
@
|
at sign
|
A
|
A
|
uppercase A
|
B
|
B
|
uppercase B
|
C
|
C
|
uppercase C
|
D
|
D
|
uppercase D
|
E
|
E
|
uppercase E
|
F
|
F
|
uppercase F
|
G
|
G
|
uppercase G
|
H
|
H
|
uppercase H
|
I
|
I
|
uppercase I
|
J
|
J
|
uppercase J
|
K
|
K
|
uppercase K
|
L
|
L
|
uppercase L
|
M
|
M
|
uppercase M
|
N
|
N
|
uppercase N
|
O
|
O
|
uppercase O
|
P
|
P
|
uppercase P
|
Q
|
Q
|
uppercase Q
|
R
|
R
|
uppercase R
|
S
|
S
|
uppercase S
|
T
|
T
|
uppercase T
|
U
|
U
|
uppercase U
|
V
|
V
|
uppercase V
|
W
|
W
|
uppercase W
|
X
|
X
|
uppercase X
|
Y
|
Y
|
uppercase Y
|
Z
|
Z
|
uppercase Z
|
[
|
[
|
left square bracket
|
\
|
\
|
backslash
|
]
|
]
|
right square bracket
|
^
|
^
|
caret
|
_
|
_
|
underscore
|
`
|
`
|
grave accent
|
a
|
a
|
lowercase a
|
b
|
b
|
lowercase b
|
c
|
c
|
lowercase c
|
d
|
d
|
lowercase d
|
e
|
e
|
lowercase e
|
f
|
f
|
lowercase f
|
g
|
g
|
lowercase g
|
h
|
h
|
lowercase h
|
i
|
i
|
lowercase i
|
j
|
j
|
lowercase j
|
k
|
k
|
lowercase k
|
l
|
l
|
lowercase l
|
m
|
m
|
lowercase m
|
n
|
n
|
lowercase n
|
o
|
o
|
lowercase o
|
p
|
p
|
lowercase p
|
q
|
q
|
lowercase q
|
r
|
r
|
lowercase r
|
s
|
s
|
lowercase s
|
t
|
t
|
lowercase t
|
u
|
u
|
lowercase u
|
v
|
v
|
lowercase v
|
w
|
w
|
lowercase w
|
x
|
x
|
lowercase x
|
y
|
y
|
lowercase y
|
z
|
z
|
lowercase z
|
{
|
{
|
left curly brace
|
|
|
|
|
vertical bar
|
}
|
}
|
right curly brace
|
~
|
~
|
tilde
|
ASCII Device Control Characters
ASCII Character
|
HTML Entity Code
|
Description
|
NUL
|
�
|
null character
|
SOH
|

|
start of header
|
STX
|

|
start of text
|
ETX
|

|
end of text
|
EOT
|

|
end of transmission
|
ENQ
|

|
enquiry
|
ACK
|

|
acknowledge
|
BEL
|

|
bell (ring)
|
BS
|

|
backspace
|
HT
|
	
|
horizontal tab
|
LF
|
|
line feed
|
VT
|

|
vertical tab
|
FF
|

|
form feed
|
CR
|
|
carriage return
|
SO
|

|
shift out
|
SI
|

|
shift in
|
DLE
|

|
data link escape
|
DC1
|

|
device control 1
|
DC2
|

|
device control 2
|
DC3
|

|
device control 3
|
DC4
|

|
device control 4
|
NAK
|

|
negative acknowledge
|
SYN
|

|
synchronize
|
ETB
|

|
end transmission block
|
CAN
|

|
cancel
|
EM
|

|
end of medium
|
SUB
|

|
substitute
|
ESC
|

|
escape
|
FS
|

|
file separator
|
GS
|

|
group separator
|
RS
|

|
record separator
|
US
|

|
unit separator
|
DEL
|

|
delete (rubout)
|
ISO Character Sets
Character set
|
Description
|
Covers
|
ISO-8859-1
|
Latin alphabet part 1
|
North America, Western
Europe, Latin America, the Caribbean, Canada,
Africa
|
ISO-8859-2
|
Latin alphabet part 2
|
Eastern
Europe
|
ISO-8859-3
|
Latin alphabet part 3
|
SE
Europe, Esperanto,
miscellaneous others
|
ISO-8859-4
|
Latin alphabet part 4
|
Scandinavia/Baltics (and
others not in ISO-8859-1)
|
ISO-8859-5
|
Latin/Cyrillic part 5
|
The languages that are
using a Cyrillic alphabet such as Bulgarian, Belarusian, Russian and
Macedonian
|
ISO-8859-6
|
Latin/Arabic part 6
|
The languages that are
using the Arabic alphabet
|
ISO-8859-7
|
Latin/Greek part 7
|
The modern Greek language
as well as mathematical symbols derived from the Greek
|
ISO-8859-8
|
Latin/Hebrew part 8
|
The languages that are
using the Hebrew alphabet
|
ISO-8859-9
|
Latin 5 part 9
|
The Turkish language.
Same as ISO-8859-1 except Turkish characters replace Icelandic ones
|
ISO-8859-10
|
Latin 6 Lappish, Nordic,
Eskimo
|
The Nordic languages
|
ISO-8859-15
|
Latin 9 (aka Latin 0)
|
Similar to ISO 8859-1 but
replaces some less common symbols with the euro sign and some other missing
characters
|
ISO-2022-JP
|
Latin/Japanese part 1
|
The Japanese language
|
ISO-2022-JP-2
|
Latin/Japanese part 2
|
The Japanese language
|
ISO-2022-KR
|
Latin/Korean part 1
|
The Korean language
|
The Unicode Standard
Character-set
|
Description
|
UTF-8
|
A character in UTF8 can
be from 1 to 4 bytes long. UTF-8 can represent any character in the Unicode
standard. UTF-8 is backwards compatible with ASCII. UTF-8 is the preferred
encoding for e-mail and web pages
|
UTF-16
|
16-bit Unicode
Transformation Format is a variable-length character encoding for Unicode,
capable of encoding the entire Unicode repertoire. UTF-16 is used in major
operating systems and environments, like Microsoft Windows 2000/XP/2003/Vista/CE and the Java and .NET byte code environments
|
Tip: All HTML 4 processors already support UTF-8, and all XHTML and XML processors support UTF-8 and UTF-16!
Math
Symbols Supported by HTML
Character
|
Entity Number
|
Entity Name
|
Description
|
∀
|
∀
|
∀
|
for all
|
∂
|
∂
|
∂
|
part
|
∃
|
∃
|
∃
|
exists
|
∅
|
∅
|
∅
|
empty
|
∇
|
∇
|
∇
|
nabla
|
∈
|
∈
|
∈
|
isin
|
∉
|
∉
|
∉
|
notin
|
∋
|
∋
|
∋
|
ni
|
∏
|
∏
|
∏
|
prod
|
∑
|
∑
|
∑
|
sum
|
−
|
−
|
−
|
minus
|
∗
|
∗
|
∗
|
lowast
|
√
|
√
|
√
|
square root
|
∝
|
∝
|
∝
|
proportional to
|
∞
|
∞
|
∞
|
infinity
|
∠
|
∠
|
∠
|
angle
|
∧
|
∧
|
∧
|
and
|
∨
|
∨
|
∨
|
or
|
∩
|
∩
|
∩
|
cap
|
∪
|
∪
|
∪
|
cup
|
∫
|
∫
|
∫
|
integral
|
∴
|
∴
|
∴
|
therefore
|
∼
|
∼
|
∼
|
similar to
|
≅
|
≅
|
≅
|
congruent to
|
≈
|
≈
|
≈
|
almost equal
|
≠
|
≠
|
≠
|
not equal
|
≡
|
≡
|
≡
|
equivalent
|
≤
|
≤
|
≤
|
less or equal
|
≥
|
≥
|
≥
|
greater or equal
|
⊂
|
⊂
|
⊂
|
subset of
|
⊃
|
⊃
|
⊃
|
superset of
|
⊄
|
⊄
|
⊄
|
not subset of
|
⊆
|
⊆
|
⊆
|
subset or equal
|
⊇
|
⊇
|
⊇
|
superset or equal
|
⊕
|
⊕
|
⊕
|
circled plus
|
⊗
|
⊗
|
⊗
|
circled times
|
⊥
|
⊥
|
⊥
|
perpendicular
|
⋅
|
⋅
|
⋅
|
dot operator
|
Greek Letters Supported by HTML
Character
|
Entity Number
|
Entity Name
|
Description
|
Α
|
Α
|
Α
|
Alpha
|
Β
|
Β
|
Β
|
Beta
|
Γ
|
Γ
|
Γ
|
Gamma
|
Δ
|
Δ
|
Δ
|
Delta
|
Ε
|
Ε
|
Ε
|
Epsilon
|
Ζ
|
Ζ
|
Ζ
|
Zeta
|
Η
|
Η
|
Η
|
Eta
|
Θ
|
Θ
|
Θ
|
Theta
|
Ι
|
Ι
|
Ι
|
Iota
|
Κ
|
Κ
|
Κ
|
Kappa
|
Λ
|
Λ
|
Λ
|
Lambda
|
Μ
|
Μ
|
Μ
|
Mu
|
Ν
|
Ν
|
Ν
|
Nu
|
Ξ
|
Ξ
|
Ξ
|
Xi
|
Ο
|
Ο
|
Ο
|
Omicron
|
Π
|
Π
|
Π
|
Pi
|
Ρ
|
Ρ
|
Ρ
|
Rho
|
undefined
|
Sigmaf
|
||
Σ
|
Σ
|
Σ
|
Sigma
|
Τ
|
Τ
|
Τ
|
Tau
|
Υ
|
Υ
|
Υ
|
Upsilon
|
Φ
|
Φ
|
Φ
|
Phi
|
Χ
|
Χ
|
Χ
|
Chi
|
Ψ
|
Ψ
|
Ψ
|
Psi
|
Ω
|
Ω
|
Ω
|
Omega
|
α
|
α
|
α
|
alpha
|
β
|
β
|
β
|
beta
|
γ
|
γ
|
γ
|
gamma
|
δ
|
δ
|
δ
|
delta
|
ε
|
ε
|
ε
|
epsilon
|
ζ
|
ζ
|
ζ
|
zeta
|
η
|
η
|
η
|
eta
|
θ
|
θ
|
θ
|
theta
|
ι
|
ι
|
ι
|
iota
|
κ
|
κ
|
κ
|
kappa
|
λ
|
λ
|
λ
|
lambda
|
μ
|
μ
|
μ
|
mu
|
ν
|
ν
|
ν
|
nu
|
ξ
|
ξ
|
ξ
|
xi
|
ο
|
ο
|
ο
|
omicron
|
π
|
π
|
π
|
pi
|
ρ
|
ρ
|
ρ
|
rho
|
ς
|
ς
|
ς
|
sigmaf
|
σ
|
σ
|
σ
|
sigma
|
τ
|
τ
|
τ
|
tau
|
υ
|
υ
|
υ
|
upsilon
|
φ
|
φ
|
φ
|
phi
|
χ
|
χ
|
χ
|
chi
|
ψ
|
ψ
|
ψ
|
psi
|
ω
|
ω
|
ω
|
omega
|
ϑ
|
ϑ
|
ϑ
|
theta symbol
|
ϒ
|
ϒ
|
ϒ
|
upsilon symbol
|
ϖ
|
ϖ
|
ϖ
|
pi symbol
|
Other Entities Supported by HTML
Character
|
Entity Number
|
Entity Name
|
Description
|
Œ
|
Œ
|
Œ
|
capital ligature OE
|
œ
|
œ
|
œ
|
small ligature oe
|
Š
|
Š
|
Š
|
capital S with caron
|
š
|
š
|
š
|
small S with caron
|
Ÿ
|
Ÿ
|
Ÿ
|
capital Y with diaeres
|
ƒ
|
ƒ
|
ƒ
|
f with hook
|
ˆ
|
ˆ
|
ˆ
|
modifier letter
circumflex accent
|
˜
|
˜
|
˜
|
small tilde
|
 
|
 
|
en space
|
|
 
|
 
|
em space
|
|
 
|
 
|
thin space
|
|
|
‌
|
‌
|
zero width non-joiner
|
|
‍
|
‍
|
zero width joiner
|
|
‎
|
‎
|
left-to-right mark
|
|
‏
|
‏
|
right-to-left mark
|
–
|
–
|
–
|
en dash
|
—
|
—
|
—
|
em dash
|
‘
|
‘
|
‘
|
left single quotation
mark
|
’
|
’
|
’
|
right single quotation
mark
|
‚
|
‚
|
‚
|
single low-9 quotation
mark
|
“
|
“
|
“
|
left double quotation
mark
|
”
|
”
|
”
|
right double quotation
mark
|
„
|
„
|
„
|
double low-9 quotation
mark
|
†
|
†
|
†
|
dagger
|
‡
|
‡
|
‡
|
double dagger
|
•
|
•
|
•
|
bullet
|
…
|
…
|
…
|
horizontal ellipsis
|
‰
|
‰
|
‰
|
per mille
|
′
|
′
|
′
|
minutes
|
″
|
″
|
″
|
seconds
|
‹
|
‹
|
‹
|
single left angle
quotation
|
›
|
›
|
›
|
single right angle
quotation
|
‾
|
‾
|
‾
|
overline
|
€
|
€
|
€
|
euro
|
™
|
™ or ™
|
™
|
trademark
|
←
|
←
|
←
|
left arrow
|
↑
|
↑
|
↑
|
up arrow
|
→
|
→
|
→
|
right arrow
|
↓
|
↓
|
↓
|
down arrow
|
↔
|
↔
|
↔
|
left right arrow
|
↵
|
↵
|
↵
|
carriage return arrow
|
⌈
|
⌈
|
⌈
|
left ceiling
|
⌉
|
⌉
|
⌉
|
right ceiling
|
⌊
|
⌊
|
⌊
|
left floor
|
⌋
|
⌋
|
⌋
|
right floor
|
◊
|
◊
|
◊
|
lozenge
|
♠
|
♠
|
♠
|
spade
|
♣
|
♣
|
♣
|
club
|
♥
|
♥
|
♥
|
heart
|
♦
|
♦
|
♦
|
diamond
|
Reserved Characters in HTML
Character
|
Entity Number
|
Entity Name
|
Description
|
"
|
"
|
"
|
quotation mark
|
'
|
'
|
'
|
apostrophe
|
&
|
&
|
&
|
ampersand
|
<
|
<
|
<
|
less-than
|
>
|
>
|
>
|
greater-than
|
ISO 8859-1 Symbols
Character
|
Entity Number
|
Entity Name
|
Description
|
 
|
|
non-breaking space
|
|
¡
|
¡
|
¡
|
inverted exclamation mark
|
¢
|
¢
|
¢
|
cent
|
£
|
£
|
£
|
pound
|
¤
|
¤
|
¤
|
currency
|
¥
|
¥
|
¥
|
yen
|
¦
|
¦
|
¦
|
broken vertical bar
|
§
|
§
|
§
|
section
|
¨
|
¨
|
¨
|
spacing diaeresis
|
©
|
©
|
©
|
copyright
|
ª
|
ª
|
ª
|
feminine ordinal
indicator
|
«
|
«
|
«
|
angle quotation mark
(left)
|
¬
|
¬
|
¬
|
negation
|
|
­
|
­
|
soft hyphen
|
®
|
®
|
®
|
registered trademark
|
¯
|
¯
|
¯
|
spacing macron
|
°
|
°
|
°
|
degree
|
±
|
±
|
±
|
plus-or-minus
|
²
|
²
|
²
|
superscript 2
|
³
|
³
|
³
|
superscript 3
|
´
|
´
|
´
|
spacing acute
|
µ
|
µ
|
µ
|
micro
|
¶
|
¶
|
¶
|
paragraph
|
·
|
·
|
·
|
middle dot
|
¸
|
¸
|
¸
|
spacing cedilla
|
¹
|
¹
|
¹
|
superscript 1
|
º
|
º
|
º
|
masculine ordinal
indicator
|
»
|
»
|
»
|
angle quotation mark
(right)
|
¼
|
¼
|
¼
|
fraction 1/4
|
½
|
½
|
½
|
fraction 1/2
|
¾
|
¾
|
¾
|
fraction 3/4
|
¿
|
¿
|
¿
|
inverted question mark
|
×
|
×
|
×
|
multiplication
|
÷
|
÷
|
÷
|
division
|
ISO 8859-1 Characters
Character
|
Entity Number
|
Entity Name
|
Description
|
À
|
À
|
À
|
capital a, grave accent
|
Á
|
Á
|
Á
|
capital a, acute accent
|
Â
|
Â
|
Â
|
capital a, circumflex
accent
|
Ã
|
Ã
|
Ã
|
capital a, tilde
|
Ä
|
Ä
|
Ä
|
capital a, umlaut mark
|
Å
|
Å
|
Å
|
capital a, ring
|
Æ
|
Æ
|
Æ
|
capital ae
|
Ç
|
Ç
|
Ç
|
capital c, cedilla
|
È
|
È
|
È
|
capital e, grave accent
|
É
|
É
|
É
|
capital e, acute accent
|
Ê
|
Ê
|
Ê
|
capital e, circumflex
accent
|
Ë
|
Ë
|
Ë
|
capital e, umlaut mark
|
Ì
|
Ì
|
Ì
|
capital i, grave accent
|
Í
|
Í
|
Í
|
capital i, acute accent
|
Î
|
Î
|
Î
|
capital i, circumflex
accent
|
Ï
|
Ï
|
Ï
|
capital i, umlaut mark
|
Ð
|
Ð
|
Ð
|
capital eth, Icelandic
|
Ñ
|
Ñ
|
Ñ
|
capital n, tilde
|
Ò
|
Ò
|
Ò
|
capital o, grave accent
|
Ó
|
Ó
|
Ó
|
capital o, acute accent
|
Ô
|
Ô
|
Ô
|
capital o, circumflex
accent
|
Õ
|
Õ
|
Õ
|
capital o, tilde
|
Ö
|
Ö
|
Ö
|
capital o, umlaut mark
|
Ø
|
Ø
|
Ø
|
capital o, slash
|
Ù
|
Ù
|
Ù
|
capital u, grave accent
|
Ú
|
Ú
|
Ú
|
capital u, acute accent
|
Û
|
Û
|
Û
|
capital u, circumflex
accent
|
Ü
|
Ü
|
Ü
|
capital u, umlaut mark
|
Ý
|
Ý
|
Ý
|
capital y, acute accent
|
Þ
|
Þ
|
Þ
|
capital THORN, Icelandic
|
ß
|
ß
|
ß
|
small sharp s, German
|
à
|
à
|
à
|
small a, grave accent
|
á
|
á
|
á
|
small a, acute accent
|
â
|
â
|
â
|
small a, circumflex
accent
|
ã
|
ã
|
ã
|
small a, tilde
|
ä
|
ä
|
ä
|
small a, umlaut mark
|
å
|
å
|
å
|
small a, ring
|
æ
|
æ
|
æ
|
small ae
|
ç
|
ç
|
ç
|
small c, cedilla
|
è
|
è
|
è
|
small e, grave accent
|
é
|
é
|
é
|
small e, acute accent
|
ê
|
ê
|
ê
|
small e, circumflex
accent
|
ë
|
ë
|
ë
|
small e, umlaut mark
|
ì
|
ì
|
ì
|
small i, grave accent
|
í
|
í
|
í
|
small i, acute accent
|
î
|
î
|
î
|
small i, circumflex
accent
|
ï
|
ï
|
ï
|
small i, umlaut mark
|
ð
|
ð
|
ð
|
small eth, Icelandic
|
ñ
|
ñ
|
ñ
|
small n, tilde
|
ò
|
ò
|
ò
|
small o, grave accent
|
ó
|
ó
|
ó
|
small o, acute accent
|
ô
|
ô
|
ô
|
small o, circumflex
accent
|
õ
|
õ
|
õ
|
small o, tilde
|
ö
|
ö
|
ö
|
small o, umlaut mark
|
ø
|
ø
|
ø
|
small o, slash
|
ù
|
ù
|
ù
|
small u, grave accent
|
ú
|
ú
|
ú
|
small u, acute accent
|
û
|
û
|
û
|
small u, circumflex
accent
|
ü
|
ü
|
ü
|
small u, umlaut mark
|
ý
|
ý
|
ý
|
small y, acute accent
|
þ
|
þ
|
þ
|
small thorn, Icelandic
|
ÿ
|
ÿ
|
ÿ
|
small y, umlaut mark
|
0 comments:
Post a Comment