HTML Entities ~ ESkool

Some characters are reserved in HTML and cannot be used directly. For example, the less-than (<) or greater-than (>) sign cannot be written as-is, because the browser would confuse it with a tag.
So, to actually display reserved characters, we must use character entities in the HTML source code. A character entity looks like this:

&entity_name;

or

&#entity_number;

To write the less-than sign, we must write &lt; or &#60;
Note: The advantage of using an entity name instead of a number is that the name is easier to remember. The disadvantage is that browsers may not support every entity name (whereas they support entity numbers very well).
Note: Entity names are case sensitive!
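
The two forms can be checked outside a browser. The sketch below is only an illustration: it assumes Python and its standard `html` module, neither of which the original post uses, and the variable names are made up for the example.

```python
import html

# A named reference and a decimal numeric reference decode to the same character.
named = html.unescape("&lt;")
numeric = html.unescape("&#60;")

# html.escape replaces the reserved characters &, < and > with entities
# (and, by default, the two quote characters as well).
escaped = html.escape("if a < b && b > c")

print(named, numeric)
print(escaped)
```

Going the other way, `html.unescape(escaped)` should give back the original string.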

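Result	Description	Entity Name	Entity Number
	non-breaking space	&nbsp;	&#160;
<	less than	&lt;	&#60;
>	greater than	&gt;	&#62;
&	ampersand	&amp;	&#38;
¢	cent	&cent;	&#162;
£	pound	&pound;	&#163;
¥	yen	&yen;	&#165;
€	euro	&euro;	&#8364;
§	section	&sect;	&#167;
©	copyright	&copy;	&#169;
®	registered trademark	&reg;	&#174;
™	trademark	&trade;	&#8482;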

Non-breaking Space

A commonly used character entity in HTML is the non-breaking space (&nbsp;).
Browsers always collapse runs of spaces in HTML pages. If you type 10 spaces in a sentence, the browser removes 9 of them before displaying the page. To add extra spaces to your text, use the &nbsp; character entity.
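
The collapsing behaviour can be approximated in a few lines. This is a rough sketch, assuming Python: `collapse_spaces` is a hypothetical helper that only mimics what a browser does with ordinary whitespace, it is not how any browser is actually implemented.

```python
import html
import re


def collapse_spaces(text):
    """Approximate how a browser collapses runs of ordinary whitespace."""
    # Only ASCII whitespace is collapsed; the no-break space (U+00A0) is kept.
    return re.sub(r"[ \t\n\r\f]+", " ", text)


plain = "Hello" + " " * 10 + "World"
padded = html.unescape("Hello" + "&nbsp;" * 10 + "World")

collapsed_plain = collapse_spaces(plain)    # ten spaces become one
collapsed_padded = collapse_spaces(padded)  # all ten no-break spaces survive

print(repr(collapsed_plain))
print(len(collapsed_padded))
```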

ASCII Character	HTML Entity Code	Description
	&#32;	space
!	&#33;	exclamation mark
"	&#34;	quotation mark
#	&#35;	number sign
$	&#36;	dollar sign
%	&#37;	percent sign
&	&#38;	ampersand
'	&#39;	apostrophe
(	&#40;	left parenthesis
)	&#41;	right parenthesis
*	&#42;	asterisk
+	&#43;	plus sign
,	&#44;	comma
-	&#45;	hyphen
.	&#46;	period
/	&#47;	slash
0	&#48;	digit 0
1	&#49;	digit 1
2	&#50;	digit 2
3	&#51;	digit 3
4	&#52;	digit 4
5	&#53;	digit 5
6	&#54;	digit 6
7	&#55;	digit 7
8	&#56;	digit 8
9	&#57;	digit 9
:	&#58;	colon
;	&#59;	semicolon
<	&#60;	less-than
=	&#61;	equals-to
>	&#62;	greater-than
?	&#63;	question mark
@	&#64;	at sign
A	&#65;	uppercase A
B	&#66;	uppercase B
C	&#67;	uppercase C
D	&#68;	uppercase D
E	&#69;	uppercase E
F	&#70;	uppercase F
G	&#71;	uppercase G
H	&#72;	uppercase H
I	&#73;	uppercase I
J	&#74;	uppercase J
K	&#75;	uppercase K
L	&#76;	uppercase L
M	&#77;	uppercase M
N	&#78;	uppercase N
O	&#79;	uppercase O
P	&#80;	uppercase P
Q	&#81;	uppercase Q
R	&#82;	uppercase R
S	&#83;	uppercase S
T	&#84;	uppercase T
U	&#85;	uppercase U
V	&#86;	uppercase V
W	&#87;	uppercase W
X	&#88;	uppercase X
Y	&#89;	uppercase Y
Z	&#90;	uppercase Z
[	&#91;	left square bracket
\	&#92;	backslash
]	&#93;	right square bracket
^	&#94;	caret
_	&#95;	underscore
`	&#96;	grave accent
a	&#97;	lowercase a
b	&#98;	lowercase b
c	&#99;	lowercase c
d	&#100;	lowercase d
e	&#101;	lowercase e
f	&#102;	lowercase f
g	&#103;	lowercase g
h	&#104;	lowercase h
i	&#105;	lowercase i
j	&#106;	lowercase j
k	&#107;	lowercase k
l	&#108;	lowercase l
m	&#109;	lowercase m
n	&#110;	lowercase n
o	&#111;	lowercase o
p	&#112;	lowercase p
q	&#113;	lowercase q
r	&#114;	lowercase r
s	&#115;	lowercase s
t	&#116;	lowercase t
u	&#117;	lowercase u
v	&#118;	lowercase v
w	&#119;	lowercase w
x	&#120;	lowercase x
y	&#121;	lowercase y
z	&#122;	lowercase z
{	&#123;	left curly brace
|	&#124;	vertical bar
}	&#125;	right curly brace
~	&#126;	tilde
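
The entity codes in this table are simply the decimal ASCII values of the characters, so the rows can be regenerated rather than memorised. A minimal sketch, assuming Python; `to_numeric_entity` is a hypothetical helper name used only for this example.

```python
import html


def to_numeric_entity(ch):
    """Return the decimal numeric character reference for one character."""
    return f"&#{ord(ch)};"


# Printable ASCII runs from 32 (space) to 126 (tilde).
rows = [(chr(n), to_numeric_entity(chr(n))) for n in range(32, 127)]

for ch, code in rows[:3]:
    print(repr(ch), code)

# Decoding a generated reference gives the original character back.
round_trip = html.unescape(to_numeric_entity("@"))
```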

ASCII Device Control Characters

ASCII Character	HTML Entity Code	Description
NUL	&#00;	null character
SOH	&#01;	start of header
STX	&#02;	start of text
ETX	&#03;	end of text
EOT	&#04;	end of transmission
ENQ	&#05;	enquiry
ACK	&#06;	acknowledge
BEL	&#07;	bell (ring)
BS	&#08;	backspace
HT	&#09;	horizontal tab
LF	&#10;	line feed
VT	&#11;	vertical tab
FF	&#12;	form feed
CR	&#13;	carriage return
SO	&#14;	shift out
SI	&#15;	shift in
DLE	&#16;	data link escape
DC1	&#17;	device control 1
DC2	&#18;	device control 2
DC3	&#19;	device control 3
DC4	&#20;	device control 4
NAK	&#21;	negative acknowledge
SYN	&#22;	synchronize
ETB	&#23;	end transmission block
CAN	&#24;	cancel
EM	&#25;	end of medium
SUB	&#26;	substitute
ESC	&#27;	escape
FS	&#28;	file separator
GS	&#29;	group separator
RS	&#30;	record separator
US	&#31;	unit separator

DEL	&#127;	delete (rubout)

ISO Character Sets

Character set	Description	Covers
ISO-8859-1	Latin alphabet part 1	North America, Western Europe, Latin America, the Caribbean, Canada, Africa
ISO-8859-2	Latin alphabet part 2	Eastern Europe
ISO-8859-3	Latin alphabet part 3	SE Europe, Esperanto, miscellaneous others
ISO-8859-4	Latin alphabet part 4	Scandinavia/Baltics (and others not in ISO-8859-1)
ISO-8859-5	Latin/Cyrillic part 5	The languages that are using a Cyrillic alphabet such as Bulgarian, Belarusian, Russian and Macedonian
ISO-8859-6	Latin/Arabic part 6	The languages that are using the Arabic alphabet
ISO-8859-7	Latin/Greek part 7	The modern Greek language as well as mathematical symbols derived from the Greek
ISO-8859-8	Latin/Hebrew part 8	The languages that are using the Hebrew alphabet
ISO-8859-9	Latin 5 part 9	The Turkish language. Same as ISO-8859-1 except Turkish characters replace Icelandic ones
ISO-8859-10	Latin 6 Lappish, Nordic, Eskimo	The Nordic languages
ISO-8859-15	Latin 9 (aka Latin 0)	Similar to ISO 8859-1 but replaces some less common symbols with the euro sign and some other missing characters
ISO-2022-JP	Latin/Japanese part 1	The Japanese language
ISO-2022-JP-2	Latin/Japanese part 2	The Japanese language
ISO-2022-KR	Latin/Korean part 1	The Korean language
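
To get a feel for what "covers" means in this table, one can test whether a character is encodable in a given character set. The sketch below assumes Python and its built-in codecs; `encodes` is a hypothetical helper written for this example.

```python
def encodes(ch, charset):
    """Return True if the character can be encoded in the given character set."""
    try:
        ch.encode(charset)
        return True
    except UnicodeEncodeError:
        return False


# The euro sign is missing from ISO-8859-1 but present in ISO-8859-15.
euro_in_latin1 = encodes("€", "iso-8859-1")
euro_in_latin9 = encodes("€", "iso-8859-15")

# A Cyrillic letter needs ISO-8859-5.
zhe_in_latin1 = encodes("Ж", "iso-8859-1")
zhe_in_cyrillic = encodes("Ж", "iso-8859-5")

print(euro_in_latin1, euro_in_latin9, zhe_in_latin1, zhe_in_cyrillic)
```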

The Unicode Standard

Character-set	Description
UTF-8	A character in UTF-8 can be from 1 to 4 bytes long. UTF-8 can represent any character in the Unicode standard and is backwards compatible with ASCII. UTF-8 is the preferred encoding for e-mail and web pages.
UTF-16	16-bit Unicode Transformation Format is a variable-length character encoding for Unicode, capable of encoding the entire Unicode repertoire. UTF-16 is used in major operating systems and environments, like Microsoft Windows 2000/XP/2003/Vista/CE and the Java and .NET byte code environments

Tip: The first 256 characters of Unicode character-sets correspond to the 256 characters of ISO-8859-1.
Tip: All HTML 4 processors already support UTF-8, and all XHTML and XML processors support UTF-8 and UTF-16!
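
The byte lengths and the ISO-8859-1 correspondence mentioned above can be verified directly. This is a small sketch assuming Python; the sample characters are arbitrary choices for the illustration.

```python
samples = ["A", "é", "€", "😀"]

# UTF-8 uses between 1 and 4 bytes per character.
utf8_lengths = [len(ch.encode("utf-8")) for ch in samples]

# UTF-16 uses 2 bytes, or 4 bytes for characters outside the Basic Multilingual Plane.
# "utf-16-be" is used so that no byte-order mark is counted.
utf16_lengths = [len(ch.encode("utf-16-be")) for ch in samples]

# Decoding every byte value as ISO-8859-1 yields Unicode code points 0 to 255.
latin1_text = bytes(range(256)).decode("iso-8859-1")
matches_unicode = all(ord(ch) == i for i, ch in enumerate(latin1_text))

print(utf8_lengths, utf16_lengths, matches_unicode)
```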

Math Symbols Supported by HTML

Character	Entity Number	Entity Name	Description
∀	&#8704;	&forall;	for all
∂	&#8706;	&part;	part
∃	&#8707;	&exist;	exists
∅	&#8709;	&empty;	empty
∇	&#8711;	&nabla;	nabla
∈	&#8712;	&isin;	isin
∉	&#8713;	&notin;	notin
∋	&#8715;	&ni;	ni
∏	&#8719;	&prod;	prod
∑	&#8721;	&sum;	sum
−	&#8722;	&minus;	minus
∗	&#8727;	&lowast;	lowast
√	&#8730;	&radic;	square root
∝	&#8733;	&prop;	proportional to
∞	&#8734;	&infin;	infinity
∠	&#8736;	&ang;	angle
∧	&#8743;	&and;	and
∨	&#8744;	&or;	or
∩	&#8745;	&cap;	cap
∪	&#8746;	&cup;	cup
∫	&#8747;	&int;	integral
∴	&#8756;	&there4;	therefore
∼	&#8764;	&sim;	similar to
≅	&#8773;	&cong;	congruent to
≈	&#8776;	&asymp;	almost equal
≠	&#8800;	&ne;	not equal
≡	&#8801;	&equiv;	equivalent
≤	&#8804;	&le;	less or equal
≥	&#8805;	&ge;	greater or equal
⊂	&#8834;	&sub;	subset of
⊃	&#8835;	&sup;	superset of
⊄	&#8836;	&nsub;	not subset of
⊆	&#8838;	&sube;	subset or equal
⊇	&#8839;	&supe;	superset or equal
⊕	&#8853;	&oplus;	circled plus
⊗	&#8855;	&otimes;	circled times
⊥	&#8869;	&perp;	perpendicular
⋅	&#8901;	&sdot;	dot operator
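
Rows like these can be looked up programmatically instead of copied by hand. The sketch below assumes Python, whose standard `html.entities.name2codepoint` dictionary maps HTML 4 entity names to code points; `describe` is a hypothetical helper for this example.

```python
from html.entities import name2codepoint


def describe(name):
    """Return (character, entity number, entity name) for an HTML 4 entity name."""
    code = name2codepoint[name]
    return chr(code), f"&#{code};", f"&{name};"


for entity in ("forall", "radic", "infin", "sdot"):
    print(*describe(entity), sep="\t")
```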

Greek Letters Supported by HTML

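Character	Entity Number	Entity Name	Description
Α	&#913;	&Alpha;	Alpha
Β	&#914;	&Beta;	Beta
Γ	&#915;	&Gamma;	Gamma
Δ	&#916;	&Delta;	Delta
Ε	&#917;	&Epsilon;	Epsilon
Ζ	&#918;	&Zeta;	Zeta
Η	&#919;	&Eta;	Eta
Θ	&#920;	&Theta;	Theta
Ι	&#921;	&Iota;	Iota
Κ	&#922;	&Kappa;	Kappa
Λ	&#923;	&Lambda;	Lambda
Μ	&#924;	&Mu;	Mu
Ν	&#925;	&Nu;	Nu
Ξ	&#926;	&Xi;	Xi
Ο	&#927;	&Omicron;	Omicron
Π	&#928;	&Pi;	Pi
Ρ	&#929;	&Rho;	Rho
	undefined		Sigmaf
Σ	&#931;	&Sigma;	Sigma
Τ	&#932;	&Tau;	Tau
Υ	&#933;	&Upsilon;	Upsilon
Φ	&#934;	&Phi;	Phi
Χ	&#935;	&Chi;	Chi
Ψ	&#936;	&Psi;	Psi
Ω	&#937;	&Omega;	Omega

α	&#945;	&alpha;	alpha
β	&#946;	&beta;	beta
γ	&#947;	&gamma;	gamma
δ	&#948;	&delta;	delta
ε	&#949;	&epsilon;	epsilon
ζ	&#950;	&zeta;	zeta
η	&#951;	&eta;	eta
θ	&#952;	&theta;	theta
ι	&#953;	&iota;	iota
κ	&#954;	&kappa;	kappa
λ	&#955;	&lambda;	lambda
μ	&#956;	&mu;	mu
ν	&#957;	&nu;	nu
ξ	&#958;	&xi;	xi
ο	&#959;	&omicron;	omicron
π	&#960;	&pi;	pi
ρ	&#961;	&rho;	rho
ς	&#962;	&sigmaf;	sigmaf
σ	&#963;	&sigma;	sigma
τ	&#964;	&tau;	tau
υ	&#965;	&upsilon;	upsilon
φ	&#966;	&phi;	phi
χ	&#967;	&chi;	chi
ψ	&#968;	&psi;	psi
ω	&#969;	&omega;	omega

ϑ	&#977;	&thetasym;	theta symbol
ϒ	&#978;	&upsih;	upsilon symbol
ϖ	&#982;	&piv;	pi symbol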

Other Entities Supported by HTML

Character	Entity Number	Entity Name	Description
Œ	&#338;	&OElig;	capital ligature OE
œ	&#339;	&oelig;	small ligature oe
Š	&#352;	&Scaron;	capital S with caron
š	&#353;	&scaron;	small s with caron
Ÿ	&#376;	&Yuml;	capital Y with diaeresis
ƒ	&#402;	&fnof;	f with hook
ˆ	&#710;	&circ;	modifier letter circumflex accent
˜	&#732;	&tilde;	small tilde
	&#8194;	&ensp;	en space
	&#8195;	&emsp;	em space
	&#8201;	&thinsp;	thin space
	&#8204;	&zwnj;	zero width non-joiner
	&#8205;	&zwj;	zero width joiner
	&#8206;	&lrm;	left-to-right mark
	&#8207;	&rlm;	right-to-left mark
–	&#8211;	&ndash;	en dash
—	&#8212;	&mdash;	em dash
‘	&#8216;	&lsquo;	left single quotation mark
’	&#8217;	&rsquo;	right single quotation mark
‚	&#8218;	&sbquo;	single low-9 quotation mark
“	&#8220;	&ldquo;	left double quotation mark
”	&#8221;	&rdquo;	right double quotation mark
„	&#8222;	&bdquo;	double low-9 quotation mark
†	&#8224;	&dagger;	dagger
‡	&#8225;	&Dagger;	double dagger
•	&#8226;	&bull;	bullet
…	&#8230;	&hellip;	horizontal ellipsis
‰	&#8240;	&permil;	per mille
′	&#8242;	&prime;	minutes
″	&#8243;	&Prime;	seconds
‹	&#8249;	&lsaquo;	single left angle quotation
›	&#8250;	&rsaquo;	single right angle quotation
‾	&#8254;	&oline;	overline
€	&#8364;	&euro;	euro
™	&#8482; or &#153;	&trade;	trademark
←	&#8592;	&larr;	left arrow
↑	&#8593;	&uarr;	up arrow
→	&#8594;	&rarr;	right arrow
↓	&#8595;	&darr;	down arrow
↔	&#8596;	&harr;	left right arrow
↵	&#8629;	&crarr;	carriage return arrow
⌈	&#8968;	&lceil;	left ceiling
⌉	&#8969;	&rceil;	right ceiling
⌊	&#8970;	&lfloor;	left floor
⌋	&#8971;	&rfloor;	right floor
◊	&#9674;	&loz;	lozenge
♠	&#9824;	&spades;	spade
♣	&#9827;	&clubs;	club
♥	&#9829;	&hearts;	heart
♦	&#9830;	&diams;	diamond

Reserved Characters in HTML

Character	Entity Number	Entity Name	Description
"	&#34;	&quot;	quotation mark
'	&#39;	&apos;	apostrophe
&	&#38;	&amp;	ampersand
<	&#60;	&lt;	less-than
>	&#62;	&gt;	greater-than

Note: Entity names are case sensitive!
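
Case sensitivity is easy to see with a pair of names that differ only in capitalisation. A minimal sketch, assuming Python's standard `html.unescape`; the misspelled `&DAGGER;` is a deliberately invalid name chosen for the illustration.

```python
import html

lower = html.unescape("&dagger;")    # dagger, U+2020
upper = html.unescape("&Dagger;")    # double dagger, U+2021
unknown = html.unescape("&DAGGER;")  # not a defined name, so it is left as typed

print(lower, upper, unknown)
```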

ISO 8859-1 Symbols

Character	Entity Number	Entity Name	Description
	&#160;	&nbsp;	non-breaking space
¡	&#161;	&iexcl;	inverted exclamation mark
¢	&#162;	&cent;	cent
£	&#163;	&pound;	pound
¤	&#164;	&curren;	currency
¥	&#165;	&yen;	yen
¦	&#166;	&brvbar;	broken vertical bar
§	&#167;	&sect;	section
¨	&#168;	&uml;	spacing diaeresis
©	&#169;	&copy;	copyright
ª	&#170;	&ordf;	feminine ordinal indicator
«	&#171;	&laquo;	angle quotation mark (left)
¬	&#172;	&not;	negation
	&#173;	&shy;	soft hyphen
®	&#174;	&reg;	registered trademark
¯	&#175;	&macr;	spacing macron
°	&#176;	&deg;	degree
±	&#177;	&plusmn;	plus-or-minus
²	&#178;	&sup2;	superscript 2
³	&#179;	&sup3;	superscript 3
´	&#180;	&acute;	spacing acute
µ	&#181;	&micro;	micro
¶	&#182;	&para;	paragraph
·	&#183;	&middot;	middle dot
¸	&#184;	&cedil;	spacing cedilla
¹	&#185;	&sup1;	superscript 1
º	&#186;	&ordm;	masculine ordinal indicator
»	&#187;	&raquo;	angle quotation mark (right)
¼	&#188;	&frac14;	fraction 1/4
½	&#189;	&frac12;	fraction 1/2
¾	&#190;	&frac34;	fraction 3/4
¿	&#191;	&iquest;	inverted question mark
×	&#215;	&times;	multiplication
÷	&#247;	&divide;	division

ISO 8859-1 Characters

Character	Entity Number	Entity Name	Description
À	&#192;	&Agrave;	capital a, grave accent
Á	&#193;	&Aacute;	capital a, acute accent
Â	&#194;	&Acirc;	capital a, circumflex accent
Ã	&#195;	&Atilde;	capital a, tilde
Ä	&#196;	&Auml;	capital a, umlaut mark
Å	&#197;	&Aring;	capital a, ring
Æ	&#198;	&AElig;	capital ae
Ç	&#199;	&Ccedil;	capital c, cedilla
È	&#200;	&Egrave;	capital e, grave accent
É	&#201;	&Eacute;	capital e, acute accent
Ê	&#202;	&Ecirc;	capital e, circumflex accent
Ë	&#203;	&Euml;	capital e, umlaut mark
Ì	&#204;	&Igrave;	capital i, grave accent
Í	&#205;	&Iacute;	capital i, acute accent
Î	&#206;	&Icirc;	capital i, circumflex accent
Ï	&#207;	&Iuml;	capital i, umlaut mark
Ð	&#208;	&ETH;	capital eth, Icelandic
Ñ	&#209;	&Ntilde;	capital n, tilde
Ò	&#210;	&Ograve;	capital o, grave accent
Ó	&#211;	&Oacute;	capital o, acute accent
Ô	&#212;	&Ocirc;	capital o, circumflex accent
Õ	&#213;	&Otilde;	capital o, tilde
Ö	&#214;	&Ouml;	capital o, umlaut mark
Ø	&#216;	&Oslash;	capital o, slash
Ù	&#217;	&Ugrave;	capital u, grave accent
Ú	&#218;	&Uacute;	capital u, acute accent
Û	&#219;	&Ucirc;	capital u, circumflex accent
Ü	&#220;	&Uuml;	capital u, umlaut mark
Ý	&#221;	&Yacute;	capital y, acute accent
Þ	&#222;	&THORN;	capital THORN, Icelandic
ß	&#223;	&szlig;	small sharp s, German
à	&#224;	&agrave;	small a, grave accent
á	&#225;	&aacute;	small a, acute accent
â	&#226;	&acirc;	small a, circumflex accent
ã	&#227;	&atilde;	small a, tilde
ä	&#228;	&auml;	small a, umlaut mark
å	&#229;	&aring;	small a, ring
æ	&#230;	&aelig;	small ae
ç	&#231;	&ccedil;	small c, cedilla
è	&#232;	&egrave;	small e, grave accent
é	&#233;	&eacute;	small e, acute accent
ê	&#234;	&ecirc;	small e, circumflex accent
ë	&#235;	&euml;	small e, umlaut mark
ì	&#236;	&igrave;	small i, grave accent
í	&#237;	&iacute;	small i, acute accent
î	&#238;	&icirc;	small i, circumflex accent
ï	&#239;	&iuml;	small i, umlaut mark
ð	&#240;	&eth;	small eth, Icelandic
ñ	&#241;	&ntilde;	small n, tilde
ò	&#242;	&ograve;	small o, grave accent
ó	&#243;	&oacute;	small o, acute accent
ô	&#244;	&ocirc;	small o, circumflex accent
õ	&#245;	&otilde;	small o, tilde
ö	&#246;	&ouml;	small o, umlaut mark
ø	&#248;	&oslash;	small o, slash
ù	&#249;	&ugrave;	small u, grave accent
ú	&#250;	&uacute;	small u, acute accent
û	&#251;	&ucirc;	small u, circumflex accent
ü	&#252;	&uuml;	small u, umlaut mark
ý	&#253;	&yacute;	small y, acute accent
þ	&#254;	&thorn;	small thorn, Icelandic
ÿ	&#255;	&yuml;	small y, umlaut mark

ESkool

Friday, 28 March 2014
