New Text Document

Uploaded by

rajesh

0% found this document useful (0 votes)

5 views2 pages

about utf 8

Copyright

Available Formats

TXT, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

about utf 8

Copyright:

Available Formats

Download as TXT, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

5 views2 pages

New Text Document

Uploaded by

rajesh

about utf 8

Copyright:

Available Formats

Download as TXT, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

Summary

UTF-8 is a compromise character encoding that can be as compact as ASCII (if the
file is just plain English text) but can also contain any unicode characters (w
ith some increase in file size).
UTF stands for Unicode Transformation Format. The '8' means it uses 8-bit blocks
to represent a character. The number of blocks needed to represent a character
varies from 1 to 4.
One of the really nice features of UTF-8 is that it is compatible with nul-termi
nated strings. No character will have a nul (0) byte when encoded. This means th
at C code that deals with char[] will "just work".
You can try the UTF-8 Test Page to see how well your browser (and default font)
support UTF-8.
If you are an application developer, this Joel On Software article on Unicode is
pretty good summary of all you need to know.
More links:
If you are into the gory details, the official spec is RFC 3629
Markus Kuhn's FAQ
Rob Pike's story about the invention of it
Detail
For any character equal to or below 127 (hex 0x7F), the UTF-8 representation is
one byte. It is just the lowest 7 bits of the full unicode value. This is also t
he same as the ASCII value.
For characters equal to or below 2047 (hex 0x07FF), the UTF-8 representation is
spread across two bytes. The first byte will have the two high bits set and the
third bit clear (i.e. 0xC2 to 0xDF). The second byte will have the top bit set a
nd the second bit clear (i.e. 0x80 to 0xBF).
For all characters equal to or greater than 2048 but less that 65535 (0xFFFF), t
he UTF-8 representation is spread across three bytes.
The following table shows the format of such UTF-8 byte sequences (where the "fr
ee bits" shown by x's in the table are combined in the order shown, and interpre
ted from most significant to least significant).
Binary format of bytes in sequence
1st Byte
2nd Byte
3rd Byte
its
Maximum Expressible Unicode Value
0xxxxxxx
7
110xxxxx
10xxxxxx
1110xxxx
10xxxxxx
10xxxxxx
(65535)
11110xxx
10xxxxxx
10xxxxxx
10FFFF hex (1,114,111)
The value of each individual byte indicates its
00
80
C2
E0
F0

to
to
to
to
to

7F
BF
DF
EF
FF

hex
hex
hex
hex
hex

4th Byte

Number of Free B

007F hex (127)

(5+6)=11
07FF hex (2047)
(4+6+6)=16
FFFF hex
10xxxxxx

(3+6+6+6)=21

UTF-8 function, as follows:

(0 to 127): first and only byte of a sequence.

(128 to 191): continuing byte in a multi-byte sequence.
(194 to 223): first byte of a two-byte sequence.
(224 to 239): first byte of a three-byte sequence.
(240 to 255): first byte of a four-byte sequence.

UTF-8 remains a simple, single-byte, ASCII-compatible encoding method, as long a

s no characters greater than 127 are directly present. This means that an HTML d
ocument technically declared to be encoded as UTF-8 can remain a normal single-b
yte ASCII file. The document can remain so even though it may contain Unicode ch
aracters above 127, as long as all characters above 127 are referred to indirect
ly by ampersand entities.
Examples of encoded Unicode characters (in hexadecimal notation)
16-bit
0001
007F
0080
07FF
0800
FFFF
010000
10FFFF

Unicode UTF-8 Sequence

01
7F
C2 80
DF BF
E0 A0 80
EF BF BF
F0 90 80 80
F4 8F BF BF

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Rating: 4 out of 5 stars
4/5 (5794)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Rating: 4 out of 5 stars
4/5 (1090)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Rating: 4.5 out of 5 stars
4.5/5 (838)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
Rating: 4 out of 5 stars
4/5 (599)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
Rating: 4.5 out of 5 stars
4.5/5 (1713)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Rating: 4 out of 5 stars
4/5 (895)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Rating: 4 out of 5 stars
4/5 (1103)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Rating: 4 out of 5 stars
4/5 (588)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Rating: 4.5 out of 5 stars
4.5/5 (537)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
Rating: 4.5 out of 5 stars
4.5/5 (2104)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Rating: 4.5 out of 5 stars
4.5/5 (345)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Rating: 4.5 out of 5 stars
4.5/5 (474)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
Rating: 4 out of 5 stars
4/5 (1016)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Rating: 4 out of 5 stars
4/5 (821)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
Rating: 4 out of 5 stars
4/5 (1839)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Rating: 4.5 out of 5 stars
4.5/5 (121)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Rating: 4.5 out of 5 stars
4.5/5 (271)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
Rating: 4.5 out of 5 stars
4.5/5 (440)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Toibin
Rating: 3.5 out of 5 stars
3.5/5 (1937)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Rating: 3.5 out of 5 stars
3.5/5 (400)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Rating: 3.5 out of 5 stars
3.5/5 (2259)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
Rating: 4.5 out of 5 stars
4.5/5 (4609)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Rating: 4 out of 5 stars
4/5 (4200)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
Rating: 4.5 out of 5 stars
4.5/5 (1929)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Rating: 4 out of 5 stars
4/5 (98)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
Rating: 4.5 out of 5 stars
4.5/5 (806)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Rating: 4.5 out of 5 stars
4.5/5 (266)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
Rating: 3.5 out of 5 stars
3.5/5 (2322)
Yes Please
From Everand
Yes Please
Amy Poehler
Rating: 4 out of 5 stars
4/5 (1891)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Rating: 3.5 out of 5 stars
3.5/5 (231)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Rating: 4.5 out of 5 stars
4.5/5 (234)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
Rating: 3.5 out of 5 stars
3.5/5 (738)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
Rating: 4 out of 5 stars
4/5 (3811)
John Adams
From Everand
John Adams
David McCullough
Rating: 4.5 out of 5 stars
4.5/5 (2409)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Rating: 4 out of 5 stars
4/5 (74)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
Rating: 4.5 out of 5 stars
4.5/5 (789)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
Rating: 4 out of 5 stars
4/5 (45)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
Rating: 3.5 out of 5 stars
3.5/5 (792)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
Rating: 3.5 out of 5 stars
3.5/5 (104)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Rating: 3.5 out of 5 stars
3.5/5 (137)
Little Women
From Everand
Little Women
Louisa May Alcott
Rating: 4 out of 5 stars
4/5 (104)
Cimo Guide 2014 en I 3
Document36 pages
Cimo Guide 2014 en I 3
lakis
No ratings yet
Design of Footing R1
Document8 pages
Design of Footing R1
URVESHKUMAR PATEL
No ratings yet
Tecsun Pl310et PDF
Document30 pages
Tecsun Pl310et PDF
Axel Bodemann
No ratings yet
Lab 3 Report Fins Redo
Document3 pages
Lab 3 Report Fins Redo
Westley Gomez
No ratings yet
Untitled
Document5 pages
Untitled
april montejo
No ratings yet
22 Thành NG Quen Thu C Trên Ielts - Firefighter
Document2 pages
22 Thành NG Quen Thu C Trên Ielts - Firefighter
Ninh Nguyễn
No ratings yet
Control System PPT DO1
Document11 pages
Control System PPT DO1
Luis Anderson
No ratings yet
Energy Bodies
Document1 page
Energy Bodies
annoyingspore
No ratings yet
Handbook On National Spectrum Management 2015
Document333 pages
Handbook On National Spectrum Management 2015
Marisela Alvarez
No ratings yet
I. Objectives Ii. Content Iii. Learning Resources
Document13 pages
I. Objectives Ii. Content Iii. Learning Resources
Zenia Capalac
No ratings yet
README
Document2 pages
README
tushar patel
No ratings yet
Mathematics4 q4 Week4 v4
Document11 pages
Mathematics4 q4 Week4 v4
Morales Jinx
No ratings yet
The 5 Pivotal Paragraphs in A Paper
Document1 page
The 5 Pivotal Paragraphs in A Paper
Fer Rivas Nieto
No ratings yet
20235UGSEM2206
Document2 pages
20235UGSEM2206
Lovepreet Kaur
No ratings yet
PCM 2.4l Turbo 5 de 5
Document2 pages
PCM 2.4l Turbo 5 de 5
Felix Velasquez
No ratings yet
Types of Computers
Document7 pages
Types of Computers
Syed Badshah Yousafzai
No ratings yet
SD-NOC-MAR-202 - Rev00 Transfer of Personnel at Offshore Facilities
Document33 pages
SD-NOC-MAR-202 - Rev00 Transfer of Personnel at Offshore Facilities
tho03103261
100% (1)
Shift Registers Notes
Document146 pages
Shift Registers Notes
Rajat Kumar
No ratings yet
19 Uco 578
Document20 pages
19 Uco 578
roshan jain
No ratings yet
(Word 365-2019) Mos Word Mocktest
Document4 pages
(Word 365-2019) Mos Word Mocktest
Quỳnh Anh Nguyễn Thái
No ratings yet
Slide 7 PV New
Document74 pages
Slide 7 PV New
Priyanshu Agrawal
No ratings yet
Unit 13 Dialogue Writing: Objectives
Document8 pages
Unit 13 Dialogue Writing: Objectives
Akg Gupt
No ratings yet
Manish Kumar: Desire To Work and Grow in The Field of Mechanical
Document4 pages
Manish Kumar: Desire To Work and Grow in The Field of Mechanical
MANISH
No ratings yet
Second Periodical Test in Organization and Management SY 2018-2019
Document3 pages
Second Periodical Test in Organization and Management SY 2018-2019
Merida Bravo
No ratings yet
Sem
Document583 pages
Sem
Maria Santos
No ratings yet
Hydrology Report at CH-9+491
Document3 pages
Hydrology Report at CH-9+491
juliyet struc
No ratings yet
Empowerment Technology Lesson 4 PDF
Document18 pages
Empowerment Technology Lesson 4 PDF
queenless eightyone
No ratings yet
Bamboo People - An Interdisciplinary Unit For High School
Document6 pages
Bamboo People - An Interdisciplinary Unit For High School
Chipo Jean Marunda
No ratings yet
EEE301 Digital Electronics Lecture 1 Part 3: Dr. A.S.M. Mohsin
Document6 pages
EEE301 Digital Electronics Lecture 1 Part 3: Dr. A.S.M. Mohsin
Aaa Aaa
No ratings yet
William Ury Power of A Positive No Bantam - 2007
Document227 pages
William Ury Power of A Positive No Bantam - 2007
Tam Jeopardy
100% (1)