|
|
||
![]() |
||
| HELP | ||
|
$6.00 List price:
HARDCVR W/COMP MEDIA, USED
Ships in 1 to 3 days
The Unicode Standard Version 3.0 with CDROMby Unicode Consortium
Synopses & ReviewsPublisher Comments:Unicode Characters for all the languages of the world
The standard for the new millennium
Required for XML and the Internet
The basis for modern software standards and products
The official way to implement ISO/IEC 10646
The key to global interoperability
The Unicode Standard, Version 3.0
The authoritative technical guide to the creation of software for worldwide use.
x Detailed specifications for Unicode:
x Structure, conformance, encoding forms, character properties, semantics, equivalence, combining characters, logical ordering, conversion, allocation, big/little endian usage, Korean syllables formation, control characters, case mappings, numeric values, mathematical properties, writing directions (Arabic, Japanese, English,...), character shaping (Arabic, Devanagari, Tamil,...),...
x Expanded implementation guidelines by experts in global software design:
x Normalization, sorting and searching, case mapping, compression, language tagging, boundaries (characters, word, lines, and sentences), rendering of non-spacing marks, transcoding to other character sets, handling unknown characters, surrogate pairs, numbers, editing and selection, keyboard input,...
x Comprehensive charts, references, glossary, and indices:
x Codes, names, appearances, aliases, cross-references, equivalences, radical-stroke ideographic index, Shift-JIS index,...
CD-ROM
x The comprehensive Unicode Character Database for:
x character codes, names, properties, decompositions, upper/lower/titlecases, normalizations, shaping,...
x International, national, and vendor character mappings for:
x Western European, Japanese, Chinese, Korean, Greek, Russian,...
x Windows, Macintosh, Unix, Linux,...
x Unicode Technical Reports, extending the standard for:
x Sorting, displaying, normalizing, linebreaking, compression, serialization, regular expressions, CR/LF, XML, case mappings,... Book News Annotation:A book/CD-ROM technical guide to the Unicode character encoding
standard, the international character code for information processing
that includes all major scripts of the world and is the foundation
for development of software for worldwide use. Early chapters cover
information engineers need to produce a conforming implementation.
Later chapters give basic information about each script and discuss
specific characters. Includes reference and background information,
plus a glossary. The CD-ROM contains a Unicode Character Database,
technical reports, and international, national, and vendor character
mappings for various languages.
Annotation c. Book News, Inc., Portland, OR (booknews.com) Synopsis:This book/CD-ROM is "the" authoritative source of information on the Unicode character-encoding standard, which makes it possible to create global software and share data across languages, nations, and locales worldwide. Encompassing all of the world's widely-used scripts and character sets, Unicode represents the foundation for international software. Key principles are outlined in the book and the CD-ROM contains updated tables of character properties and mapping to international, national, and vendor character sets. Description:Unicode Consortium
Unicode
Characters for all the languages of the world
The standard for the new millennium
Required for XML and the Internet
The basis for modern software standards and products
The official way to implement ISO/IEC 10646
The key to global interoperability
The Unicode Standard, Version 3.0
The authoritative, technical guide to the creation of software for worldwide use.
Detailed specifications for Unicode:
Structure, conformance, encoding forms, character properties, semantics, equivalence, combining characters, logical ordering, conversion, allocation, big/little
endian usage, Korean syllable formation, control characters, case mappings, numeric values, mathematical properties, writing directions (Arabic, Japanese,
English, and so on), character shaping (Arabic, Devanagari, Tamil, and so on)
Expanded implementation guidelines by experts in global software design:
Normalization, sorting and searching, case mapping, compression, language tagging, boundaries (characters, word, lines, and sentences), rendering of
non-spacing marks, transcoding to other character sets, handling unknown characters, surrogate pairs, numbers, editing and selection, keyboard input, and
more
Comprehensive charts, references, glossary, and indexes:
Codes, names, appearances, aliases, cross-references, equivalences, radical-stroke ideographic index, Shift-JIS index, and more
CD-ROM
The comprehensive Unicode Character Database for:
Character codes, names, properties, decompositions, upper- ,lower-, and title cases, normalizations, shaping
International, national, and vendor character mappings for:
Western European, Japanese, Chinese, Korean, Greek, Russian, and others
Windows, Macintosh, Unix, and Linux
Unicode Technical Reports that extend the standard for:
Sorting, displaying, normalizing, linebreaking, compression, serialization, regular expressions, CR/LF, XML, case mappings, and more Table of ContentsAcknowledgments Unicode Consortium Members and Directors
Full Members
Current Associate Members
Current Liaison Members
Current Specialist Members
Current Individual Members
Current Members of the Board of Directors
Contents
Figures
Tables
Preface
0.1 About the Unicode Standard
Concepts, Architecture, Conformance, and Guidelines
Character Block Descriptions
Charts and Index
Appendices and Tables
The Unicode Character Database and Technical Reports
On the CD-ROM
0.2 Notational Conventions
Extended BNF
Operators
0.3 Resources
Unicode Web Site
Unicode Anonymous FTP Site
Unicode Public Mailing List
How to Contact the Unicode Consortium
1 Introduction
1.1 Coverage
Standards Coverage
New Characters
1.2 Design Basis
1.3 Text Handling
Interpreting Characters
Text Elements
1.4 The Unicode Standard and ISO/IEC 10646
1.5 The Unicode Consortium
The Unicode Technical Committee
2 General Structure
2.1 Architectural Context
Basic Text Processes
Text Elements, Code Values, and Text Processes
Text Processes and Encoding
2.2 Unicode Design Principles
Sixteen-Bit Character Codes
Efficiency
Characters, Not Glyphs
Semantics
Plain Text
Logical Order
Unification
Dynamic Composition
Equivalent Sequence
Convertibility
2.3 Encoding Forms
UTF-16
UTF-8
Character Encoding Schemes
2.4 Unicode Allocation
Allocation Areas
Codespace Assignment for Graphic Characters
Nongraphic Characters, Reserved and Unassigned Codes
2.5 Writing Direction
2.6 Combining Characters
Sequence of Base Characters and Diacritics
Multiple Combining Characters
Multiple Base Characters
Spacing Clones of European Diacritical Marks
2.7 Special Character and Noncharacter Values
Byte Order Mark (BOM)
Special Noncharacter Values
Separators
Layout and Format Control Characters
The Replacement Character
2.8 Controls and Control Sequences
Control Characters
Representing Control Sequences
2.9 Conforming to the Unicode Standard
Characters Not Used in a Subset
2.10 Referencing Versions of the Unicode Standard
3 Conformance
3.1 Conformance Requirements
Byte Ordering
Invalid Code Values
Interpretation
Modification
Transformations
Bidirectional Text
Unicode Technical Reports
3.2 Semantics
3.3 Characters and Coded Representations
3.4 Simple Properties
3.5 Combination
3.6 Decomposition
Compatibility Decomposition
Canonical Decomposition
3.7 Surrogates
3.8 Transformations
3.9 Special Character Properties
3.10 Canonical Ordering Behavior
Combining Classes
Canonical Ordering
Use with Collation
3.11 Conjoining Jamo Behavior
Syllable Boundaries
Standard Syllables
Hangul Syllable Composition
Hangul Syllable Decomposition
Hangul Syllable Names
3.12 Bidirectional Behavior
Directional Formatting Codes
Basic Display Algorithm
Definitions
Resolving Embedding Levels
Reordering Resolved Levels
Bidirectional Conformance
Implementation Notes
4 Character Properties
4.1 Case--Normative
4.2 Combining Classes--Normative
4.3 Directionality--Normative
4.4 Jamo Short Names--Normative
4.5 General Category--Normative in Part
4.6 Numeric Value--Normative
4.7 Mirrored--Normative
4.8 Unicode 1.0 Names
4.9 Mathematical Property
4.10 Letters and Other Useful Properties
5 Implementation Guidelines
5.1 Transcoding to Other Standards
Issues
Multistage Tables
7-Bit or 8-Bit Transmission
Mapping Table Resources
5.2 ANSI/ISO C wchar_t
5.3 Unknown and Missing Characters
Unassigned and Private Use Character Codes
Interpretable but Unrenderable Characters
Reassigned Characters
5.4 Handling Surrogate Pairs
5.5 Handling Numbers
5.6 Handling Properties
5.7 Normalization
5.8 Compression
5.9 Line Handling
5.10 Regular Expressions
5.11 Language Information in Plain Text
Requirements for Language Tagging
Working with Language Tags
Language Tags and Han Unification
5.12 Editing and Selection
Consistent Text Elements
5.13 Strategies for Handling Nonspacing Marks
Keyboard Input
Truncation
5.14 Rendering Nonspacing Marks
Positioning Methods
5.15 Locating Text Element Boundaries
Boundary Specification
Example Specifications
Grapheme Boundaries
Word Boundaries
Line Boundaries
Sentence Boundaries
Random Access
5.16 Identifiers
Syntactic Rule
5.17 Sorting and Searching
Culturally Expected Sorting
Unicode Character Equivalence
Similar Characters
Levels of Comparison
Ignorable Characters
Multiple Mappings
Collating Out-of-Scope Characters
Unmapped Characters
Parameterization
Optimizations
Searching
Sublinear Searching
5.18 Case Mappings
6 Punctuation
6.1 General Punctuation
Punctuation: U+0020-U+00BF
General Punctuation: U+2000-U+206F
CJK Symbols and Punctuation: U+3000-U+303F
CJK Compatibility Forms: U+FE30-U+FE4F
Small Form Variants: U+FE50-U+FE6F
7 European Alphabetic Scripts
7.1 Latin
Letters of Basic Latin: U+0041-U+007A
Letters of the Latin-1 Supplement: U+00C0-U+00FF
Latin Extended-A: U+0100-U+017F
Latin Extended-B: U+0180-U+024F
IPA Extensions: U+0250-U+02AF
Latin Extended Additional: U+1E00-U+1EFF
Latin Ligatures: FB00-FB06
7.2 Greek
Greek: U+0370-U+03FF
Greek Extended: U+1F00-U+1FFF
7.3 Cyrillic
Cyrillic: U+0400-U+04FF
7.4 Armenian
Armenian: U+0530-U+058F
7.5 Georgian
Georgian: U+10A0-U+10FF
7.6 Runic
Runic: U+16A0-U+16F0
7.7 Ogham
Ogham: U+1680-U+169F
7.8 Modifier Letters
Spacing Modifier Letters: U+02B0-U+02FF
7.9 Combining Marks
Combining Diacritical Marks: U+0300-U+036F
Combining Marks for Symbols: U+20D0-U+20FF
Combining Half Marks: U+FE20-U+FE2F
8 Middle Eastern Scripts
8.1 Hebrew
Hebrew: U+0590-U+05FF
Alphabetic Presentation Forms: U+FB1D-U+FB4F
8.2 Arabic
Arabic: U+0600-U+06FF
Cursive Joining
Ligatures
Arabic Presentation Forms-A: U+FB50-U+FDFF
Arabic Presentation Forms-B: U+FE70-U+FEFF
8.3 Syriac
Syriac: U+0700-U+074F
Syriac Shaping
Syriac Cursive Joining
Ligatures
8.4 Thaana
Thaana: U+0780-U+07BF
9 South and Southeast Asian Scripts
9.1 Devanagari
Devanagari: U+0900-U+097F
9.2 Bengali
Bengali: U+0980-U+09FF
9.3 Gurmukhi
Gurmukhi: U+0A00-U+0A7F
9.4 Gujarati
Gujarati: U+0A80-U+0AFF
9.5 Oriya
Oriya: U+0B00-U+0B7F
9.6 Tamil
Tamil: U+0B80-U+0BFF
9.7 Telugu
Telugu: U+0C00-U+0C7F
9.8 Kannada
Kannada: U+0C80-U+0CFF
9.9 Malayalam
Malayalam: U+0D00-U+0D7F
9.10 Sinhala
Sinhala: U+0D80-U+0DFF
9.11 Thai
Thai: U+0E00-U+0E7F
9.12 Lao
Lao: U+0E80-U+0EFF
9.13 Tibetan
Tibetan: U+0F00-U+0FBF
9.14 Myanmar
Myanmar: U+1000-U+109F
9.15 Khmer
Khmer: U+1780-U+17FF
10 East Asian Scripts
10.1 Han
CJK Unified Ideographs
CJK Compatibility Ideographs: U+F900-U+FAFF
Kanbun: U+3190-U+319F
CJK and KangXi Radicals: U+2E80-U+2FD5
Ideographic Description: U+2FF0-U+2FFB
10.2 Hiragana
Hiragana: U+3040-U+309F
10.3 Katakana
Katakana: U+30A0-U+30FF
Halfwidth and Fullwidth Forms: U+FF00-U+FFEF
10.4 Hangul
Hangul Jamo: U+1100-U+11FF
Hangul Compatibility Jamo: U+3130-U+318F
Hangul Syllables: U+AC00-U+D7A3
10.5 Bopomofo
Bopomofo: U+3100-U+312F
10.6 Yi
Yi: U+A000-U+A4CF
11 Additional Scripts
11.1 Ethiopic
Ethiopic: U+1200-U+137F
11.2 Cherokee
Cherokee: U+13A0-U+13FF
11.3 Canadian Aboriginal Syllabics
Canadian Aboriginal Syllabics: U+1400-U+167F
11.4 Mongolian
Mongolian: U+1800-U+18AF
12 Symbols
12.1 Currency Symbols
Currency Symbols: U+20A0-U+20CF
12.2 Letterlike Symbols
Letterlike Symbols: U+2100-U+214F
12.3 Number Forms
Number Forms: U+2150-U+218F
Superscripts and Subscripts: U+2070-U+209F
12.4 Mathematical Operators
Mathematical Operators: U+2200-U+22FF
Arrows: U+2190-U+21FF
12.5 Technical Symbols
Control Pictures: U+2400-U+243F
Miscellaneous Technical: U+2300-U+23FF
Optical Character Recognition: U+2440-U+245F
12.6 Geometrical Symbols
Box Drawing: U+2500-U+257F
Block Elements: U+2580-U+259F
Geometric Shapes: U+25A0-U+25FF
12.7 Miscellaneous Symbols and Dingbats
Miscellaneous Symbols: U+2600-U+26FF
Dingbats: U+2700-U+27BF
12.8 Enclosed and Square
Enclosed Alphanumerics: U+2460-U+24FF
Enclosed CJK Letters and Months: U+3200-U+32FF
CJK Compatibility: U+3300-U+33FF
12.9 Braille
Braille: U+2800-U+28FF
13 Special Areas and Format Characters
13.1 Control Codes
C0 Control Codes: U+0000-U+001F
C1 Control Codes: U+0080-U+009F
13.2 Layout Controls
Layout Controls
13.3 Deprecated Format Characters
Deprecated Format Characters: U+206A-U+206F
13.4 Surrogates Area
Surrogates Area: U+D800-U+DFFF
13.5 Private Use Area
Private Use Area: U+E000-U+F8FF
13.6 Specials
Specials: U+FEFF, U+FFF0-U+FFFF
14 Code Charts
14.1 Character Names List
Images in the Code Charts and Character Lists
Cross References
Case Form Mappings
Decompositions
Information About Languages
Reserved Characters
14.2 CJK Unified Ideographs
14.3 Hangul Syllables
15 Han Indices
15.1 Han Radical-Stroke Index
15.2 Shift-JIS Index
A Han Unification History
B Submitting New Characters
B.1 Proposal Guidelines
B.2 Requirements of Proposal Form and Process
Interim Solutions
Sending Proposals
C Relationship to ISO/IEC 10646
C.1 History
Unicode 1.0
Unicode 2.0
Unicode 3.0
C.2 Encoding Forms in ISO/IEC 10646
Zero Extending
C.3 UCS Transformation Formats
UTF-8
UTF-16
C.4 Synchronization of the Standards
C.5 Identification of Features for the Unicode Standard
C.6 Character Names
C.7 Character Functional Specifications
D Changes from Unicode Version 2.0
D.1 Versions of the Unicode Standard
D.2 Changes from Unicode Version 2.0 to Version 2.1
New Characters Added
Character Semantics Changes
Changes Affecting Conformance
D.3 Changes from Unicode Version 2.1 to Version 3.0
New Characters Added
Character Semantics Changes
Changes Affecting Conformance
Unicode Technical Reports
G Glossary
R References
R.1 Source Standards
R.2 Source Dictionaries for Han Unification
R.3 Other Sources for the Unicode Standard
R.4 Selected Resources
I Indices
I.1 Unicode Names Index
I.2 General Index What Our Readers Are SayingBe the first to add a comment for a chance to win!Product Details
Other books you might like
Related Aisles | ||||||||||||
|
| |||||||||||||
|
|
|||||||||||||