Knockout Narratives Sale
 
 

Special Offers see all

Enter to WIN a $100 Credit

Subscribe to PowellsBooks.news
for a chance to win.
Privacy Policy

Visit our stores


    Recently Viewed clear list


    The Powell's Playlist | January 5, 2015

    Tim Johnston: IMG The Powell's Playlist: Songs for Not Sleeping by Tim Johnston



    I once told a medical-profession-type lady that I didn't sleep well, that I awoke all through the night and was awake for hours. "What do you do... Continue »

    spacer

This item may be
out of stock.

Click on the button below to search for this title in other formats.


Check for Availability
Add to Wishlist

The Unicode Standard Version 3.0 with CDROM

The Unicode Standard Version 3.0 with CDROM Cover

 

Synopses & Reviews

Publisher Comments:

Unicode Characters for all the languages of the world The standard for the new millennium Required for XML and the Internet The basis for modern software standards and products The official way to implement ISO/IEC 10646 The key to global interoperability The Unicode Standard, Version 3.0 The authoritative technical guide to the creation of software for worldwide use. x Detailed specifications for Unicode: x Structure, conformance, encoding forms, character properties, semantics, equivalence, combining characters, logical ordering, conversion, allocation, big/little endian usage, Korean syllables formation, control characters, case mappings, numeric values, mathematical properties, writing directions (Arabic, Japanese, English,...), character shaping (Arabic, Devanagari, Tamil,...),... x Expanded implementation guidelines by experts in global software design: x Normalization, sorting and searching, case mapping, compression, language tagging, boundaries (characters, word, lines, and sentences), rendering of non-spacing marks, transcoding to other character sets, handling unknown characters, surrogate pairs, numbers, editing and selection, keyboard input,... x Comprehensive charts, references, glossary, and indices: x Codes, names, appearances, aliases, cross-references, equivalences, radical-stroke ideographic index, Shift-JIS index,... CD-ROM x The comprehensive Unicode Character Database for: x character codes, names, properties, decompositions, upper/lower/titlecases, normalizations, shaping,... x International, national, and vendor character mappings for: x Western European, Japanese, Chinese, Korean, Greek, Russian,... x Windows, Macintosh, Unix, Linux,... x Unicode Technical Reports, extending the standard for: x Sorting, displaying, normalizing, linebreaking, compression, serialization, regular expressions, CR/LF, XML, case mappings,...

Book News Annotation:

A book/CD-ROM technical guide to the Unicode character encoding standard, the international character code for information processing that includes all major scripts of the world and is the foundation for development of software for worldwide use. Early chapters cover information engineers need to produce a conforming implementation. Later chapters give basic information about each script and discuss specific characters. Includes reference and background information, plus a glossary. The CD-ROM contains a Unicode Character Database, technical reports, and international, national, and vendor character mappings for various languages.
Annotation c. Book News, Inc., Portland, OR (booknews.com)

Synopsis:

This book/CD-ROM is "the" authoritative source of information on the Unicode character-encoding standard, which makes it possible to create global software and share data across languages, nations, and locales worldwide. Encompassing all of the world's widely-used scripts and character sets, Unicode represents the foundation for international software. Key principles are outlined in the book and the CD-ROM contains updated tables of character properties and mapping to international, national, and vendor character sets.

Description:

Unicode Consortium Unicode Characters for all the languages of the world The standard for the new millennium Required for XML and the Internet The basis for modern software standards and products The official way to implement ISO/IEC 10646 The key to global interoperability The Unicode Standard, Version 3.0 The authoritative, technical guide to the creation of software for worldwide use. Detailed specifications for Unicode: Structure, conformance, encoding forms, character properties, semantics, equivalence, combining characters, logical ordering, conversion, allocation, big/little endian usage, Korean syllable formation, control characters, case mappings, numeric values, mathematical properties, writing directions (Arabic, Japanese, English, and so on), character shaping (Arabic, Devanagari, Tamil, and so on) Expanded implementation guidelines by experts in global software design: Normalization, sorting and searching, case mapping, compression, language tagging, boundaries (characters, word, lines, and sentences), rendering of non-spacing marks, transcoding to other character sets, handling unknown characters, surrogate pairs, numbers, editing and selection, keyboard input, and more Comprehensive charts, references, glossary, and indexes: Codes, names, appearances, aliases, cross-references, equivalences, radical-stroke ideographic index, Shift-JIS index, and more CD-ROM The comprehensive Unicode Character Database for: Character codes, names, properties, decompositions, upper- ,lower-, and title cases, normalizations, shaping International, national, and vendor character mappings for: Western European, Japanese, Chinese, Korean, Greek, Russian, and others Windows, Macintosh, Unix, and Linux Unicode Technical Reports that extend the standard for: Sorting, displaying, normalizing, linebreaking, compression, serialization, regular expressions, CR/LF, XML, case mappings, and more

Table of Contents

Acknowledgments Unicode Consortium Members and Directors Full Members Current Associate Members Current Liaison Members Current Specialist Members Current Individual Members Current Members of the Board of Directors Contents Figures Tables Preface 0.1 About the Unicode Standard Concepts, Architecture, Conformance, and Guidelines Character Block Descriptions Charts and Index Appendices and Tables The Unicode Character Database and Technical Reports On the CD-ROM 0.2 Notational Conventions Extended BNF Operators 0.3 Resources Unicode Web Site Unicode Anonymous FTP Site Unicode Public Mailing List How to Contact the Unicode Consortium 1 Introduction 1.1 Coverage Standards Coverage New Characters 1.2 Design Basis 1.3 Text Handling Interpreting Characters Text Elements 1.4 The Unicode Standard and ISO/IEC 10646 1.5 The Unicode Consortium The Unicode Technical Committee 2 General Structure 2.1 Architectural Context Basic Text Processes Text Elements, Code Values, and Text Processes Text Processes and Encoding 2.2 Unicode Design Principles Sixteen-Bit Character Codes Efficiency Characters, Not Glyphs Semantics Plain Text Logical Order Unification Dynamic Composition Equivalent Sequence Convertibility 2.3 Encoding Forms UTF-16 UTF-8 Character Encoding Schemes 2.4 Unicode Allocation Allocation Areas Codespace Assignment for Graphic Characters Nongraphic Characters, Reserved and Unassigned Codes 2.5 Writing Direction 2.6 Combining Characters Sequence of Base Characters and Diacritics Multiple Combining Characters Multiple Base Characters Spacing Clones of European Diacritical Marks 2.7 Special Character and Noncharacter Values Byte Order Mark (BOM) Special Noncharacter Values Separators Layout and Format Control Characters The Replacement Character 2.8 Controls and Control Sequences Control Characters Representing Control Sequences 2.9 Conforming to the Unicode Standard Characters Not Used in a Subset 2.10 Referencing Versions of the Unicode Standard 3 Conformance 3.1 Conformance Requirements Byte Ordering Invalid Code Values Interpretation Modification Transformations Bidirectional Text Unicode Technical Reports 3.2 Semantics 3.3 Characters and Coded Representations 3.4 Simple Properties 3.5 Combination 3.6 Decomposition Compatibility Decomposition Canonical Decomposition 3.7 Surrogates 3.8 Transformations 3.9 Special Character Properties 3.10 Canonical Ordering Behavior Combining Classes Canonical Ordering Use with Collation 3.11 Conjoining Jamo Behavior Syllable Boundaries Standard Syllables Hangul Syllable Composition Hangul Syllable Decomposition Hangul Syllable Names 3.12 Bidirectional Behavior Directional Formatting Codes Basic Display Algorithm Definitions Resolving Embedding Levels Reordering Resolved Levels Bidirectional Conformance Implementation Notes 4 Character Properties 4.1 Case--Normative 4.2 Combining Classes--Normative 4.3 Directionality--Normative 4.4 Jamo Short Names--Normative 4.5 General Category--Normative in Part 4.6 Numeric Value--Normative 4.7 Mirrored--Normative 4.8 Unicode 1.0 Names 4.9 Mathematical Property 4.10 Letters and Other Useful Properties 5 Implementation Guidelines 5.1 Transcoding to Other Standards Issues Multistage Tables 7-Bit or 8-Bit Transmission Mapping Table Resources 5.2 ANSI/ISO C wchar_t 5.3 Unknown and Missing Characters Unassigned and Private Use Character Codes Interpretable but Unrenderable Characters Reassigned Characters 5.4 Handling Surrogate Pairs 5.5 Handling Numbers 5.6 Handling Properties 5.7 Normalization 5.8 Compression 5.9 Line Handling 5.10 Regular Expressions 5.11 Language Information in Plain Text Requirements for Language Tagging Working with Language Tags Language Tags and Han Unification 5.12 Editing and Selection Consistent Text Elements 5.13 Strategies for Handling Nonspacing Marks Keyboard Input Truncation 5.14 Rendering Nonspacing Marks Positioning Methods 5.15 Locating Text Element Boundaries Boundary Specification Example Specifications Grapheme Boundaries Word Boundaries Line Boundaries Sentence Boundaries Random Access 5.16 Identifiers Syntactic Rule 5.17 Sorting and Searching Culturally Expected Sorting Unicode Character Equivalence Similar Characters Levels of Comparison Ignorable Characters Multiple Mappings Collating Out-of-Scope Characters Unmapped Characters Parameterization Optimizations Searching Sublinear Searching 5.18 Case Mappings 6 Punctuation 6.1 General Punctuation Punctuation: U+0020-U+00BF General Punctuation: U+2000-U+206F CJK Symbols and Punctuation: U+3000-U+303F CJK Compatibility Forms: U+FE30-U+FE4F Small Form Variants: U+FE50-U+FE6F 7 European Alphabetic Scripts 7.1 Latin Letters of Basic Latin: U+0041-U+007A Letters of the Latin-1 Supplement: U+00C0-U+00FF Latin Extended-A: U+0100-U+017F Latin Extended-B: U+0180-U+024F IPA Extensions: U+0250-U+02AF Latin Extended Additional: U+1E00-U+1EFF Latin Ligatures: FB00-FB06 7.2 Greek Greek: U+0370-U+03FF Greek Extended: U+1F00-U+1FFF 7.3 Cyrillic Cyrillic: U+0400-U+04FF 7.4 Armenian Armenian: U+0530-U+058F 7.5 Georgian Georgian: U+10A0-U+10FF 7.6 Runic Runic: U+16A0-U+16F0 7.7 Ogham Ogham: U+1680-U+169F 7.8 Modifier Letters Spacing Modifier Letters: U+02B0-U+02FF 7.9 Combining Marks Combining Diacritical Marks: U+0300-U+036F Combining Marks for Symbols: U+20D0-U+20FF Combining Half Marks: U+FE20-U+FE2F 8 Middle Eastern Scripts 8.1 Hebrew Hebrew: U+0590-U+05FF Alphabetic Presentation Forms: U+FB1D-U+FB4F 8.2 Arabic Arabic: U+0600-U+06FF Cursive Joining Ligatures Arabic Presentation Forms-A: U+FB50-U+FDFF Arabic Presentation Forms-B: U+FE70-U+FEFF 8.3 Syriac Syriac: U+0700-U+074F Syriac Shaping Syriac Cursive Joining Ligatures 8.4 Thaana Thaana: U+0780-U+07BF 9 South and Southeast Asian Scripts 9.1 Devanagari Devanagari: U+0900-U+097F 9.2 Bengali Bengali: U+0980-U+09FF 9.3 Gurmukhi Gurmukhi: U+0A00-U+0A7F 9.4 Gujarati Gujarati: U+0A80-U+0AFF 9.5 Oriya Oriya: U+0B00-U+0B7F 9.6 Tamil Tamil: U+0B80-U+0BFF 9.7 Telugu Telugu: U+0C00-U+0C7F 9.8 Kannada Kannada: U+0C80-U+0CFF 9.9 Malayalam Malayalam: U+0D00-U+0D7F 9.10 Sinhala Sinhala: U+0D80-U+0DFF 9.11 Thai Thai: U+0E00-U+0E7F 9.12 Lao Lao: U+0E80-U+0EFF 9.13 Tibetan Tibetan: U+0F00-U+0FBF 9.14 Myanmar Myanmar: U+1000-U+109F 9.15 Khmer Khmer: U+1780-U+17FF 10 East Asian Scripts 10.1 Han CJK Unified Ideographs CJK Compatibility Ideographs: U+F900-U+FAFF Kanbun: U+3190-U+319F CJK and KangXi Radicals: U+2E80-U+2FD5 Ideographic Description: U+2FF0-U+2FFB 10.2 Hiragana Hiragana: U+3040-U+309F 10.3 Katakana Katakana: U+30A0-U+30FF Halfwidth and Fullwidth Forms: U+FF00-U+FFEF 10.4 Hangul Hangul Jamo: U+1100-U+11FF Hangul Compatibility Jamo: U+3130-U+318F Hangul Syllables: U+AC00-U+D7A3 10.5 Bopomofo Bopomofo: U+3100-U+312F 10.6 Yi Yi: U+A000-U+A4CF 11 Additional Scripts 11.1 Ethiopic Ethiopic: U+1200-U+137F 11.2 Cherokee Cherokee: U+13A0-U+13FF 11.3 Canadian Aboriginal Syllabics Canadian Aboriginal Syllabics: U+1400-U+167F 11.4 Mongolian Mongolian: U+1800-U+18AF 12 Symbols 12.1 Currency Symbols Currency Symbols: U+20A0-U+20CF 12.2 Letterlike Symbols Letterlike Symbols: U+2100-U+214F 12.3 Number Forms Number Forms: U+2150-U+218F Superscripts and Subscripts: U+2070-U+209F 12.4 Mathematical Operators Mathematical Operators: U+2200-U+22FF Arrows: U+2190-U+21FF 12.5 Technical Symbols Control Pictures: U+2400-U+243F Miscellaneous Technical: U+2300-U+23FF Optical Character Recognition: U+2440-U+245F 12.6 Geometrical Symbols Box Drawing: U+2500-U+257F Block Elements: U+2580-U+259F Geometric Shapes: U+25A0-U+25FF 12.7 Miscellaneous Symbols and Dingbats Miscellaneous Symbols: U+2600-U+26FF Dingbats: U+2700-U+27BF 12.8 Enclosed and Square Enclosed Alphanumerics: U+2460-U+24FF Enclosed CJK Letters and Months: U+3200-U+32FF CJK Compatibility: U+3300-U+33FF 12.9 Braille Braille: U+2800-U+28FF 13 Special Areas and Format Characters 13.1 Control Codes C0 Control Codes: U+0000-U+001F C1 Control Codes: U+0080-U+009F 13.2 Layout Controls Layout Controls 13.3 Deprecated Format Characters Deprecated Format Characters: U+206A-U+206F 13.4 Surrogates Area Surrogates Area: U+D800-U+DFFF 13.5 Private Use Area Private Use Area: U+E000-U+F8FF 13.6 Specials Specials: U+FEFF, U+FFF0-U+FFFF 14 Code Charts 14.1 Character Names List Images in the Code Charts and Character Lists Cross References Case Form Mappings Decompositions Information About Languages Reserved Characters 14.2 CJK Unified Ideographs 14.3 Hangul Syllables 15 Han Indices 15.1 Han Radical-Stroke Index 15.2 Shift-JIS Index A Han Unification History B Submitting New Characters B.1 Proposal Guidelines B.2 Requirements of Proposal Form and Process Interim Solutions Sending Proposals C Relationship to ISO/IEC 10646 C.1 History Unicode 1.0 Unicode 2.0 Unicode 3.0 C.2 Encoding Forms in ISO/IEC 10646 Zero Extending C.3 UCS Transformation Formats UTF-8 UTF-16 C.4 Synchronization of the Standards C.5 Identification of Features for the Unicode Standard C.6 Character Names C.7 Character Functional Specifications D Changes from Unicode Version 2.0 D.1 Versions of the Unicode Standard D.2 Changes from Unicode Version 2.0 to Version 2.1 New Characters Added Character Semantics Changes Changes Affecting Conformance D.3 Changes from Unicode Version 2.1 to Version 3.0 New Characters Added Character Semantics Changes Changes Affecting Conformance Unicode Technical Reports G Glossary R References R.1 Source Standards R.2 Source Dictionaries for Han Unification R.3 Other Sources for the Unicode Standard R.4 Selected Resources I Indices I.1 Unicode Names Index I.2 General Index

Product Details

ISBN:
9780201616330
Author:
Unicode Consortium
Publisher:
Addison-Wesley Professional
Location:
Reading, Mass.
Subject:
Programming Languages - General
Subject:
Programming - General
Subject:
Programming - Software Development
Subject:
Unicode (Computer character set)
Subject:
Unicode
Edition Number:
1
Series Volume:
888
Publication Date:
February 2000
Binding:
Hardcover
Language:
English
Illustrations:
Yes
Pages:
1072
Dimensions:
29 cm. +

Related Subjects

The Unicode Standard Version 3.0 with CDROM
0 stars - 0 reviews
$ In Stock
Product details 1072 pages Addison-Wesley Professional - English 9780201616330 Reviews:
"Synopsis" by , This book/CD-ROM is "the" authoritative source of information on the Unicode character-encoding standard, which makes it possible to create global software and share data across languages, nations, and locales worldwide. Encompassing all of the world's widely-used scripts and character sets, Unicode represents the foundation for international software. Key principles are outlined in the book and the CD-ROM contains updated tables of character properties and mapping to international, national, and vendor character sets.
spacer
spacer
  • back to top

FOLLOW US ON...

     
Powell's City of Books is an independent bookstore in Portland, Oregon, that fills a whole city block with more than a million new, used, and out of print books. Shop those shelves — plus literally millions more books, DVDs, and gifts — here at Powells.com.