Yahoo! UI Library

unicode  3.3.0pr1

Yahoo! UI Library > unicode
Search:
 
Filters

Module: unicode

Unicode utilities.

This module contains the following classes:

Submodules:

unicode-accentfold

Provides a basic Unicode accent folding implementation that converts common accented letters (like "á") to their non-accented forms (like "a").

This implementation is not comprehensive, and should only be used as a last resort when accent folding can't be done on the server. A comprehensive accent folding implementation would require much more character data to be sent to the browser, resulting in a significant performance penalty. This implementation strives for a compromise between usefulness and performance.

Accent folding is a destructive operation that can't be reversed, and may change or destroy the actual meaning of the text depending on the language. It should not be used on strings that will later be displayed to a user, unless this is done with the understanding that linguistic meaning may be lost and that you may in fact confuse or insult the user by doing so.

unicode-data-accentfold

An imperfect, incomplete reverse mapping of ASCII characters to case-insensitive regexes that match their most common accented forms.

The goal of this module is to provide a pragmatic and generally useful set of accent folding data, since serving and performing lookups on a complete dataset would be impractical in client-side JavaScript.

Whenever possible, accent folding should be done on the server, where it's possible to use tools that are both more complete and more performant. It should only be done on the client as an absolute last resort.

unicode-data-wordbreak

Unicode data used by the word breaking algorithm.

Whenever possible, word breaking should be done on the server, where it's possible to use tools that are both more complete and more performant. It should only be done on the client as an absolute last resort.

unicode-wordbreak
Provides utility methods for splitting strings on word breaks and determining whether a character represents a word boundary, using the algorithm defined in the Unicode Text Segmentation guidelines (Unicode Standard Annex #29).

Copyright © 2010 Yahoo! Inc. All rights reserved.