net.sourceforge.wohenchan.encoding
Class UPlus
java.lang.Object
|
+--net.sourceforge.wohenchan.encoding.UPlus
- All Implemented Interfaces:
- LanguageInfoInterface
- public class UPlus
- extends java.lang.Object
- implements LanguageInfoInterface
Information about the U+ unicode notation.
The following is taken from
http://www.unicode.org/unicode/standard/versions/Unicode3.0.1.html
In running text, an individual Unicode code point can be expressed
as U+n, where n is from four to six hexadecimal digits, using the
digits 0-9 and A-F (for 10 through 15, respectively). There should
be no leading zeros, unless the codepoint would have fewer than
four hexadecimal digits; for example, U+0001, U+0012, U+0123,
U+1234, U+12345, U+102345.
- Version:
- $Name: $ $Date: 2003/09/14 08:26:56 $
- Author:
- $Author: wtanaka $
Constructor Summary |
private |
UPlus()
|
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
CANONICAL
public static final java.lang.String CANONICAL
- See Also:
- Constant Field Values
s_singleton
static final UPlus s_singleton
UPlus
private UPlus()
getInstance
public static UPlus getInstance()
getCanonicalString
public java.lang.String getCanonicalString()
- Specified by:
getCanonicalString
in interface LanguageInfoInterface
- Returns:
- the canonical string describing this language.
getProbability
public int getProbability(byte[] input)
- The current implementation of this detection algorithm sucks.
isCharacterInLanguage
public boolean isCharacterInLanguage(char ch)
- Description copied from interface:
LanguageInfoInterface
- Returns true if ch is in the language, or false if it is not.
- Specified by:
isCharacterInLanguage
in interface LanguageInfoInterface