net.sourceforge.wohenchan.encoding
Class UPlus

java.lang.Object
  |
  +--net.sourceforge.wohenchan.encoding.UPlus
All Implemented Interfaces:
LanguageInfoInterface

public class UPlus
extends java.lang.Object
implements LanguageInfoInterface

Information about the U+ unicode notation. The following is taken from http://www.unicode.org/unicode/standard/versions/Unicode3.0.1.html

 In running text, an individual Unicode code point can be expressed
 as U+n, where n is from four to six hexadecimal digits, using the
 digits 0-9 and A-F (for 10 through 15, respectively). There should
 be no leading zeros, unless the codepoint would have fewer than
 four hexadecimal digits; for example, U+0001, U+0012, U+0123,
 U+1234, U+12345, U+102345.
 

Version:
$Name: $ $Date: 2003/09/14 08:26:56 $
Author:
$Author: wtanaka $

Field Summary
static java.lang.String CANONICAL
           
(package private) static UPlus s_singleton
           
 
Fields inherited from interface net.sourceforge.wohenchan.encoding.LanguageInfoInterface
ALL_ENCODINGS
 
Constructor Summary
private UPlus()
           
 
Method Summary
 java.lang.String getCanonicalString()
           
static UPlus getInstance()
           
 int getProbability(byte[] input)
          The current implementation of this detection algorithm sucks.
 boolean isCharacterInLanguage(char ch)
          Returns true if ch is in the language, or false if it is not.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

CANONICAL

public static final java.lang.String CANONICAL
See Also:
Constant Field Values

s_singleton

static final UPlus s_singleton
Constructor Detail

UPlus

private UPlus()
Method Detail

getInstance

public static UPlus getInstance()

getCanonicalString

public java.lang.String getCanonicalString()
Specified by:
getCanonicalString in interface LanguageInfoInterface
Returns:
the canonical string describing this language.

getProbability

public int getProbability(byte[] input)
The current implementation of this detection algorithm sucks.


isCharacterInLanguage

public boolean isCharacterInLanguage(char ch)
Description copied from interface: LanguageInfoInterface
Returns true if ch is in the language, or false if it is not.

Specified by:
isCharacterInLanguage in interface LanguageInfoInterface