org.apache.lucene.analysis
public final class LowerCaseTokenizer extends LetterTokenizer
Note: this does a decent job for most European languages, but does a terrible job for some Asian languages, where words are not separated by spaces.
Constructor Summary | |
---|---|
LowerCaseTokenizer(Reader in) Construct a new LowerCaseTokenizer. |
Method Summary | |
---|---|
protected char | normalize(char c) Collects only characters which satisfy
Character#isLetter(char). |