org.apache.lucene.analysis.cz
public final class CzechAnalyzer extends Analyzer
Field Summary | |
---|---|
static String[] | CZECH_STOP_WORDS
List of typical stopwords. |
Constructor Summary | |
---|---|
CzechAnalyzer()
Builds an analyzer with the default stop words (CZECH_STOP_WORDS). | |
CzechAnalyzer(String[] stopwords)
Builds an analyzer with the given stop words. | |
CzechAnalyzer(Hashtable stopwords)
Builds an analyzer with the given stop words.
| |
CzechAnalyzer(HashSet stopwords) | |
CzechAnalyzer(File stopwords)
Builds an analyzer with the given stop words. |
Method Summary | |
---|---|
void | loadStopWords(InputStream wordfile, String encoding)
Loads stopwords hash from resource stream (file, database...). |
TokenStream | tokenStream(String fieldName, Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
|
Deprecated:
Builds an analyzer with the given stop words.Parameters: wordfile File containing the wordlist encoding Encoding used (win-1250, iso-8859-2, ...), null for default system encoding
Returns: A TokenStream build from a StandardTokenizer filtered with StandardFilter, LowerCaseFilter, and StopFilter