public final class CJKAnalyzer
extends org.apache.lucene.analysis.StopwordAnalyzerBase
Analyzer that tokenizes text with StandardTokenizer,
normalizes content with CJKWidthFilter, folds case with
LowerCaseFilter, forms bigrams of CJK with CJKBigramFilter,
and filters stopwords with StopFilter.

| Modifier and Type | Field and Description |
|---|---|
| static String[] | STOP_WORDS Deprecated. Use getDefaultStopSet() instead. |
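
To make the bigram step in the class description concrete, here is a minimal plain-Java sketch of overlapping bigram formation over a run of CJK characters. It does not use Lucene and is not the CJKBigramFilter implementation. The class name CjkBigramSketch and the method bigrams are hypothetical, and keeping a lone character as a unigram is an assumption made for this sketch.

```java
import java.util.ArrayList;
import java.util.List;

/** Simplified illustration of overlapping bigrams; not Lucene's CJKBigramFilter. */
final class CjkBigramSketch {

    /** Returns the overlapping two-character bigrams for a run of CJK characters. */
    static List<String> bigrams(String run) {
        // Work on code points so supplementary-plane ideographs stay intact.
        int[] cps = run.codePoints().toArray();
        List<String> out = new ArrayList<>();
        if (cps.length == 1) {
            // Assumption: a lone character has no neighbor, so keep it as a unigram.
            out.add(new String(cps, 0, 1));
            return out;
        }
        // Slide a window of two code points across the run, one step at a time.
        for (int i = 0; i + 1 < cps.length; i++) {
            out.add(new String(cps, i, 2));
        }
        return out;
    }

    public static void main(String[] args) {
        // Three ideographs (U+65E5 U+672C U+8A9E), written as escapes to keep the source ASCII.
        List<String> result = bigrams("\u65E5\u672C\u8A9E");
        System.out.println(result.size() + " bigrams: " + result);
    }
}
```

For the three-character run 日本語, the sketch yields the two bigrams 日本 and 本語.
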

| Constructor and Description |
|---|
| CJKAnalyzer(org.apache.lucene.util.Version matchVersion) Builds an analyzer which removes words in getDefaultStopSet(). |
| CJKAnalyzer(org.apache.lucene.util.Version matchVersion, Set<?> stopwords) Builds an analyzer with the given stop words. |
| CJKAnalyzer(org.apache.lucene.util.Version matchVersion, String... stopWords) Deprecated. Use CJKAnalyzer(Version, Set) instead. |
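
Callers moving off the deprecated String... constructor need a Set in place of the array. The following is a minimal sketch of that conversion using only the JDK. The helper toStopSet and the class StopwordSetSketch are hypothetical, the stop words are illustrative, and the sketch stops short of calling Lucene. The resulting set is the kind of value the CJKAnalyzer(Version, Set) constructor accepts.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.Set;

/** Hypothetical helper for replacing a legacy stop-word array with a Set. */
final class StopwordSetSketch {

    /** Converts a stop-word array (or varargs list) into a Set. */
    static Set<String> toStopSet(String... stopWords) {
        // LinkedHashSet drops duplicates while keeping first-seen order.
        return new LinkedHashSet<>(Arrays.asList(stopWords));
    }

    public static void main(String[] args) {
        // Illustrative words only; "and" is repeated to show de-duplication.
        Set<String> stopSet = toStopSet("a", "and", "the", "and");
        System.out.println(stopSet); // prints [a, and, the]
    }
}
```
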

| Modifier and Type | Method and Description |
|---|---|
| protected org.apache.lucene.analysis.ReusableAnalyzerBase.TokenStreamComponents | createComponents(String fieldName, Reader reader) |
| static Set<?> | getDefaultStopSet() Returns an unmodifiable instance of the default stop-words set. |
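
Since getDefaultStopSet() is documented as returning an unmodifiable set, attempts to mutate it should be expected to fail. The sketch below shows that contract with a JDK stand-in. The class UnmodifiableSetSketch, its methods, and the word list are hypothetical, and the concrete set type Lucene returns is not assumed here.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

/** Demonstrates the unmodifiable-set contract with a JDK stand-in, not Lucene's set. */
final class UnmodifiableSetSketch {

    /** Stand-in for a default stop set; the words are illustrative, not Lucene's list. */
    static Set<String> defaultStopSet() {
        return Collections.unmodifiableSet(new HashSet<>(Arrays.asList("a", "and", "the")));
    }

    /** Returns true if the set rejects the add, as an unmodifiable set does. */
    static boolean rejectsAdd(Set<String> set, String word) {
        try {
            set.add(word);
            return false; // the set accepted the mutation
        } catch (UnsupportedOperationException e) {
            return true; // read-only views throw instead of mutating
        }
    }

    public static void main(String[] args) {
        Set<String> stopSet = defaultStopSet();
        System.out.println(stopSet.contains("the"));   // reads still work
        System.out.println(rejectsAdd(stopSet, "of")); // writes are rejected
    }
}
```
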
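
Methods inherited from class org.apache.lucene.analysis.StopwordAnalyzerBase:
getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSet

Methods inherited from class org.apache.lucene.analysis.ReusableAnalyzerBase:
initReader, reusableTokenStream, tokenStream

@Deprecated public static final String[] STOP_WORDS
Deprecated. Use getDefaultStopSet() instead.

public CJKAnalyzer(org.apache.lucene.util.Version matchVersion)
Builds an analyzer which removes words in getDefaultStopSet().

public CJKAnalyzer(org.apache.lucene.util.Version matchVersion, Set<?> stopwords)
Builds an analyzer with the given stop words.
matchVersion - Lucene compatibility version
stopwords - a stopword set

@Deprecated public CJKAnalyzer(org.apache.lucene.util.Version matchVersion, String... stopWords)
Deprecated. Use CJKAnalyzer(Version, Set) instead.
stopWords - stop word array

public static Set<?> getDefaultStopSet()
Returns an unmodifiable instance of the default stop-words set.
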
Copyright © 2000-2015 Apache Software Foundation. All Rights Reserved.