A B C D E F G H I J K L M N O P Q R S T U V W Y
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- AbstractEncoder - Class in org.apache.lucene.analysis.payloads
-
Base class for payload encoders.
- AbstractEncoder() - Constructor for class org.apache.lucene.analysis.payloads.AbstractEncoder
- accept(AttributeSource) - Method in class org.apache.lucene.analysis.sinks.DateRecognizerSinkFilter
- accept(AttributeSource) - Method in class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
- accept(AttributeSource) - Method in class org.apache.lucene.analysis.sinks.TokenTypeSinkFilter
- ACRONYM_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- add(Reader) - Method in class org.apache.lucene.analysis.synonym.SolrSynonymParser
- add(Reader) - Method in class org.apache.lucene.analysis.synonym.WordnetSynonymParser
- add(CharsRef, CharsRef, boolean) - Method in class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
Add a phrase->phrase synonym mapping.
- addClass(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Add a character class to the tree.
- addClass(String) - Method in interface org.apache.lucene.analysis.compound.hyphenation.PatternConsumer
-
Add a character class.
- addClass(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- addException(String, ArrayList<Object>) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Add an exception to the tree.
- addException(String, ArrayList<Object>) - Method in interface org.apache.lucene.analysis.compound.hyphenation.PatternConsumer
-
Add a hyphenation exception.
- addException(String, ArrayList<Object>) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- addPattern(String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Add a pattern to the tree.
- addPattern(String, String) - Method in interface org.apache.lucene.analysis.compound.hyphenation.PatternConsumer
-
Add hyphenation patterns.
- addPattern(String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- addPrefix(HunspellAffix) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer.Stem
-
Adds a prefix to the list of prefixes used to generate this stem.
- addStopWords(IndexReader) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Deprecated.Stopwords should be calculated at instantiation using
QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader)
- addStopWords(IndexReader, float) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Deprecated.Stowords should be calculated at instantiation using
QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, float)
- addStopWords(IndexReader, int) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Deprecated.Stopwords should be calculated at instantiation using
QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, int)
- addStopWords(IndexReader, String, float) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Deprecated.Stowords should be calculated at instantiation using
QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, Collection, float)
- addStopWords(IndexReader, String, int) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Deprecated.Stowords should be calculated at instantiation using
QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, Collection, int)
- addSuffix(HunspellAffix) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer.Stem
-
Adds a suffix to the list of suffixes used to generate this stem.
- ALEF - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ALEF - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- ALEF_HAMZA_ABOVE - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ALEF_HAMZA_BELOW - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ALEF_MADDA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- alloc(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
This is to implement memory allocation in the array.
- alloc(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- ALPHANUM_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Among - Class in org.tartarus.snowball
- Among(String, int, int, String, SnowballProgram) - Constructor for class org.tartarus.snowball.Among
- analyze(Analyzer, String, CharsRef) - Static method in class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
Sugar: analyzes the text with the analyzer and separates by
SynonymMap.WORD_SEPARATOR
. - APOSTROPHE_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- append(char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- append(CharSequence) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- append(CharSequence, int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- apply(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- applyAffix(char[], int, HunspellAffix, int) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Applies the affix rule to the given word, producing a list of stems if any are found
- ArabicAnalyzer - Class in org.apache.lucene.analysis.ar
-
Analyzer
for Arabic. - ArabicAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Builds an analyzer with the default stop words:
ArabicAnalyzer.DEFAULT_STOPWORD_FILE
. - ArabicAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Deprecated.use
ArabicAnalyzer(Version, Set)
instead - ArabicAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Deprecated.use
ArabicAnalyzer(Version, Set)
instead - ArabicAnalyzer(Version, Hashtable<?, ?>) - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Deprecated.use
ArabicAnalyzer(Version, Set)
instead - ArabicAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Builds an analyzer with the given stop words
- ArabicAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Builds an analyzer with the given stop word.
- ArabicLetterTokenizer - Class in org.apache.lucene.analysis.ar
-
Deprecated.(3.1) Use
StandardTokenizer
instead. - ArabicLetterTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.ar.ArabicLetterTokenizer
-
Deprecated.use
ArabicLetterTokenizer(Version, Reader)
instead. This will be removed in Lucene 4.0. - ArabicLetterTokenizer(AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.ar.ArabicLetterTokenizer
-
Deprecated.use
ArabicLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader)
instead. This will be removed in Lucene 4.0. - ArabicLetterTokenizer(AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.ar.ArabicLetterTokenizer
-
Deprecated.use
ArabicLetterTokenizer(Version, AttributeSource, Reader)
instead. This will be removed in Lucene 4.0. - ArabicLetterTokenizer(Version, Reader) - Constructor for class org.apache.lucene.analysis.ar.ArabicLetterTokenizer
-
Deprecated.Construct a new ArabicLetterTokenizer.
- ArabicLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.ar.ArabicLetterTokenizer
-
Deprecated.Construct a new ArabicLetterTokenizer using a given
AttributeSource.AttributeFactory
. - ArabicLetterTokenizer(Version, AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.ar.ArabicLetterTokenizer
-
Deprecated.Construct a new ArabicLetterTokenizer using a given
AttributeSource
. - ArabicNormalizationFilter - Class in org.apache.lucene.analysis.ar
-
A
TokenFilter
that appliesArabicNormalizer
to normalize the orthography. - ArabicNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ar.ArabicNormalizationFilter
- ArabicNormalizer - Class in org.apache.lucene.analysis.ar
-
Normalizer for Arabic.
- ArabicNormalizer() - Constructor for class org.apache.lucene.analysis.ar.ArabicNormalizer
- ArabicStemFilter - Class in org.apache.lucene.analysis.ar
-
A
TokenFilter
that appliesArabicStemmer
to stem Arabic words.. - ArabicStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ar.ArabicStemFilter
- ArabicStemmer - Class in org.apache.lucene.analysis.ar
-
Stemmer for Arabic.
- ArabicStemmer() - Constructor for class org.apache.lucene.analysis.ar.ArabicStemmer
- ArmenianAnalyzer - Class in org.apache.lucene.analysis.hy
-
Analyzer
for Armenian. - ArmenianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Builds an analyzer with the default stop words:
ArmenianAnalyzer.DEFAULT_STOPWORD_FILE
. - ArmenianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Builds an analyzer with the given stop words.
- ArmenianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Builds an analyzer with the given stop words.
- ArmenianStemmer - Class in org.tartarus.snowball.ext
-
This class was automatically generated by a Snowball to Java compiler It implements the stemming algorithm defined by a snowball script.
- ArmenianStemmer() - Constructor for class org.tartarus.snowball.ext.ArmenianStemmer
- assign_to(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
B
- BACK - org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
-
Get the n-gram from the end of the input
- BACK - org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
-
Get the n-gram from the end of the input
- balance() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Balance the tree for best search performance
- BasqueAnalyzer - Class in org.apache.lucene.analysis.eu
-
Analyzer
for Basque. - BasqueAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Builds an analyzer with the default stop words:
BasqueAnalyzer.DEFAULT_STOPWORD_FILE
. - BasqueAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Builds an analyzer with the given stop words.
- BasqueAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Builds an analyzer with the given stop words.
- BasqueStemmer - Class in org.tartarus.snowball.ext
-
This class was automatically generated by a Snowball to Java compiler It implements the stemming algorithm defined by a snowball script.
- BasqueStemmer() - Constructor for class org.tartarus.snowball.ext.BasqueStemmer
- BEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- BLOCK_SIZE - Static variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- BOLD - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- BOLD_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- BOLD_ITALICS - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- BOLD_ITALICS_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- BOTH - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Output the both the untokenized token and the splits
- bra - Variable in class org.tartarus.snowball.SnowballProgram
- BRAZILIAN_STOP_WORDS - Static variable in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Deprecated.use
BrazilianAnalyzer.getDefaultStopSet()
instead - BrazilianAnalyzer - Class in org.apache.lucene.analysis.br
-
Analyzer
for Brazilian Portuguese language. - BrazilianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Builds an analyzer with the default stop words (
BrazilianAnalyzer.getDefaultStopSet()
). - BrazilianAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Deprecated.use
BrazilianAnalyzer(Version, Set)
instead - BrazilianAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Deprecated.use
BrazilianAnalyzer(Version, Set)
instead - BrazilianAnalyzer(Version, Map<?, ?>) - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Deprecated.use
BrazilianAnalyzer(Version, Set)
instead - BrazilianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Builds an analyzer with the given stop words
- BrazilianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Builds an analyzer with the given stop words and stemming exclusion words
- BrazilianStemFilter - Class in org.apache.lucene.analysis.br
-
A
TokenFilter
that appliesBrazilianStemmer
. - BrazilianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.br.BrazilianStemFilter
-
Creates a new BrazilianStemFilter
- BrazilianStemFilter(TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.br.BrazilianStemFilter
-
Deprecated.use
KeywordAttribute
withKeywordMarkerFilter
instead. - BrazilianStemmer - Class in org.apache.lucene.analysis.br
-
A stemmer for Brazilian Portuguese words.
- BrazilianStemmer() - Constructor for class org.apache.lucene.analysis.br.BrazilianStemmer
- buf - Variable in class org.apache.lucene.analysis.util.OpenStringBuilder
- build() - Method in class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
Builds an
SynonymMap
and returns it. - Builder(boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
If dedup is true then identical rules (same input, same output) will be added only once.
- BulgarianAnalyzer - Class in org.apache.lucene.analysis.bg
-
Analyzer
for Bulgarian. - BulgarianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Builds an analyzer with the default stop words:
BulgarianAnalyzer.DEFAULT_STOPWORD_FILE
. - BulgarianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Builds an analyzer with the given stop words.
- BulgarianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Builds an analyzer with the given stop words and a stem exclusion set.
- BulgarianStemFilter - Class in org.apache.lucene.analysis.bg
-
A
TokenFilter
that appliesBulgarianStemmer
to stem Bulgarian words. - BulgarianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.bg.BulgarianStemFilter
- BulgarianStemmer - Class in org.apache.lucene.analysis.bg
-
Light Stemmer for Bulgarian.
- BulgarianStemmer() - Constructor for class org.apache.lucene.analysis.bg.BulgarianStemmer
- ByteVector - Class in org.apache.lucene.analysis.compound.hyphenation
-
This class implements a simple byte vector with access to the underlying array.
- ByteVector() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- ByteVector(byte[]) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- ByteVector(byte[], int) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- ByteVector(int) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.ByteVector
C
- calculateShingleWeight(Token, List<Token>, int, List<ShingleMatrixFilter.Matrix.Column.Row>, List<Token>) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.Evaluates the new shingle token weight.
- capacity() - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
returns current capacity of array
- capacity() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
returns current capacity of array
- capacity() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- CatalanAnalyzer - Class in org.apache.lucene.analysis.ca
-
Analyzer
for Catalan. - CatalanAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Builds an analyzer with the default stop words:
CatalanAnalyzer.DEFAULT_STOPWORD_FILE
. - CatalanAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Builds an analyzer with the given stop words.
- CatalanAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Builds an analyzer with the given stop words.
- CatalanStemmer - Class in org.tartarus.snowball.ext
-
This class was automatically generated by a Snowball to Java compiler It implements the stemming algorithm defined by a snowball script.
- CatalanStemmer() - Constructor for class org.tartarus.snowball.ext.CatalanStemmer
- CATEGORY - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CATEGORY_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- characters(char[], int, int) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- CharArrayIterator - Class in org.apache.lucene.analysis.util
-
A CharacterIterator used internally for use with
BreakIterator
- CharArrayIterator() - Constructor for class org.apache.lucene.analysis.util.CharArrayIterator
- charAt(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- charset - Variable in class org.apache.lucene.analysis.payloads.IdentityEncoder
- charsetName - Variable in class org.apache.lucene.analysis.payloads.IdentityEncoder
-
Deprecated.This field is no longer used. Use
IdentityEncoder.charset
instead. - CharVector - Class in org.apache.lucene.analysis.compound.hyphenation
-
This class implements a simple char vector with access to the underlying array.
- CharVector() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.CharVector
- CharVector(char[]) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.CharVector
- CharVector(char[], int) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.CharVector
- CharVector(int) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.CharVector
- checkCondition(CharSequence) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Checks whether the given text matches the conditional pattern on this affix
- ChineseAnalyzer - Class in org.apache.lucene.analysis.cn
-
Deprecated.Use
StandardAnalyzer
instead, which has the same functionality. This analyzer will be removed in Lucene 5.0 - ChineseAnalyzer() - Constructor for class org.apache.lucene.analysis.cn.ChineseAnalyzer
-
Deprecated.
- ChineseFilter - Class in org.apache.lucene.analysis.cn
-
Deprecated.Use
StopFilter
instead, which has the same functionality. This filter will be removed in Lucene 5.0 - ChineseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.cn.ChineseFilter
-
Deprecated.
- ChineseTokenizer - Class in org.apache.lucene.analysis.cn
-
Deprecated.Use
StandardTokenizer
instead, which has the same functionality. This filter will be removed in Lucene 5.0 - ChineseTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- ChineseTokenizer(AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- ChineseTokenizer(AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- CITATION - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CITATION_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CJ_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CJKAnalyzer - Class in org.apache.lucene.analysis.cjk
-
An
Analyzer
that tokenizes text withStandardTokenizer
, normalizes content withCJKWidthFilter
, folds case withLowerCaseFilter
, forms bigrams of CJK withCJKBigramFilter
, and filters stopwords withStopFilter
- CJKAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
Builds an analyzer which removes words in
CJKAnalyzer.getDefaultStopSet()
. - CJKAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
Deprecated.use
CJKAnalyzer(Version, Set)
instead - CJKAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
Builds an analyzer with the given stop words
- CJKBigramFilter - Class in org.apache.lucene.analysis.cjk
-
Forms bigrams of CJK terms that are generated from StandardTokenizer or ICUTokenizer.
- CJKBigramFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.cjk.CJKBigramFilter
- CJKBigramFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
Create a new CJKBigramFilter, specifying which writing systems should be bigrammed.
- CJKTokenizer - Class in org.apache.lucene.analysis.cjk
-
Deprecated.Use StandardTokenizer, CJKWidthFilter, CJKBigramFilter, and LowerCaseFilter instead.
- CJKTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.Construct a token stream processing the given input.
- CJKTokenizer(AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- CJKTokenizer(AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- CJKWidthFilter - Class in org.apache.lucene.analysis.cjk
-
A
TokenFilter
that normalizes CJK width differences: Folds fullwidth ASCII variants into the equivalent basic latin Folds halfwidth Katakana variants into the equivalent kana - CJKWidthFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.cjk.CJKWidthFilter
- classmap - Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
This map stores the character classes
- clear() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
Reset Vector but don't resize or clear elements
- clone() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- clone() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- clone() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- close() - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
- close() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
- close() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- Column() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- Column(Token) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- COMPANY_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- CompoundToken(int, int) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
-
Construct the compound token based on a slice of the current
CompoundWordTokenFilterBase.termAtt
. - CompoundWordTokenFilterBase - Class in org.apache.lucene.analysis.compound
-
Base class for decomposition token filters.
- CompoundWordTokenFilterBase(TokenStream, String[]) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Deprecated.
- CompoundWordTokenFilterBase(TokenStream, String[], boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Deprecated.
- CompoundWordTokenFilterBase(TokenStream, String[], int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Deprecated.
- CompoundWordTokenFilterBase(TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Deprecated.
- CompoundWordTokenFilterBase(TokenStream, Set<?>, boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Deprecated.
- CompoundWordTokenFilterBase(TokenStream, Set<?>, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Deprecated.
- CompoundWordTokenFilterBase(Version, TokenStream, String[]) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase(Version, TokenStream, String[], boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase(Version, TokenStream, String[], int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase(Version, TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase(Version, TokenStream, Set<?>, boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase(Version, TokenStream, Set<?>, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- CompoundWordTokenFilterBase.CompoundToken - Class in org.apache.lucene.analysis.compound
-
Helper class to hold decompounded token information
- copy_from(SnowballProgram) - Method in class org.tartarus.snowball.SnowballProgram
- createComponents(String, Reader) - Method in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.cjk.CJKAnalyzer
- createComponents(String, Reader) - Method in class org.apache.lucene.analysis.cn.ChineseAnalyzer
-
Deprecated.Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.da.DanishAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.el.GreekAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
Creates a token stream that tokenizes all the text in the given Reader; This implementation forwards to
tokenStream(String, Reader, String)
and is less efficient thantokenStream(String, Reader, String)
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Returns a (possibly reused)
TokenStream
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.th.ThaiAnalyzer
-
Creates
ReusableAnalyzerBase.TokenStreamComponents
used to tokenize all the text in the providedReader
. - createComponents(String, Reader) - Method in class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Creates a
ReusableAnalyzerBase.TokenStreamComponents
which tokenizes all the text in the providedReader
. - createComponents(String, Reader, String) - Method in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
Creates a token stream that tokenizes the given string into token terms (aka words).
- current() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- cursor - Variable in class org.tartarus.snowball.SnowballProgram
- CZECH_STOP_WORDS - Static variable in class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Deprecated.use
CzechAnalyzer.getDefaultStopSet()
instead - CzechAnalyzer - Class in org.apache.lucene.analysis.cz
-
Analyzer
for Czech language. - CzechAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Builds an analyzer with the default stop words (
CzechAnalyzer.getDefaultStopSet()
). - CzechAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Deprecated.use
CzechAnalyzer(Version, Set)
instead - CzechAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Deprecated.use
CzechAnalyzer(Version, Set)
instead - CzechAnalyzer(Version, HashSet<?>) - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Deprecated.use
CzechAnalyzer(Version, Set)
instead - CzechAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Builds an analyzer with the given stop words.
- CzechAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Builds an analyzer with the given stop words and a set of work to be excluded from the
CzechStemFilter
. - CzechStemFilter - Class in org.apache.lucene.analysis.cz
-
A
TokenFilter
that appliesCzechStemmer
to stem Czech words. - CzechStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.cz.CzechStemFilter
- CzechStemmer - Class in org.apache.lucene.analysis.cz
-
Light Stemmer for Czech.
- CzechStemmer() - Constructor for class org.apache.lucene.analysis.cz.CzechStemmer
D
- DAMMA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- DAMMATAN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- DanishAnalyzer - Class in org.apache.lucene.analysis.da
-
Analyzer
for Danish. - DanishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.da.DanishAnalyzer
-
Builds an analyzer with the default stop words:
DanishAnalyzer.DEFAULT_STOPWORD_FILE
. - DanishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.da.DanishAnalyzer
-
Builds an analyzer with the given stop words.
- DanishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.da.DanishAnalyzer
-
Builds an analyzer with the given stop words.
- DanishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- DanishStemmer() - Constructor for class org.tartarus.snowball.ext.DanishStemmer
- DATE_TYPE - Static variable in class org.apache.lucene.analysis.sinks.DateRecognizerSinkFilter
- dateFormat - Variable in class org.apache.lucene.analysis.sinks.DateRecognizerSinkFilter
- DateRecognizerSinkFilter - Class in org.apache.lucene.analysis.sinks
-
Attempts to parse the
CharTermAttributeImpl.termBuffer()
as a Date using aDateFormat
. - DateRecognizerSinkFilter() - Constructor for class org.apache.lucene.analysis.sinks.DateRecognizerSinkFilter
-
Uses
DateFormat.getDateInstance()
as theDateFormat
object. - DateRecognizerSinkFilter(DateFormat) - Constructor for class org.apache.lucene.analysis.sinks.DateRecognizerSinkFilter
- DBBI_AVAILABLE - Static variable in class org.apache.lucene.analysis.th.ThaiWordFilter
-
True if the JRE supports a working dictionary-based breakiterator for Thai.
- decodeFloat(byte[]) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- decodeFloat(byte[], int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
-
Decode the payload that was encoded using
PayloadHelper.encodeFloat(float)
. - decodeInt(byte[], int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- decompose() - Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Decomposes the current
CompoundWordTokenFilterBase.termAtt
and placesCompoundWordTokenFilterBase.CompoundToken
instances in theCompoundWordTokenFilterBase.tokens
list. - decompose() - Method in class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
- decompose() - Method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
- DEFAULT_ANALYZER - Static variable in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
A lower-casing word analyzer with English stop words (can be shared freely across threads without harm); global per class loader.
- DEFAULT_DELIMITER - Static variable in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- DEFAULT_DELIMITER - Static variable in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- DEFAULT_DELIMITER - Static variable in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilter
- DEFAULT_MAX_GRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- DEFAULT_MAX_GRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
- DEFAULT_MAX_NGRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.NGramTokenFilter
- DEFAULT_MAX_NGRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.NGramTokenizer
- DEFAULT_MAX_SHINGLE_SIZE - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
default maximum shingle size is 2.
- DEFAULT_MAX_SUBWORD_SIZE - Static variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
The default for maximal length of subwords that get propagated to the output of this filter
- DEFAULT_MIN_GRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- DEFAULT_MIN_GRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
- DEFAULT_MIN_NGRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.NGramTokenFilter
- DEFAULT_MIN_NGRAM_SIZE - Static variable in class org.apache.lucene.analysis.ngram.NGramTokenizer
- DEFAULT_MIN_SHINGLE_SIZE - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
default minimum shingle size is 2.
- DEFAULT_MIN_SUBWORD_SIZE - Static variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
The default for minimal length of subwords that get propagated to the output of this filter
- DEFAULT_MIN_WORD_SIZE - Static variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
The default for minimal word length that gets decomposed
- DEFAULT_SIDE - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- DEFAULT_SIDE - Static variable in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
- DEFAULT_SKIP - Static variable in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- DEFAULT_SKIP - Static variable in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
File containing default Arabic stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
File containing default Bulgarian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
File containing default Brazilian Portuguese stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
File containing default Catalan stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.cz.CzechAnalyzer
-
File containing default Czech stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.da.DanishAnalyzer
-
File containing default Danish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.de.GermanAnalyzer
-
File containing default German stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.el.GreekAnalyzer
-
File containing default Greek stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.es.SpanishAnalyzer
-
File containing default Spanish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
File containing default Basque stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
File containing default Persian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
File containing default Italian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
File containing default French stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ga.IrishAnalyzer
-
File containing default Irish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
File containing default Galician stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.hi.HindiAnalyzer
-
File containing default Hindi stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
File containing default Hungarian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
File containing default Armenian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
File containing default Indonesian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.it.ItalianAnalyzer
-
File containing default Italian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
File containing default Latvian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
File containing default Dutch stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
File containing default Norwegian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
File containing default Portuguese stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
File containing default Romanian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.ru.RussianAnalyzer
-
File containing default Russian stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
File containing default Swedish stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.th.ThaiAnalyzer
-
File containing default Thai stopwords.
- DEFAULT_STOPWORD_FILE - Static variable in class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
File containing default Turkish stopwords.
- DEFAULT_TOKEN_TYPE - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
default token type attribute value is "shingle"
- defaultMaxDocFreqPercent - Static variable in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- defaultSettingsCodec - Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- defaultSpacerCharacter - Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- delete(char[], int, int) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Delete a character in-place
- deleteN(char[], int, int, int) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Delete n characters in-place
- DelimitedPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
-
Characters before the delimiter are the "token", those after are the payload.
- DelimitedPayloadTokenFilter(TokenStream, char, PayloadEncoder) - Constructor for class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilter
- dictionary - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- DictionaryCompoundWordTokenFilter - Class in org.apache.lucene.analysis.compound
-
A
TokenFilter
that decomposes compound words found in many Germanic languages. - DictionaryCompoundWordTokenFilter(TokenStream, String[]) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Deprecated.
- DictionaryCompoundWordTokenFilter(TokenStream, String[], int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Deprecated.
- DictionaryCompoundWordTokenFilter(TokenStream, Set) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Deprecated.
- DictionaryCompoundWordTokenFilter(TokenStream, Set, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Deprecated.
- DictionaryCompoundWordTokenFilter(Version, TokenStream, String[]) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Deprecated.Use the constructors taking
Set
- DictionaryCompoundWordTokenFilter(Version, TokenStream, String[], int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Deprecated.Use the constructors taking
Set
- DictionaryCompoundWordTokenFilter(Version, TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Creates a new
DictionaryCompoundWordTokenFilter
- DictionaryCompoundWordTokenFilter(Version, TokenStream, Set<?>, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.DictionaryCompoundWordTokenFilter
-
Creates a new
DictionaryCompoundWordTokenFilter
- DOTLESS_YEH - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- DOUBLE_TYPE - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
when we emit a bigram, its then marked as this type
- DUTCH_STOP_WORDS - Static variable in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.use
DutchAnalyzer.getDefaultStopSet()
instead - DutchAnalyzer - Class in org.apache.lucene.analysis.nl
-
Analyzer
for Dutch language. - DutchAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Builds an analyzer with the default stop words (
DutchAnalyzer.getDefaultStopSet()
) and a few default entries for the stem exclusion table. - DutchAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.use
DutchAnalyzer(Version, Set)
instead - DutchAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.use
DutchAnalyzer(Version, Set)
instead - DutchAnalyzer(Version, HashSet<?>) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.use
DutchAnalyzer(Version, Set)
instead - DutchAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
- DutchAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
- DutchAnalyzer(Version, Set<?>, Set<?>, CharArrayMap<String>) - Constructor for class org.apache.lucene.analysis.nl.DutchAnalyzer
- DutchStemFilter - Class in org.apache.lucene.analysis.nl
-
Deprecated.Use
SnowballFilter
withDutchStemmer
instead, which has the same functionality. This filter will be removed in Lucene 5.0 - DutchStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.
- DutchStemFilter(TokenStream, Map<?, ?>) - Constructor for class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.
- DutchStemFilter(TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.use
KeywordAttribute
withKeywordMarkerFilter
instead. - DutchStemFilter(TokenStream, Set<?>, Map<?, ?>) - Constructor for class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.use
KeywordAttribute
withKeywordMarkerFilter
instead. - DutchStemmer - Class in org.apache.lucene.analysis.nl
-
Deprecated.Use
DutchStemmer
instead, which has the same functionality. This filter will be removed in Lucene 5.0 - DutchStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- DutchStemmer() - Constructor for class org.apache.lucene.analysis.nl.DutchStemmer
-
Deprecated.
- DutchStemmer() - Constructor for class org.tartarus.snowball.ext.DutchStemmer
E
- EdgeNGramTokenFilter - Class in org.apache.lucene.analysis.ngram
-
Tokenizes the given token into n-grams of given size(s).
- EdgeNGramTokenFilter(TokenStream, String, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
-
Creates EdgeNGramTokenFilter that can generate n-grams in the sizes of the given range
- EdgeNGramTokenFilter(TokenStream, EdgeNGramTokenFilter.Side, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
-
Creates EdgeNGramTokenFilter that can generate n-grams in the sizes of the given range
- EdgeNGramTokenFilter.Side - Enum in org.apache.lucene.analysis.ngram
-
Specifies which side of the input the n-gram should be generated from
- EdgeNGramTokenizer - Class in org.apache.lucene.analysis.ngram
-
Tokenizes the input from an edge into n-grams of given size(s).
- EdgeNGramTokenizer(Reader, String, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Creates EdgeNGramTokenizer that can generate n-grams in the sizes of the given range
- EdgeNGramTokenizer(Reader, EdgeNGramTokenizer.Side, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Creates EdgeNGramTokenizer that can generate n-grams in the sizes of the given range
- EdgeNGramTokenizer(AttributeSource.AttributeFactory, Reader, String, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Creates EdgeNGramTokenizer that can generate n-grams in the sizes of the given range
- EdgeNGramTokenizer(AttributeSource.AttributeFactory, Reader, EdgeNGramTokenizer.Side, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Creates EdgeNGramTokenizer that can generate n-grams in the sizes of the given range
- EdgeNGramTokenizer(AttributeSource, Reader, String, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Creates EdgeNGramTokenizer that can generate n-grams in the sizes of the given range
- EdgeNGramTokenizer(AttributeSource, Reader, EdgeNGramTokenizer.Side, int, int) - Constructor for class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Creates EdgeNGramTokenizer that can generate n-grams in the sizes of the given range
- EdgeNGramTokenizer.Side - Enum in org.apache.lucene.analysis.ngram
-
Specifies which side of the input the n-gram should be generated from
- ElisionFilter - Class in org.apache.lucene.analysis.fr
-
Removes elisions from a
TokenStream
. - ElisionFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fr.ElisionFilter
-
Deprecated.use
ElisionFilter(Version, TokenStream)
instead - ElisionFilter(TokenStream, String[]) - Constructor for class org.apache.lucene.analysis.fr.ElisionFilter
-
Deprecated.use
ElisionFilter(Version, TokenStream, Set)
instead - ElisionFilter(TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.fr.ElisionFilter
-
Deprecated.use
ElisionFilter(Version, TokenStream, Set)
instead - ElisionFilter(Version, TokenStream) - Constructor for class org.apache.lucene.analysis.fr.ElisionFilter
-
Constructs an elision filter with standard stop words
- ElisionFilter(Version, TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.fr.ElisionFilter
-
Constructs an elision filter with a Set of stop words
- EMAIL_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- EmptyTokenStream - Class in org.apache.lucene.analysis.miscellaneous
-
An always exhausted token stream.
- EmptyTokenStream() - Constructor for class org.apache.lucene.analysis.miscellaneous.EmptyTokenStream
- encode(char[]) - Method in class org.apache.lucene.analysis.payloads.AbstractEncoder
- encode(char[]) - Method in interface org.apache.lucene.analysis.payloads.PayloadEncoder
- encode(char[], int, int) - Method in class org.apache.lucene.analysis.payloads.FloatEncoder
- encode(char[], int, int) - Method in class org.apache.lucene.analysis.payloads.IdentityEncoder
- encode(char[], int, int) - Method in class org.apache.lucene.analysis.payloads.IntegerEncoder
- encode(char[], int, int) - Method in interface org.apache.lucene.analysis.payloads.PayloadEncoder
-
Convert a char array to a
Payload
- encodeFloat(float) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- encodeFloat(float, byte[], int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- encodeInt(int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- encodeInt(int, byte[], int) - Static method in class org.apache.lucene.analysis.payloads.PayloadHelper
- end() - Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- end() - Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- end() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
- end() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- end() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
- end() - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
- end() - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- end() - Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- end() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- endElement(String, String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- endOffset - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
- endsWith(char[], int, char[]) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Returns true if the character array ends with the suffix.
- endsWith(char[], int, String) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Returns true if the character array ends with the suffix.
- EnglishAnalyzer - Class in org.apache.lucene.analysis.en
-
Analyzer
for English. - EnglishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Builds an analyzer with the default stop words:
EnglishAnalyzer.getDefaultStopSet()
. - EnglishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Builds an analyzer with the given stop words.
- EnglishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Builds an analyzer with the given stop words.
- EnglishMinimalStemFilter - Class in org.apache.lucene.analysis.en
-
A
TokenFilter
that appliesEnglishMinimalStemmer
to stem English words. - EnglishMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.en.EnglishMinimalStemFilter
- EnglishMinimalStemmer - Class in org.apache.lucene.analysis.en
-
Minimal plural stemmer for English.
- EnglishMinimalStemmer() - Constructor for class org.apache.lucene.analysis.en.EnglishMinimalStemmer
- EnglishPossessiveFilter - Class in org.apache.lucene.analysis.en
-
TokenFilter that removes possessives (trailing 's) from words.
- EnglishPossessiveFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.en.EnglishPossessiveFilter
-
Deprecated.Use
EnglishPossessiveFilter(Version, TokenStream)
instead. - EnglishPossessiveFilter(Version, TokenStream) - Constructor for class org.apache.lucene.analysis.en.EnglishPossessiveFilter
- EnglishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- EnglishStemmer() - Constructor for class org.tartarus.snowball.ext.EnglishStemmer
- eq - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Pointer to equal branch and to data when this node is a string terminator.
- eq_s(int, CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- eq_s(int, String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- eq_s_b(int, CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- eq_s_b(int, String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- eq_v(CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- eq_v(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- eq_v_b(CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- eq_v_b(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- equals(Object) - Method in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in class org.tartarus.snowball.ext.ArmenianStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.BasqueStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.CatalanStemmer
- equals(Object) - Method in class org.tartarus.snowball.ext.IrishStemmer
- exceptions - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
- exceptions - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
- EXTENDED_ANALYZER - Static variable in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
A lower-casing word analyzer with extended English stop words (can be shared freely across threads without harm); global per class loader.
- EXTERNAL_LINK - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- EXTERNAL_LINK_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- EXTERNAL_LINK_URL - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- EXTERNAL_LINK_URL_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
F
- FARSI_YEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- FATHA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- FATHATAN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- FEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- FILLER_TOKEN - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
filler token for when positionIncrement is more than 1
- find(char[], int) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- find(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- find_among(Among[], int) - Method in class org.tartarus.snowball.SnowballProgram
- find_among_b(Among[], int) - Method in class org.tartarus.snowball.SnowballProgram
- findPattern(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- FinnishAnalyzer - Class in org.apache.lucene.analysis.fi
-
Analyzer
for Finnish. - FinnishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Builds an analyzer with the default stop words:
FinnishAnalyzer.DEFAULT_STOPWORD_FILE
. - FinnishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Builds an analyzer with the given stop words.
- FinnishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Builds an analyzer with the given stop words.
- FinnishLightStemFilter - Class in org.apache.lucene.analysis.fi
-
A
TokenFilter
that appliesFinnishLightStemmer
to stem Finnish words. - FinnishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fi.FinnishLightStemFilter
- FinnishLightStemmer - Class in org.apache.lucene.analysis.fi
-
Light Stemmer for Finnish.
- FinnishLightStemmer() - Constructor for class org.apache.lucene.analysis.fi.FinnishLightStemmer
- FinnishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- FinnishStemmer() - Constructor for class org.tartarus.snowball.ext.FinnishStemmer
- first() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- FloatEncoder - Class in org.apache.lucene.analysis.payloads
-
Encode a character array Float as a
Payload
. - FloatEncoder() - Constructor for class org.apache.lucene.analysis.payloads.FloatEncoder
- flush() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- freenode - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- FRENCH_STOP_WORDS - Static variable in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Deprecated.use
FrenchAnalyzer.getDefaultStopSet()
instead - FrenchAnalyzer - Class in org.apache.lucene.analysis.fr
-
Analyzer
for French language. - FrenchAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Builds an analyzer with the default stop words (
FrenchAnalyzer.getDefaultStopSet()
). - FrenchAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Deprecated.use
FrenchAnalyzer(Version, Set)
instead - FrenchAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Deprecated.use
FrenchAnalyzer(Version, Set)
instead - FrenchAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Builds an analyzer with the given stop words
- FrenchAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Builds an analyzer with the given stop words
- FrenchLightStemFilter - Class in org.apache.lucene.analysis.fr
-
A
TokenFilter
that appliesFrenchLightStemmer
to stem French words. - FrenchLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fr.FrenchLightStemFilter
- FrenchLightStemmer - Class in org.apache.lucene.analysis.fr
-
Light Stemmer for French.
- FrenchLightStemmer() - Constructor for class org.apache.lucene.analysis.fr.FrenchLightStemmer
- FrenchMinimalStemFilter - Class in org.apache.lucene.analysis.fr
-
A
TokenFilter
that appliesFrenchMinimalStemmer
to stem French words. - FrenchMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fr.FrenchMinimalStemFilter
- FrenchMinimalStemmer - Class in org.apache.lucene.analysis.fr
-
Light Stemmer for French.
- FrenchMinimalStemmer() - Constructor for class org.apache.lucene.analysis.fr.FrenchMinimalStemmer
- FrenchStemFilter - Class in org.apache.lucene.analysis.fr
-
Deprecated.Use
SnowballFilter
withFrenchStemmer
instead, which has the same functionality. This filter will be removed in Lucene 5.0 - FrenchStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.
- FrenchStemFilter(TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.use
KeywordAttribute
withKeywordMarkerFilter
instead. - FrenchStemmer - Class in org.apache.lucene.analysis.fr
-
Deprecated.Use
FrenchStemmer
instead, which has the same functionality. This filter will be removed in Lucene 5.0 - FrenchStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- FrenchStemmer() - Constructor for class org.apache.lucene.analysis.fr.FrenchStemmer
-
Deprecated.
- FrenchStemmer() - Constructor for class org.tartarus.snowball.ext.FrenchStemmer
- FRONT - org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
-
Get the n-gram from the front of the input
- FRONT - org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
-
Get the n-gram from the front of the input
- fst - Variable in class org.apache.lucene.analysis.synonym.SynonymMap
-
map>
G
- GalicianAnalyzer - Class in org.apache.lucene.analysis.gl
-
Analyzer
for Galician. - GalicianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Builds an analyzer with the default stop words:
GalicianAnalyzer.DEFAULT_STOPWORD_FILE
. - GalicianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Builds an analyzer with the given stop words.
- GalicianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Builds an analyzer with the given stop words.
- GalicianMinimalStemFilter - Class in org.apache.lucene.analysis.gl
-
A
TokenFilter
that appliesGalicianMinimalStemmer
to stem Galician words. - GalicianMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.gl.GalicianMinimalStemFilter
- GalicianMinimalStemmer - Class in org.apache.lucene.analysis.gl
-
Minimal Stemmer for Galician
- GalicianMinimalStemmer() - Constructor for class org.apache.lucene.analysis.gl.GalicianMinimalStemmer
- GalicianStemFilter - Class in org.apache.lucene.analysis.gl
-
A
TokenFilter
that appliesGalicianStemmer
to stem Galician words. - GalicianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.gl.GalicianStemFilter
- GalicianStemmer - Class in org.apache.lucene.analysis.gl
-
Galician stemmer implementing "Regras do lematizador para o galego".
- GalicianStemmer() - Constructor for class org.apache.lucene.analysis.gl.GalicianStemmer
- GERMAN_STOP_WORDS - Static variable in class org.apache.lucene.analysis.de.GermanAnalyzer
-
Deprecated.use
GermanAnalyzer.getDefaultStopSet()
instead - German2Stemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- German2Stemmer() - Constructor for class org.tartarus.snowball.ext.German2Stemmer
- GermanAnalyzer - Class in org.apache.lucene.analysis.de
-
Analyzer
for German language. - GermanAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Builds an analyzer with the default stop words:
GermanAnalyzer.getDefaultStopSet()
. - GermanAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Deprecated.
- GermanAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Deprecated.
- GermanAnalyzer(Version, Map<?, ?>) - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Deprecated.
- GermanAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Builds an analyzer with the given stop words
- GermanAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.de.GermanAnalyzer
-
Builds an analyzer with the given stop words
- GermanLightStemFilter - Class in org.apache.lucene.analysis.de
-
A
TokenFilter
that appliesGermanLightStemmer
to stem German words. - GermanLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.de.GermanLightStemFilter
- GermanLightStemmer - Class in org.apache.lucene.analysis.de
-
Light Stemmer for German.
- GermanLightStemmer() - Constructor for class org.apache.lucene.analysis.de.GermanLightStemmer
- GermanMinimalStemFilter - Class in org.apache.lucene.analysis.de
-
A
TokenFilter
that appliesGermanMinimalStemmer
to stem German words. - GermanMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.de.GermanMinimalStemFilter
- GermanMinimalStemmer - Class in org.apache.lucene.analysis.de
-
Minimal Stemmer for German.
- GermanMinimalStemmer() - Constructor for class org.apache.lucene.analysis.de.GermanMinimalStemmer
- GermanNormalizationFilter - Class in org.apache.lucene.analysis.de
-
Normalizes German characters according to the heuristics of the German2 snowball algorithm.
- GermanNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.de.GermanNormalizationFilter
- GermanStemFilter - Class in org.apache.lucene.analysis.de
-
A
TokenFilter
that stems German words. - GermanStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.de.GermanStemFilter
-
Creates a
GermanStemFilter
instance - GermanStemFilter(TokenStream, Set<?>) - Constructor for class org.apache.lucene.analysis.de.GermanStemFilter
-
Deprecated.use
KeywordAttribute
withKeywordMarkerFilter
instead. - GermanStemmer - Class in org.apache.lucene.analysis.de
-
A stemmer for German words.
- GermanStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- GermanStemmer() - Constructor for class org.apache.lucene.analysis.de.GermanStemmer
- GermanStemmer() - Constructor for class org.tartarus.snowball.ext.GermanStemmer
- get(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- get(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- getAppend() - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Returns the append defined for the affix
- getAppendFlags() - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Returns the flags defined for the affix append
- getArray() - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- getArray() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- getArray() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- getBeginIndex() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getColumn() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- getColumns() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
-
Deprecated.
- getCondition() - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Returns the condition that must be met before the affix can be applied
- getCurrent() - Method in class org.tartarus.snowball.SnowballProgram
-
Get the current string.
- getCurrentBuffer() - Method in class org.tartarus.snowball.SnowballProgram
-
Get the current buffer containing the stem.
- getCurrentBufferLength() - Method in class org.tartarus.snowball.SnowballProgram
-
Get the valid length of the character array in
SnowballProgram.getCurrentBuffer()
. - getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ca.CatalanAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Returns a set of default Czech-stopwords
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.da.DanishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
Returns a set of default German-stopwords
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.el.GreekAnalyzer
-
Returns a set of default Greek-stopwords
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.en.EnglishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.eu.BasqueAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.fi.FinnishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.gl.GalicianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.hy.ArmenianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Returns an unmodifiable instance of the default stop-words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.th.ThaiAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getDefaultStopSet() - Static method in class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Returns an unmodifiable instance of the default stop words set.
- getEndIndex() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getExceptionWord(ArrayList<?>) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- getFlag() - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Returns the affix flag
- getFlags() - Method in class org.apache.lucene.analysis.hunspell.HunspellWord
-
Returns the flags associated with the word
- getHyphenationPoints() - Method in class org.apache.lucene.analysis.compound.hyphenation.Hyphenation
- getHyphenationTree(File) - Static method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a hyphenator tree
- getHyphenationTree(Reader) - Static method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Deprecated.Don't use Readers with fixed charset to load XML files, unless programatically created. Use
HyphenationCompoundWordTokenFilter.getHyphenationTree(InputSource)
instead, where you can supply default charset and input stream, if you like. - getHyphenationTree(String) - Static method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a hyphenator tree
- getHyphenationTree(InputSource) - Static method in class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a hyphenator tree
- getIndex() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- getIndex() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- getIndex() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenPositioner
-
Deprecated.
- getIndex() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getInterletterValues(String) - Static method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- getLabel() - Method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
- getLabel() - Method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
- getLength() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getMatrix() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- getMatrix() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- getMaximumShingleSize() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- getMaxShingleSize() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
The max shingle (token ngram) size
- getMinimumShingleSize() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- getMinShingleSize() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
The min shingle (token ngram) size
- getPattern(String) - Static method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- getPrefix() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- getPrefixes() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer.Stem
-
Returns the list of prefixes used to generate the stem
- getRows() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- getSide(String) - Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
- getSide(String) - Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
- getSpacerCharacter() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- getStart() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getStem() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer.Stem
-
Returns the actual word stem itself
- getStemLength() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer.Stem
- getStemString() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer.Stem
- getStopWords() - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Provides information on which stop words have been identified for all fields
- getStopWords(String) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Provides information on which stop words have been identified for a field
- getStrip() - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Returns the stripping characters defined for the affix
- getSuffix() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- getSuffixes() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer.Stem
-
Returns the list of suffixes used to generate the stem
- getText() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- getToken() - Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
- getTokenPositioner(Token) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- getTokenPositioner(Token) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.
- getTokenPositioner(Token) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.Retrieves information on how a
Token
is to be inserted to aShingleMatrixFilter.Matrix
. - getTokenPositioner(Token) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- getTokens() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- getTokenSeparator() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- getValue() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
- getValues(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- getVersion() - Method in class org.apache.lucene.analysis.hunspell.HunspellDictionary
- getWeight(Token) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- getWeight(Token) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.Returns a 32 bit float from the payload, or 1f it null.
- getWeight(Token) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.Have this method return 1f in order to 'disable' weights.
- getWeight(Token) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- GreekAnalyzer - Class in org.apache.lucene.analysis.el
-
Analyzer
for the Greek language. - GreekAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.el.GreekAnalyzer
-
Builds an analyzer with the default stop words.
- GreekAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.el.GreekAnalyzer
-
Deprecated.use
GreekAnalyzer(Version, Set)
instead - GreekAnalyzer(Version, Map<?, ?>) - Constructor for class org.apache.lucene.analysis.el.GreekAnalyzer
-
Deprecated.use
GreekAnalyzer(Version, Set)
instead - GreekAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.el.GreekAnalyzer
-
Builds an analyzer with the given stop words.
- GreekLowerCaseFilter - Class in org.apache.lucene.analysis.el
-
Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.
- GreekLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.el.GreekLowerCaseFilter
-
Deprecated.Use
GreekLowerCaseFilter(Version, TokenStream)
instead. - GreekLowerCaseFilter(Version, TokenStream) - Constructor for class org.apache.lucene.analysis.el.GreekLowerCaseFilter
-
Create a GreekLowerCaseFilter that normalizes Greek token text.
- GreekStemFilter - Class in org.apache.lucene.analysis.el
-
A
TokenFilter
that appliesGreekStemmer
to stem Greek words. - GreekStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.el.GreekStemFilter
- GreekStemmer - Class in org.apache.lucene.analysis.el
-
A stemmer for Greek words, according to: Development of a Stemmer for the Greek Language. Georgios Ntais
- GreekStemmer() - Constructor for class org.apache.lucene.analysis.el.GreekStemmer
H
- HAMZA_ABOVE - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- HAN - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
bigram flag for Han Ideographs
- HANGUL - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
bigram flag for Hangul
- HAS_BUGGY_BREAKITERATORS - Static variable in class org.apache.lucene.analysis.util.CharArrayIterator
-
True if this JRE has a buggy BreakIterator implementation
- hasFlag(char) - Method in class org.apache.lucene.analysis.hunspell.HunspellWord
-
Checks whether the word has the given flag associated with it
- hashCode() - Method in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
Returns a hash code value for the object.
- hashCode() - Method in class org.tartarus.snowball.ext.ArmenianStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.BasqueStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.CatalanStemmer
- hashCode() - Method in class org.tartarus.snowball.ext.IrishStemmer
- hasMoreElements() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
- HEADING - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- HEADING_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- HEH - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- HEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- HEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- HEH_GOAL - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- HEH_YEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- hi - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Pointer to high branch.
- HindiAnalyzer - Class in org.apache.lucene.analysis.hi
-
Analyzer for Hindi.
- HindiAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Builds an analyzer with the default stop words:
HindiAnalyzer.DEFAULT_STOPWORD_FILE
. - HindiAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Builds an analyzer with the given stop words
- HindiAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.hi.HindiAnalyzer
-
Builds an analyzer with the given stop words
- HindiNormalizationFilter - Class in org.apache.lucene.analysis.hi
-
A
TokenFilter
that appliesHindiNormalizer
to normalize the orthography. - HindiNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.hi.HindiNormalizationFilter
- HindiNormalizer - Class in org.apache.lucene.analysis.hi
-
Normalizer for Hindi.
- HindiNormalizer() - Constructor for class org.apache.lucene.analysis.hi.HindiNormalizer
- HindiStemFilter - Class in org.apache.lucene.analysis.hi
-
A
TokenFilter
that appliesHindiStemmer
to stem Hindi words. - HindiStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.hi.HindiStemFilter
- HindiStemmer - Class in org.apache.lucene.analysis.hi
-
Light Stemmer for Hindi.
- HindiStemmer() - Constructor for class org.apache.lucene.analysis.hi.HindiStemmer
- HIRAGANA - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
bigram flag for Hiragana
- HOST_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- hstrcmp(char[], int, char[], int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
String compare, returns 0 if equal or t is a substring of s
- HTMLStripCharFilter - Class in org.apache.lucene.analysis.charfilter
-
A CharFilter that wraps another Reader and attempts to strip out HTML constructs.
- HTMLStripCharFilter(CharStream) - Constructor for class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
- HTMLStripCharFilter(CharStream, Set<String>) - Constructor for class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
- HungarianAnalyzer - Class in org.apache.lucene.analysis.hu
-
Analyzer
for Hungarian. - HungarianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Builds an analyzer with the default stop words:
HungarianAnalyzer.DEFAULT_STOPWORD_FILE
. - HungarianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Builds an analyzer with the given stop words.
- HungarianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.hu.HungarianAnalyzer
-
Builds an analyzer with the given stop words.
- HungarianLightStemFilter - Class in org.apache.lucene.analysis.hu
-
A
TokenFilter
that appliesHungarianLightStemmer
to stem Hungarian words. - HungarianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.hu.HungarianLightStemFilter
- HungarianLightStemmer - Class in org.apache.lucene.analysis.hu
-
Light Stemmer for Hungarian.
- HungarianLightStemmer() - Constructor for class org.apache.lucene.analysis.hu.HungarianLightStemmer
- HungarianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- HungarianStemmer() - Constructor for class org.tartarus.snowball.ext.HungarianStemmer
- HunspellAffix - Class in org.apache.lucene.analysis.hunspell
-
Wrapper class representing a hunspell affix
- HunspellAffix() - Constructor for class org.apache.lucene.analysis.hunspell.HunspellAffix
- HunspellDictionary - Class in org.apache.lucene.analysis.hunspell
-
In-memory structure for the dictionary (.dic) and affix (.aff) data of a hunspell dictionary.
- HunspellDictionary(InputStream, InputStream, Version) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellDictionary
-
Creates a new HunspellDictionary containing the information read from the provided InputStreams to hunspell affix and dictionary files.
- HunspellDictionary(InputStream, InputStream, Version, boolean) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellDictionary
-
Creates a new HunspellDictionary containing the information read from the provided InputStreams to hunspell affix and dictionary files.
- HunspellDictionary(InputStream, List<InputStream>, Version, boolean) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellDictionary
-
Creates a new HunspellDictionary containing the information read from the provided InputStreams to hunspell affix and dictionary files.
- HunspellStemFilter - Class in org.apache.lucene.analysis.hunspell
-
TokenFilter that uses hunspell affix rules and words to stem tokens.
- HunspellStemFilter(TokenStream, HunspellDictionary) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemFilter
-
Creates a new HunspellStemFilter that will stem tokens from the given TokenStream using affix rules in the provided HunspellDictionary
- HunspellStemFilter(TokenStream, HunspellDictionary, boolean) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemFilter
-
Creates a new HunspellStemFilter that will stem tokens from the given TokenStream using affix rules in the provided HunspellDictionary
- HunspellStemmer - Class in org.apache.lucene.analysis.hunspell
-
HunspellStemmer uses the affix rules declared in the HunspellDictionary to generate one or more stems for a word.
- HunspellStemmer(HunspellDictionary) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Constructs a new HunspellStemmer which will use the provided HunspellDictionary to create its stems
- HunspellStemmer.Stem - Class in org.apache.lucene.analysis.hunspell
-
Stem represents all information known about a stem of a word.
- HunspellWord - Class in org.apache.lucene.analysis.hunspell
-
A dictionary (.dic) entry with its associated flags.
- HunspellWord() - Constructor for class org.apache.lucene.analysis.hunspell.HunspellWord
-
Creates a new HunspellWord with no associated flags
- HunspellWord(char[]) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellWord
-
Constructs a new HunspellWord with the given flags
- Hyphen - Class in org.apache.lucene.analysis.compound.hyphenation
-
This class represents a hyphen.
- hyphenate(char[], int, int, int, int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Hyphenate word and return an array of hyphenation points.
- hyphenate(String, int, int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Hyphenate word and return a Hyphenation object.
- Hyphenation - Class in org.apache.lucene.analysis.compound.hyphenation
-
This class represents a hyphenated word.
- HyphenationCompoundWordTokenFilter - Class in org.apache.lucene.analysis.compound
-
A
TokenFilter
that decomposes compound words found in many Germanic languages. - HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree, String[]) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Deprecated.
- HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree, String[], int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
- HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree, Set<?>) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Deprecated.
- HyphenationCompoundWordTokenFilter(TokenStream, HyphenationTree, Set<?>, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
- HyphenationCompoundWordTokenFilter(Version, TokenStream, HyphenationTree) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a HyphenationCompoundWordTokenFilter with no dictionary.
- HyphenationCompoundWordTokenFilter(Version, TokenStream, HyphenationTree, int, int, int) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Create a HyphenationCompoundWordTokenFilter with no dictionary.
- HyphenationCompoundWordTokenFilter(Version, TokenStream, HyphenationTree, String[]) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Deprecated.Use the constructors taking
Set
- HyphenationCompoundWordTokenFilter(Version, TokenStream, HyphenationTree, String[], int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Deprecated.Use the constructors taking
Set
- HyphenationCompoundWordTokenFilter(Version, TokenStream, HyphenationTree, Set<?>) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Creates a new
HyphenationCompoundWordTokenFilter
instance. - HyphenationCompoundWordTokenFilter(Version, TokenStream, HyphenationTree, Set<?>, int, int, int, boolean) - Constructor for class org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
-
Creates a new
HyphenationCompoundWordTokenFilter
instance. - HyphenationException - Exception in org.apache.lucene.analysis.compound.hyphenation
-
This class has been taken from the Apache FOP project (http://xmlgraphics.apache.org/fop/).
- HyphenationException(String) - Constructor for exception org.apache.lucene.analysis.compound.hyphenation.HyphenationException
- HyphenationTree - Class in org.apache.lucene.analysis.compound.hyphenation
-
This tree structure stores the hyphenation patterns in an efficient way for fast lookup.
- HyphenationTree() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
I
- IdentityEncoder - Class in org.apache.lucene.analysis.payloads
-
Does nothing other than convert the char array to a byte array using the specified encoding.
- IdentityEncoder() - Constructor for class org.apache.lucene.analysis.payloads.IdentityEncoder
- IdentityEncoder(Charset) - Constructor for class org.apache.lucene.analysis.payloads.IdentityEncoder
- ignoringSinglePrefixOrSuffixShingleByDefault - Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- in_grouping(char[], int, int) - Method in class org.tartarus.snowball.SnowballProgram
- in_grouping_b(char[], int, int) - Method in class org.tartarus.snowball.SnowballProgram
- in_range(int, int) - Method in class org.tartarus.snowball.SnowballProgram
- in_range_b(int, int) - Method in class org.tartarus.snowball.SnowballProgram
- incrementToken() - Method in class org.apache.lucene.analysis.ar.ArabicNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ar.ArabicStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.bg.BulgarianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.br.BrazilianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.cjk.CJKBigramFilter
- incrementToken() - Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.Returns true for the next token in the stream, or false at EOS.
- incrementToken() - Method in class org.apache.lucene.analysis.cjk.CJKWidthFilter
- incrementToken() - Method in class org.apache.lucene.analysis.cn.ChineseFilter
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- incrementToken() - Method in class org.apache.lucene.analysis.cz.CzechStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.de.GermanLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.de.GermanMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.de.GermanNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.de.GermanStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.el.GreekLowerCaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.el.GreekStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.en.EnglishMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.en.EnglishPossessiveFilter
- incrementToken() - Method in class org.apache.lucene.analysis.en.KStemFilter
-
Returns the next, stemmed, input Token.
- incrementToken() - Method in class org.apache.lucene.analysis.es.SpanishLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fa.PersianNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fi.FinnishLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fr.ElisionFilter
-
Increments the
TokenStream
with aCharTermAttribute
without elisioned start - incrementToken() - Method in class org.apache.lucene.analysis.fr.FrenchLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.ga.IrishLowerCaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.gl.GalicianMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.gl.GalicianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.hi.HindiNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.hi.HindiStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.hu.HungarianLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.id.IndonesianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.in.IndicNormalizationFilter
- incrementToken() - Method in class org.apache.lucene.analysis.it.ItalianLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.lv.LatvianStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.EmptyTokenStream
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
- incrementToken() - Method in class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
-
Returns the next token in the stream, or null at EOS.
- incrementToken() - Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
Returns the next token in the stream, or null at EOS.
- incrementToken() - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Returns the next token in the stream, or null at EOS.
- incrementToken() - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.Returns the next token in the stream, or null at EOS
- incrementToken() - Method in class org.apache.lucene.analysis.no.NorwegianLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.no.NorwegianMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- incrementToken() - Method in class org.apache.lucene.analysis.payloads.DelimitedPayloadTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.payloads.NumericPayloadTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilter
- incrementToken() - Method in class org.apache.lucene.analysis.position.PositionFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pt.PortugueseLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.pt.PortugueseStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ru.RussianLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.ru.RussianLowerCaseFilter
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.ru.RussianStemFilter
-
Deprecated.Returns the next token in the stream, or null at EOS
- incrementToken() - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- incrementToken() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- incrementToken() - Method in class org.apache.lucene.analysis.snowball.SnowballFilter
-
Returns the next input Token, after being stemmed
- incrementToken() - Method in class org.apache.lucene.analysis.sv.SwedishLightStemFilter
- incrementToken() - Method in class org.apache.lucene.analysis.synonym.SynonymFilter
- incrementToken() - Method in class org.apache.lucene.analysis.th.ThaiWordFilter
- incrementToken() - Method in class org.apache.lucene.analysis.tr.TurkishLowerCaseFilter
- incrementToken() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- IndicNormalizationFilter - Class in org.apache.lucene.analysis.in
-
A
TokenFilter
that appliesIndicNormalizer
to normalize text in Indian Languages. - IndicNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.in.IndicNormalizationFilter
- IndicNormalizer - Class in org.apache.lucene.analysis.in
-
Normalizes the Unicode representation of text in Indian languages.
- IndicNormalizer() - Constructor for class org.apache.lucene.analysis.in.IndicNormalizer
- IndicTokenizer - Class in org.apache.lucene.analysis.in
-
Deprecated.(3.6) Use
StandardTokenizer
instead. - IndicTokenizer(Version, Reader) - Constructor for class org.apache.lucene.analysis.in.IndicTokenizer
-
Deprecated.
- IndicTokenizer(Version, AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.in.IndicTokenizer
-
Deprecated.
- IndicTokenizer(Version, AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.in.IndicTokenizer
-
Deprecated.
- IndonesianAnalyzer - Class in org.apache.lucene.analysis.id
-
Analyzer for Indonesian (Bahasa)
- IndonesianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Builds an analyzer with the default stop words:
IndonesianAnalyzer.DEFAULT_STOPWORD_FILE
. - IndonesianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Builds an analyzer with the given stop words
- IndonesianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.id.IndonesianAnalyzer
-
Builds an analyzer with the given stop word.
- IndonesianStemFilter - Class in org.apache.lucene.analysis.id
-
A
TokenFilter
that appliesIndonesianStemmer
to stem Indonesian words. - IndonesianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.id.IndonesianStemFilter
- IndonesianStemFilter(TokenStream, boolean) - Constructor for class org.apache.lucene.analysis.id.IndonesianStemFilter
-
Create a new IndonesianStemFilter.
- IndonesianStemmer - Class in org.apache.lucene.analysis.id
-
Stemmer for Indonesian.
- IndonesianStemmer() - Constructor for class org.apache.lucene.analysis.id.IndonesianStemmer
- INFORMATION_SEPARATOR_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+001F (INFORMATION SEPARATOR ONE)
- init() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- initReader(Reader) - Method in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Wraps the Reader with
PersianCharFilter
- insert(char[], int, char) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- insert(int, int, CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- insert(int, int, String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- insert(int, int, StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- insert(String, char) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Branches are initially compressed, needing one node per key plus the size of the string key.
- insertBalanced(String[], char[], int, int) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Recursively insert the median first and then the median of the lower and upper halves, and so on in order to get a balanced tree.
- IntegerEncoder - Class in org.apache.lucene.analysis.payloads
-
Encode a character array Integer as a
Payload
. - IntegerEncoder() - Constructor for class org.apache.lucene.analysis.payloads.IntegerEncoder
- INTERNAL_LINK - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- INTERNAL_LINK_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- IrishAnalyzer - Class in org.apache.lucene.analysis.ga
-
Analyzer
for Irish. - IrishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Builds an analyzer with the default stop words:
IrishAnalyzer.DEFAULT_STOPWORD_FILE
. - IrishAnalyzer(Version, CharArraySet) - Constructor for class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Builds an analyzer with the given stop words.
- IrishAnalyzer(Version, CharArraySet, CharArraySet) - Constructor for class org.apache.lucene.analysis.ga.IrishAnalyzer
-
Builds an analyzer with the given stop words.
- IrishLowerCaseFilter - Class in org.apache.lucene.analysis.ga
-
Normalises token text to lower case, handling t-prothesis and n-eclipsis (i.e., that 'nAthair' should become 'n-athair')
- IrishLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ga.IrishLowerCaseFilter
-
Create an IrishLowerCaseFilter that normalises Irish token text.
- IrishStemmer - Class in org.tartarus.snowball.ext
-
This class was automatically generated by a Snowball to Java compiler It implements the stemming algorithm defined by a snowball script.
- IrishStemmer() - Constructor for class org.tartarus.snowball.ext.IrishStemmer
- isCrossProduct() - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Returns whether the affix is defined as cross product
- isFirst() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- isIgnoreCase() - Method in class org.apache.lucene.analysis.hunspell.HunspellDictionary
- isIgnoringSinglePrefixOrSuffixShingle() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- isLast() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- isOutputUnigrams() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- isOutputUnigramsIfNoShingles() - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- isTokenChar(int) - Method in class org.apache.lucene.analysis.ar.ArabicLetterTokenizer
-
Deprecated.Allows for Letter category or NonspacingMark category
- isTokenChar(int) - Method in class org.apache.lucene.analysis.in.IndicTokenizer
-
Deprecated.
- isTokenChar(int) - Method in class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.Collects only characters which satisfy
Character.isLetter(int)
. - ItalianAnalyzer - Class in org.apache.lucene.analysis.it
-
Analyzer
for Italian. - ItalianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Builds an analyzer with the default stop words:
ItalianAnalyzer.DEFAULT_STOPWORD_FILE
. - ItalianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Builds an analyzer with the given stop words.
- ItalianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.it.ItalianAnalyzer
-
Builds an analyzer with the given stop words.
- ItalianLightStemFilter - Class in org.apache.lucene.analysis.it
-
A
TokenFilter
that appliesItalianLightStemmer
to stem Italian words. - ItalianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.it.ItalianLightStemFilter
- ItalianLightStemmer - Class in org.apache.lucene.analysis.it
-
Light Stemmer for Italian.
- ItalianLightStemmer() - Constructor for class org.apache.lucene.analysis.it.ItalianLightStemmer
- ItalianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- ItalianStemmer() - Constructor for class org.tartarus.snowball.ext.ItalianStemmer
- ITALICS - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- ITALICS_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- Iterator() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
J
- join(String[], CharsRef) - Static method in class org.apache.lucene.analysis.synonym.SynonymMap.Builder
-
Sugar: just joins the provided terms with
SynonymMap.WORD_SEPARATOR
. - jreBugWorkaround(char) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
K
- KAF - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- KAF - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- KASRA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- KASRATAN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- KATAKANA - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
bigram flag for Katakana
- KEHEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- ket - Variable in class org.tartarus.snowball.SnowballProgram
- keys() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- knows(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- KpStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- KpStemmer() - Constructor for class org.tartarus.snowball.ext.KpStemmer
- KStemFilter - Class in org.apache.lucene.analysis.en
-
A high-performance kstem filter for english.
- KStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.en.KStemFilter
- KStemmer - Class in org.apache.lucene.analysis.en
-
This class implements the Kstem algorithm
- kv - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
This vector holds the trailing of the keys when the branch is compressed.
L
- LAM - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- last() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- LatvianAnalyzer - Class in org.apache.lucene.analysis.lv
-
Analyzer
for Latvian. - LatvianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Builds an analyzer with the default stop words:
LatvianAnalyzer.DEFAULT_STOPWORD_FILE
. - LatvianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Builds an analyzer with the given stop words.
- LatvianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.lv.LatvianAnalyzer
-
Builds an analyzer with the given stop words.
- LatvianStemFilter - Class in org.apache.lucene.analysis.lv
-
A
TokenFilter
that appliesLatvianStemmer
to stem Latvian words. - LatvianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.lv.LatvianStemFilter
- LatvianStemmer - Class in org.apache.lucene.analysis.lv
-
Light stemmer for Latvian.
- LatvianStemmer() - Constructor for class org.apache.lucene.analysis.lv.LatvianStemmer
- len - Variable in class org.apache.lucene.analysis.util.OpenStringBuilder
- length - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- length() - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
-
return number of items in array
- length() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
-
return number of items in array
- length() - Method in class org.apache.lucene.analysis.compound.hyphenation.Hyphenation
- length() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- limit - Variable in class org.tartarus.snowball.SnowballProgram
- limit_backward - Variable in class org.tartarus.snowball.SnowballProgram
- lo - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Pointer to low branch and to rest of the key when it is stored directly in this node, we don't have unions in java!
- loadPatterns(File) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Read hyphenation patterns from an XML file.
- loadPatterns(InputSource) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Read hyphenation patterns from an XML file.
- loadStopWords(InputStream, String) - Method in class org.apache.lucene.analysis.cz.CzechAnalyzer
-
Deprecated.use
WordlistLoader.getWordSet(Reader, String, Version)
andCzechAnalyzer(Version, Set)
instead - log() - Method in class org.apache.lucene.analysis.br.BrazilianStemmer
-
For log and debug purpose
- lookupPrefix(char[], int, int) - Method in class org.apache.lucene.analysis.hunspell.HunspellDictionary
-
Looks up HunspellAffix prefixes that have an append that matches the String created from the given char array, offset and length
- lookupSuffix(char[], int, int) - Method in class org.apache.lucene.analysis.hunspell.HunspellDictionary
-
Looks up HunspellAffix suffixes that have an append that matches the String created from the given char array, offset and length
- lookupWord(char[], int, int) - Method in class org.apache.lucene.analysis.hunspell.HunspellDictionary
-
Looks up HunspellWords that match the String created from the given char array, offset and length
- LovinsStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- LovinsStemmer() - Constructor for class org.tartarus.snowball.ext.LovinsStemmer
M
- main(String[]) - Static method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- main(String[]) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- main(String[]) - Static method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
HunspellStemmer entry point.
- main(String[]) - Static method in class org.tartarus.snowball.TestApp
- makeDictionary(Version, String[]) - Static method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
-
Deprecated.Only available for backwards compatibility.
- matches(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- matches(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
- matches(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
- Matrix() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
-
Deprecated.
- maxHorizontalContext - Variable in class org.apache.lucene.analysis.synonym.SynonymMap
-
maxHorizontalContext: maximum context we need on the tokenstream
- maxSubwordSize - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- method - Variable in class org.tartarus.snowball.Among
- methodobject - Variable in class org.tartarus.snowball.Among
- min - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- min - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- minSubwordSize - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- minWordSize - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
N
- name - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- newColumn - Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenPositioner
-
Deprecated.
- newRow - Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenPositioner
-
Deprecated.
- newSentenceInstance() - Static method in class org.apache.lucene.analysis.util.CharArrayIterator
-
Create a new CharArrayIterator that works around JRE bugs in a manner suitable for
BreakIterator.getSentenceInstance()
- newWordInstance() - Static method in class org.apache.lucene.analysis.util.CharArrayIterator
-
Create a new CharArrayIterator that works around JRE bugs in a manner suitable for
BreakIterator.getWordInstance()
- next() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- nextElement() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
- NGramTokenFilter - Class in org.apache.lucene.analysis.ngram
-
Tokenizes the input into n-grams of the given size(s).
- NGramTokenFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
Creates NGramTokenFilter with default min and max n-grams.
- NGramTokenFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenFilter
-
Creates NGramTokenFilter with given min and max n-grams.
- NGramTokenizer - Class in org.apache.lucene.analysis.ngram
-
Tokenizes the input into n-grams of the given size(s).
- NGramTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Creates NGramTokenizer with default min and max n-grams.
- NGramTokenizer(Reader, int, int) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Creates NGramTokenizer with given min and max n-grams.
- NGramTokenizer(AttributeSource.AttributeFactory, Reader, int, int) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Creates NGramTokenizer with given min and max n-grams.
- NGramTokenizer(AttributeSource, Reader, int, int) - Constructor for class org.apache.lucene.analysis.ngram.NGramTokenizer
-
Creates NGramTokenizer with given min and max n-grams.
- noBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
- NON_WORD_PATTERN - Static variable in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
"\\W+"
; Divides text at non-letters (NOT Character.isLetter(c)) - NOON - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- normalize(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicNormalizer
-
Normalize an input buffer of Arabic text
- normalize(char[], int) - Method in class org.apache.lucene.analysis.fa.PersianNormalizer
-
Normalize an input buffer of Persian text
- normalize(char[], int) - Method in class org.apache.lucene.analysis.hi.HindiNormalizer
-
Normalize an input buffer of Hindi text
- normalize(char[], int) - Method in class org.apache.lucene.analysis.in.IndicNormalizer
-
Normalizes input text, and returns the new length.
- normalizeException(ArrayList<?>) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- NorwegianAnalyzer - Class in org.apache.lucene.analysis.no
-
Analyzer
for Norwegian. - NorwegianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Builds an analyzer with the default stop words:
NorwegianAnalyzer.DEFAULT_STOPWORD_FILE
. - NorwegianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Builds an analyzer with the given stop words.
- NorwegianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.no.NorwegianAnalyzer
-
Builds an analyzer with the given stop words.
- NorwegianLightStemFilter - Class in org.apache.lucene.analysis.no
-
A
TokenFilter
that appliesNorwegianLightStemmer
to stem Norwegian words. - NorwegianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.no.NorwegianLightStemFilter
- NorwegianLightStemmer - Class in org.apache.lucene.analysis.no
-
Light Stemmer for Norwegian.
- NorwegianLightStemmer() - Constructor for class org.apache.lucene.analysis.no.NorwegianLightStemmer
- NorwegianMinimalStemFilter - Class in org.apache.lucene.analysis.no
-
A
TokenFilter
that appliesNorwegianMinimalStemmer
to stem Norwegian words. - NorwegianMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.no.NorwegianMinimalStemFilter
- NorwegianMinimalStemmer - Class in org.apache.lucene.analysis.no
-
Minimal Stemmer for Norwegian bokmål (no-nb)
- NorwegianMinimalStemmer() - Constructor for class org.apache.lucene.analysis.no.NorwegianMinimalStemmer
- NorwegianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- NorwegianStemmer() - Constructor for class org.tartarus.snowball.ext.NorwegianStemmer
- NUM_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- NumericPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
-
Assigns a payload to a token based on the
Token.type()
- NumericPayloadTokenFilter(TokenStream, float, String) - Constructor for class org.apache.lucene.analysis.payloads.NumericPayloadTokenFilter
O
- offsetAtt - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- OneDimensionalNonWeightedTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- onlyLongestMatch - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- OpenStringBuilder - Class in org.apache.lucene.analysis.util
-
A StringBuilder that allows one to access the array.
- OpenStringBuilder() - Constructor for class org.apache.lucene.analysis.util.OpenStringBuilder
- OpenStringBuilder(char[], int) - Constructor for class org.apache.lucene.analysis.util.OpenStringBuilder
- OpenStringBuilder(int) - Constructor for class org.apache.lucene.analysis.util.OpenStringBuilder
- org.apache.lucene.analysis.ar - package org.apache.lucene.analysis.ar
-
Analyzer for Arabic.
- org.apache.lucene.analysis.bg - package org.apache.lucene.analysis.bg
-
Analyzer for Bulgarian.
- org.apache.lucene.analysis.br - package org.apache.lucene.analysis.br
-
Analyzer for Brazilian Portuguese.
- org.apache.lucene.analysis.ca - package org.apache.lucene.analysis.ca
-
Analyzer for Catalan.
- org.apache.lucene.analysis.charfilter - package org.apache.lucene.analysis.charfilter
-
CharFilters: process text before the Tokenizer
- org.apache.lucene.analysis.cjk - package org.apache.lucene.analysis.cjk
-
Analyzer for Chinese, Japanese, and Korean, which indexes bigrams (overlapping groups of two adjacent Han characters).
- org.apache.lucene.analysis.cn - package org.apache.lucene.analysis.cn
-
Analyzer for Chinese, which indexes unigrams (individual chinese characters).
- org.apache.lucene.analysis.compound - package org.apache.lucene.analysis.compound
-
A filter that decomposes compound words you find in many Germanic languages into the word parts.
- org.apache.lucene.analysis.compound.hyphenation - package org.apache.lucene.analysis.compound.hyphenation
-
The code for the compound word hyphenation is taken from the Apache FOP project.
- org.apache.lucene.analysis.cz - package org.apache.lucene.analysis.cz
-
Analyzer for Czech.
- org.apache.lucene.analysis.da - package org.apache.lucene.analysis.da
-
Analyzer for Danish.
- org.apache.lucene.analysis.de - package org.apache.lucene.analysis.de
-
Analyzer for German.
- org.apache.lucene.analysis.el - package org.apache.lucene.analysis.el
-
Analyzer for Greek.
- org.apache.lucene.analysis.en - package org.apache.lucene.analysis.en
-
Analyzer for English.
- org.apache.lucene.analysis.es - package org.apache.lucene.analysis.es
-
Analyzer for Spanish.
- org.apache.lucene.analysis.eu - package org.apache.lucene.analysis.eu
-
Analyzer for Basque.
- org.apache.lucene.analysis.fa - package org.apache.lucene.analysis.fa
-
Analyzer for Persian.
- org.apache.lucene.analysis.fi - package org.apache.lucene.analysis.fi
-
Analyzer for Finnish.
- org.apache.lucene.analysis.fr - package org.apache.lucene.analysis.fr
-
Analyzer for French.
- org.apache.lucene.analysis.ga - package org.apache.lucene.analysis.ga
-
Analysis for Irish.
- org.apache.lucene.analysis.gl - package org.apache.lucene.analysis.gl
-
Analyzer for Galician.
- org.apache.lucene.analysis.hi - package org.apache.lucene.analysis.hi
-
Analyzer for Hindi.
- org.apache.lucene.analysis.hu - package org.apache.lucene.analysis.hu
-
Analyzer for Hungarian.
- org.apache.lucene.analysis.hunspell - package org.apache.lucene.analysis.hunspell
-
Stemming TokenFilter using a Java implementation of the Hunspell stemming algorithm.
- org.apache.lucene.analysis.hy - package org.apache.lucene.analysis.hy
-
Analyzer for Armenian.
- org.apache.lucene.analysis.id - package org.apache.lucene.analysis.id
-
Analyzer for Indonesian.
- org.apache.lucene.analysis.in - package org.apache.lucene.analysis.in
-
Analysis components for Indian languages.
- org.apache.lucene.analysis.it - package org.apache.lucene.analysis.it
-
Analyzer for Italian.
- org.apache.lucene.analysis.lv - package org.apache.lucene.analysis.lv
-
Analyzer for Latvian.
- org.apache.lucene.analysis.miscellaneous - package org.apache.lucene.analysis.miscellaneous
-
Miscellaneous TokenStreams
- org.apache.lucene.analysis.ngram - package org.apache.lucene.analysis.ngram
-
Character n-gram tokenizers and filters.
- org.apache.lucene.analysis.nl - package org.apache.lucene.analysis.nl
-
Analyzer for Dutch.
- org.apache.lucene.analysis.no - package org.apache.lucene.analysis.no
-
Analyzer for Norwegian.
- org.apache.lucene.analysis.path - package org.apache.lucene.analysis.path
-
Analysis components for path-like strings such as filenames.
- org.apache.lucene.analysis.payloads - package org.apache.lucene.analysis.payloads
-
Provides various convenience classes for creating payloads on Tokens.
- org.apache.lucene.analysis.position - package org.apache.lucene.analysis.position
-
Filter for assigning position increments.
- org.apache.lucene.analysis.pt - package org.apache.lucene.analysis.pt
-
Analyzer for Portuguese.
- org.apache.lucene.analysis.query - package org.apache.lucene.analysis.query
-
Automatically filter high-frequency stopwords.
- org.apache.lucene.analysis.reverse - package org.apache.lucene.analysis.reverse
-
Filter to reverse token text.
- org.apache.lucene.analysis.ro - package org.apache.lucene.analysis.ro
-
Analyzer for Romanian.
- org.apache.lucene.analysis.ru - package org.apache.lucene.analysis.ru
-
Analyzer for Russian.
- org.apache.lucene.analysis.shingle - package org.apache.lucene.analysis.shingle
-
Word n-gram filters
- org.apache.lucene.analysis.sinks - package org.apache.lucene.analysis.sinks
-
Implementations of the SinkTokenizer that might be useful.
- org.apache.lucene.analysis.snowball - package org.apache.lucene.analysis.snowball
-
TokenFilter
andAnalyzer
implementations that use Snowball stemmers. - org.apache.lucene.analysis.sv - package org.apache.lucene.analysis.sv
-
Analyzer for Swedish.
- org.apache.lucene.analysis.synonym - package org.apache.lucene.analysis.synonym
-
Analysis components for Synonyms.
- org.apache.lucene.analysis.th - package org.apache.lucene.analysis.th
-
Analyzer for Thai.
- org.apache.lucene.analysis.tr - package org.apache.lucene.analysis.tr
-
Analyzer for Turkish.
- org.apache.lucene.analysis.util - package org.apache.lucene.analysis.util
-
Utility functions for text analysis.
- org.apache.lucene.analysis.wikipedia - package org.apache.lucene.analysis.wikipedia
-
Tokenizer that is aware of Wikipedia syntax.
- org.tartarus.snowball - package org.tartarus.snowball
-
Snowball stemmer API.
- org.tartarus.snowball.ext - package org.tartarus.snowball.ext
-
Autogenerated snowball stemmer implementations.
- out_grouping(char[], int, int) - Method in class org.tartarus.snowball.SnowballProgram
- out_grouping_b(char[], int, int) - Method in class org.tartarus.snowball.SnowballProgram
- out_range(int, int) - Method in class org.tartarus.snowball.SnowballProgram
- out_range_b(int, int) - Method in class org.tartarus.snowball.SnowballProgram
P
- packValues(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Packs the values by storing them in 4 bits, two values into a byte Values range is from 0 to 9.
- parse(File) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(Class<? extends RSLPStemmerBase>, String) - Static method in class org.apache.lucene.analysis.pt.RSLPStemmerBase
-
Parse a resource file into an RSLP stemmer description.
- parse(String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- parse(InputSource) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
-
Parses a hyphenation pattern file.
- PathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
-
Tokenizer for path-like hierarchies.
- PathHierarchyTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(Reader, char, char) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(Reader, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(Reader, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(Reader, int, char) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PathHierarchyTokenizer(Reader, int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- PatternAnalyzer - Class in org.apache.lucene.analysis.miscellaneous
-
Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than a
Reader
, that can flexibly separate text into terms via a regular expressionPattern
(with behaviour identical toString.split(String)
), and that combines the functionality ofLetterTokenizer
,LowerCaseTokenizer
,WhitespaceTokenizer
,StopFilter
into a single efficient multi-purpose class. - PatternAnalyzer(Version, Pattern, boolean, Set<?>) - Constructor for class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
Constructs a new instance with the given parameters.
- PatternConsumer - Interface in org.apache.lucene.analysis.compound.hyphenation
-
This interface is used to connect the XML pattern file parser to the hyphenation tree.
- PatternParser - Class in org.apache.lucene.analysis.compound.hyphenation
-
A SAX document handler to read and parse hyphenation patterns from a XML file.
- PatternParser() - Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- PatternParser(PatternConsumer) - Constructor for class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- PayloadEncoder - Interface in org.apache.lucene.analysis.payloads
-
Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to Payload.
- PayloadHelper - Class in org.apache.lucene.analysis.payloads
-
Utility methods for encoding payloads.
- PayloadHelper() - Constructor for class org.apache.lucene.analysis.payloads.PayloadHelper
- permutationIterator() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
-
Deprecated.
- PersianAnalyzer - Class in org.apache.lucene.analysis.fa
-
Analyzer
for Persian. - PersianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Builds an analyzer with the default stop words:
PersianAnalyzer.DEFAULT_STOPWORD_FILE
. - PersianAnalyzer(Version, File) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Deprecated.use
PersianAnalyzer(Version, Set)
instead - PersianAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Deprecated.use
PersianAnalyzer(Version, Set)
instead - PersianAnalyzer(Version, Hashtable<?, ?>) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Deprecated.use
PersianAnalyzer(Version, Set)
instead - PersianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.fa.PersianAnalyzer
-
Builds an analyzer with the given stop words
- PersianCharFilter - Class in org.apache.lucene.analysis.fa
-
CharFilter that replaces instances of Zero-width non-joiner with an ordinary space.
- PersianCharFilter(CharStream) - Constructor for class org.apache.lucene.analysis.fa.PersianCharFilter
- PersianNormalizationFilter - Class in org.apache.lucene.analysis.fa
-
A
TokenFilter
that appliesPersianNormalizer
to normalize the orthography. - PersianNormalizationFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizationFilter
- PersianNormalizer - Class in org.apache.lucene.analysis.fa
-
Normalizer for Persian.
- PersianNormalizer() - Constructor for class org.apache.lucene.analysis.fa.PersianNormalizer
- PorterStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- PorterStemmer() - Constructor for class org.tartarus.snowball.ext.PorterStemmer
- PortugueseAnalyzer - Class in org.apache.lucene.analysis.pt
-
Analyzer
for Portuguese. - PortugueseAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the default stop words:
PortugueseAnalyzer.DEFAULT_STOPWORD_FILE
. - PortugueseAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the given stop words.
- PortugueseAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.pt.PortugueseAnalyzer
-
Builds an analyzer with the given stop words.
- PortugueseLightStemFilter - Class in org.apache.lucene.analysis.pt
-
A
TokenFilter
that appliesPortugueseLightStemmer
to stem Portuguese words. - PortugueseLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemFilter
- PortugueseLightStemmer - Class in org.apache.lucene.analysis.pt
-
Light Stemmer for Portuguese
- PortugueseLightStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseLightStemmer
- PortugueseMinimalStemFilter - Class in org.apache.lucene.analysis.pt
-
A
TokenFilter
that appliesPortugueseMinimalStemmer
to stem Portuguese words. - PortugueseMinimalStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemFilter
- PortugueseMinimalStemmer - Class in org.apache.lucene.analysis.pt
-
Minimal Stemmer for Portuguese
- PortugueseMinimalStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
- PortugueseStemFilter - Class in org.apache.lucene.analysis.pt
-
A
TokenFilter
that appliesPortugueseStemmer
to stem Portuguese words. - PortugueseStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemFilter
- PortugueseStemmer - Class in org.apache.lucene.analysis.pt
-
Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa) algorithm.
- PortugueseStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- PortugueseStemmer() - Constructor for class org.apache.lucene.analysis.pt.PortugueseStemmer
- PortugueseStemmer() - Constructor for class org.tartarus.snowball.ext.PortugueseStemmer
- PositionFilter - Class in org.apache.lucene.analysis.position
-
Set the positionIncrement of all tokens to the "positionIncrement", except the first return token which retains its original positionIncrement value.
- PositionFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.position.PositionFilter
-
Constructs a PositionFilter that assigns a position increment of zero to all but the first token from the given input stream.
- PositionFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.position.PositionFilter
-
Constructs a PositionFilter that assigns the given position increment to all but the first token from the given input stream.
- postBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
- preBreak - Variable in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
- PrefixAndSuffixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Links two
PrefixAwareTokenFilter
. - PrefixAndSuffixAwareTokenFilter(TokenStream, TokenStream, TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
- PrefixAwareTokenFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Joins two token streams and leaves the last token of the first stream available to be used when updating the token values in the second stream based on that token.
- PrefixAwareTokenFilter(TokenStream, TokenStream) - Constructor for class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- prefixes - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- previous() - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- printStats() - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- printStats() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- PUA_EC00_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+EC00 (PRIVATE USE AREA: EC00)
- put(int, byte) - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- put(int, char) - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
Q
- QueryAutoStopWordAnalyzer - Class in org.apache.lucene.analysis.query
-
An
Analyzer
used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries. - QueryAutoStopWordAnalyzer(Version, Analyzer) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Deprecated.Stopwords should be calculated at instantiation using one of the other constructors
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all indexed fields from terms with a document frequency percentage greater than
QueryAutoStopWordAnalyzer.defaultMaxDocFreqPercent
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, float) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all indexed fields from terms with a document frequency percentage greater than the given maxPercentDocs
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, int) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all indexed fields from terms with a document frequency greater than the given maxDocFreq
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, Collection<String>, float) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the given selection of fields from terms with a document frequency percentage greater than the given maxPercentDocs
- QueryAutoStopWordAnalyzer(Version, Analyzer, IndexReader, Collection<String>, int) - Constructor for class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
-
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the given selection of fields from terms with a document frequency greater than the given maxDocFreq
R
- read() - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
- read(char[], int, int) - Method in class org.apache.lucene.analysis.charfilter.HTMLStripCharFilter
- read(char[], int, int) - Method in class org.apache.lucene.analysis.fa.PersianCharFilter
- readToken(StringBuffer) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- replace(char[], int) - Method in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- replace_s(int, int, CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- replace_s(int, int, String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- replacement - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- reserve(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- reset() - Method in class org.apache.lucene.analysis.cjk.CJKBigramFilter
- reset() - Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- reset() - Method in class org.apache.lucene.analysis.hunspell.HunspellStemFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- reset() - Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
- reset() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter
- reset() - Method in class org.apache.lucene.analysis.ngram.EdgeNGramTokenizer
- reset() - Method in class org.apache.lucene.analysis.ngram.NGramTokenFilter
- reset() - Method in class org.apache.lucene.analysis.ngram.NGramTokenizer
- reset() - Method in class org.apache.lucene.analysis.path.PathHierarchyTokenizer
- reset() - Method in class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- reset() - Method in class org.apache.lucene.analysis.position.PositionFilter
- reset() - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
- reset() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- reset() - Method in class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
- reset() - Method in class org.apache.lucene.analysis.synonym.SynonymFilter
- reset() - Method in class org.apache.lucene.analysis.th.ThaiWordFilter
- reset() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- reset() - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- reset(Reader) - Method in class org.apache.lucene.analysis.cjk.CJKTokenizer
-
Deprecated.
- reset(Reader) - Method in class org.apache.lucene.analysis.cn.ChineseTokenizer
-
Deprecated.
- reset(Reader) - Method in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- resize(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- resolveEntity(String, String) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- result - Variable in class org.tartarus.snowball.Among
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- reusableTokenStream(String, Reader) - Method in class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.Returns a (possibly reused)
StandardTokenizer
filtered by aStandardFilter
, aLowerCaseFilter
, aStopFilter
, and aSnowballFilter
- reverse(char[]) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Deprecated.use
ReverseStringFilter.reverse(Version, char[])
instead. This method will be removed in Lucene 4.0 - reverse(char[], int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Deprecated.use
ReverseStringFilter.reverse(Version, char[], int)
instead. This method will be removed in Lucene 4.0 - reverse(char[], int, int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Deprecated.use
ReverseStringFilter.reverse(Version, char[], int, int)
instead. This method will be removed in Lucene 4.0 - reverse(String) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Deprecated.use
ReverseStringFilter.reverse(Version, String)
instead. This method will be removed in Lucene 4.0 - reverse(Version, char[]) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Reverses the given input buffer in-place
- reverse(Version, char[], int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Partially reverses the given input buffer in-place from offset 0 up to the given length.
- reverse(Version, char[], int, int) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Partially reverses the given input buffer in-place from the given offset up to the given length.
- reverse(Version, String) - Static method in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Reverses the given input string
- ReversePathHierarchyTokenizer - Class in org.apache.lucene.analysis.path
-
Tokenizer for domain-like hierarchies.
- ReversePathHierarchyTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(Reader, char, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(Reader, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(Reader, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(Reader, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(Reader, int, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(Reader, int, char, char) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReversePathHierarchyTokenizer(Reader, int, char, char, int) - Constructor for class org.apache.lucene.analysis.path.ReversePathHierarchyTokenizer
- ReverseStringFilter - Class in org.apache.lucene.analysis.reverse
-
Reverse token string, for example "country" => "yrtnuoc".
- ReverseStringFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Deprecated.use
ReverseStringFilter(Version, TokenStream)
instead. This constructor will be removed in Lucene 4.0 - ReverseStringFilter(TokenStream, char) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Deprecated.use
ReverseStringFilter(Version, TokenStream, char)
instead. This constructor will be removed in Lucene 4.0 - ReverseStringFilter(Version, TokenStream) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Create a new ReverseStringFilter that reverses all tokens in the supplied
TokenStream
. - ReverseStringFilter(Version, TokenStream, char) - Constructor for class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Create a new ReverseStringFilter that reverses and marks all tokens in the supplied
TokenStream
. - rewind() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree.Iterator
- RomanianAnalyzer - Class in org.apache.lucene.analysis.ro
-
Analyzer
for Romanian. - RomanianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the default stop words:
RomanianAnalyzer.DEFAULT_STOPWORD_FILE
. - RomanianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the given stop words.
- RomanianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.ro.RomanianAnalyzer
-
Builds an analyzer with the given stop words.
- RomanianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- RomanianStemmer() - Constructor for class org.tartarus.snowball.ext.RomanianStemmer
- root - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- Row() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- RSLPStemmerBase - Class in org.apache.lucene.analysis.pt
-
Base class for stemmers that use a set of RSLP-like stemming steps.
- RSLPStemmerBase() - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase
- RSLPStemmerBase.Rule - Class in org.apache.lucene.analysis.pt
-
A basic rule, with no exceptions.
- RSLPStemmerBase.RuleWithSetExceptions - Class in org.apache.lucene.analysis.pt
-
A rule with a set of whole-word exceptions.
- RSLPStemmerBase.RuleWithSuffixExceptions - Class in org.apache.lucene.analysis.pt
-
A rule with a set of exceptional suffixes.
- RSLPStemmerBase.Step - Class in org.apache.lucene.analysis.pt
-
A step containing a list of rules.
- RTL_DIRECTION_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+200F (RIGHT-TO-LEFT MARK)
- Rule(String, int, String) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
-
Create a rule.
- rules - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- RuleWithSetExceptions(String, int, String, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSetExceptions
- RuleWithSuffixExceptions(String, int, String, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.RuleWithSuffixExceptions
- RussianAnalyzer - Class in org.apache.lucene.analysis.ru
-
Analyzer
for Russian language. - RussianAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
- RussianAnalyzer(Version, String...) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Deprecated.use
RussianAnalyzer(Version, Set)
instead - RussianAnalyzer(Version, Map<?, ?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Deprecated.use
RussianAnalyzer(Version, Set)
instead - RussianAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Builds an analyzer with the given stop words
- RussianAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.ru.RussianAnalyzer
-
Builds an analyzer with the given stop words
- RussianLetterTokenizer - Class in org.apache.lucene.analysis.ru
-
Deprecated.Use
StandardTokenizer
instead, which has the same functionality. This filter will be removed in Lucene 5.0 - RussianLetterTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.use
RussianLetterTokenizer(Version, Reader)
instead. This will be removed in Lucene 4.0. - RussianLetterTokenizer(AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.use
RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader)
instead. This will be removed in Lucene 4.0. - RussianLetterTokenizer(AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.use
RussianLetterTokenizer(Version, AttributeSource, Reader)
instead. This will be removed in Lucene 4.0. - RussianLetterTokenizer(Version, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.Construct a new RussianLetterTokenizer.
- RussianLetterTokenizer(Version, AttributeSource.AttributeFactory, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.Construct a new RussianLetterTokenizer using a given
AttributeSource.AttributeFactory
. - RussianLetterTokenizer(Version, AttributeSource, Reader) - Constructor for class org.apache.lucene.analysis.ru.RussianLetterTokenizer
-
Deprecated.Construct a new RussianLetterTokenizer using a given
AttributeSource
. - RussianLightStemFilter - Class in org.apache.lucene.analysis.ru
-
A
TokenFilter
that appliesRussianLightStemmer
to stem Russian words. - RussianLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemFilter
- RussianLightStemmer - Class in org.apache.lucene.analysis.ru
-
Light Stemmer for Russian.
- RussianLightStemmer() - Constructor for class org.apache.lucene.analysis.ru.RussianLightStemmer
- RussianLowerCaseFilter - Class in org.apache.lucene.analysis.ru
-
Deprecated.Use
LowerCaseFilter
instead, which has the same functionality. This filter will be removed in Lucene 4.0 - RussianLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianLowerCaseFilter
-
Deprecated.
- RussianStemFilter - Class in org.apache.lucene.analysis.ru
-
Deprecated.Use
SnowballFilter
withRussianStemmer
instead, which has the same functionality. This filter will be removed in Lucene 4.0 - RussianStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.ru.RussianStemFilter
-
Deprecated.
- RussianStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- RussianStemmer() - Constructor for class org.tartarus.snowball.ext.RussianStemmer
S
- s - Variable in class org.tartarus.snowball.Among
- s_size - Variable in class org.tartarus.snowball.Among
- sameRow - Static variable in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenPositioner
-
Deprecated.
- sc - Variable in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
The character stored in this node: splitchar.
- searchPatterns(char[], int, byte[]) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
Search for all possible partial matches of word starting at index an update interletter values.
- set(char[], int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- setAppend(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the append defined for the affix
- setAppendFlags(char[]) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the flags defined for the affix append
- setArticles(Set<?>) - Method in class org.apache.lucene.analysis.fr.ElisionFilter
-
Deprecated.use
ElisionFilter.setArticles(Version, Set)
instead - setArticles(Version, Set<?>) - Method in class org.apache.lucene.analysis.fr.ElisionFilter
-
Deprecated.use
ElisionFilter(Version, TokenStream, Set)
instead - setCharAt(int, char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- setCondition(String, String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the condition that must be met before the affix can be applied
- setConsumer(PatternConsumer) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- setCrossProduct(boolean) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets whether the affix is defined as cross product
- setCurrent(char[], int) - Method in class org.tartarus.snowball.SnowballProgram
-
Set the current string.
- setCurrent(String) - Method in class org.tartarus.snowball.SnowballProgram
-
Set the current string.
- setExclusionSet(Set<?>) - Method in class org.apache.lucene.analysis.de.GermanStemFilter
-
Deprecated.use
KeywordAttribute
withKeywordMarkerFilter
instead. - setExclusionTable(HashSet<?>) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.use
KeywordAttribute
withKeywordMarkerFilter
instead. - setExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.use
KeywordAttribute
withKeywordMarkerFilter
instead. - setFirst(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- setFlag(char) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the affix flag
- setIgnoringSinglePrefixOrSuffixShingle(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setIndex(int) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
- setLast(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- setLength(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- setMatrix(ShingleMatrixFilter.Matrix) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMaximumShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMaxShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.Setting maxShingleSize after Analyzer instantiation prevents reuse. Confgure maxShingleSize during construction.
- setMaxShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the max shingle size (default: 2)
- setMinimumShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setMinShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.Setting minShingleSize after Analyzer instantiation prevents reuse. Confgure minShingleSize during construction.
- setMinShingleSize(int) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the min shingle size (default: 2).
- setOutputUnigrams(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.Setting outputUnigrams after Analyzer instantiation prevents reuse. Confgure outputUnigrams during construction.
- setOutputUnigrams(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Shall the output stream contain the input tokens (unigrams) as well as shingles? (default: true.)
- setOutputUnigramsIfNoShingles(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.Setting outputUnigramsIfNoShingles after Analyzer instantiation prevents reuse. Confgure outputUnigramsIfNoShingles during construction.
- setOutputUnigramsIfNoShingles(boolean) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Shall we override the behavior of outputUnigrams==false for those times when no shingles are available (because there are fewer than minShingleSize tokens in the input stream)? (default: false.)
- setPrefix(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- setSpacerCharacter(Character) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.
- setStemDictionary(File) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.This prevents reuse of TokenStreams. If you wish to use a custom stem dictionary, create your own Analyzer with
StemmerOverrideFilter
- setStemDictionary(HashMap<?, ?>) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.Set dictionary for stemming, this dictionary overrules the algorithm, so you can correct for a particular unwanted word-stem pair.
- setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Deprecated.use
BrazilianAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
Deprecated.use
GermanAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Deprecated.use
FrenchAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(File) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.use
DutchAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Deprecated.use
BrazilianAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(String[]) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
Deprecated.use
GermanAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Deprecated.use
FrenchAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(String...) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.use
DutchAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(HashSet<?>) - Method in class org.apache.lucene.analysis.nl.DutchAnalyzer
-
Deprecated.use
DutchAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.br.BrazilianAnalyzer
-
Deprecated.use
BrazilianAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.de.GermanAnalyzer
-
Deprecated.use
GermanAnalyzer(Version, Set, Set)
instead - setStemExclusionTable(Map<?, ?>) - Method in class org.apache.lucene.analysis.fr.FrenchAnalyzer
-
Deprecated.use
FrenchAnalyzer(Version, Set, Set)
instead - setStemmer(GermanStemmer) - Method in class org.apache.lucene.analysis.de.GermanStemFilter
-
Set a alternative/custom
GermanStemmer
for this filter. - setStemmer(FrenchStemmer) - Method in class org.apache.lucene.analysis.fr.FrenchStemFilter
-
Deprecated.Set a alternative/custom
FrenchStemmer
for this filter. - setStemmer(DutchStemmer) - Method in class org.apache.lucene.analysis.nl.DutchStemFilter
-
Deprecated.Set a alternative/custom
DutchStemmer
for this filter. - setStemmer(RussianStemmer) - Method in class org.apache.lucene.analysis.ru.RussianStemFilter
-
Deprecated.Set a alternative/custom
RussianStemmer
for this filter. - setStrip(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellAffix
-
Sets the stripping characters defined for the affix
- setSuffix(TokenStream) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
- setText(char[], int, int) - Method in class org.apache.lucene.analysis.util.CharArrayIterator
-
Set a new region of text to be examined by this iterator
- setToken(Token) - Method in class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.Sets the TokenPositioner as token flags int value.
- setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.Sets information on how a
Token
is to be inserted to aShingleMatrixFilter.Matrix
. - setTokenPositioner(Token, ShingleMatrixFilter.TokenPositioner) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- setTokens(List<Token>) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- setTokenSeparator(String) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Deprecated.Setting tokenSeparator after Analyzer instantiation prevents reuse. Confgure tokenSeparator during construction.
- setTokenSeparator(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Sets the string to use when joining adjacent tokens to form a shingle
- setTokenType(String) - Method in class org.apache.lucene.analysis.shingle.ShingleFilter
-
Set the type of the shingle tokens produced by this filter.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec
-
Deprecated.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.Stores a 32 bit float in the payload, or set it to null if 1f;
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.Have this method do nothing in order to 'disable' weights.
- setWeight(Token, float) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- SHADDA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- ShingleAnalyzerWrapper - Class in org.apache.lucene.analysis.shingle
-
A ShingleAnalyzerWrapper wraps a
ShingleFilter
around anotherAnalyzer
. - ShingleAnalyzerWrapper(Analyzer) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Analyzer, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Analyzer, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Analyzer, int, int, String, boolean, boolean) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Creates a new ShingleAnalyzerWrapper
- ShingleAnalyzerWrapper(Version) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Wraps
StandardAnalyzer
. - ShingleAnalyzerWrapper(Version, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
-
Wraps
StandardAnalyzer
. - ShingleFilter - Class in org.apache.lucene.analysis.shingle
-
A ShingleFilter constructs shingles (token n-grams) from a token stream.
- ShingleFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Construct a ShingleFilter with default shingle size: 2.
- ShingleFilter(TokenStream, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Constructs a ShingleFilter with the specified shingle size from the
TokenStream
input
- ShingleFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Constructs a ShingleFilter with the specified shingle size from the
TokenStream
input
- ShingleFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.shingle.ShingleFilter
-
Construct a ShingleFilter with the specified token type for shingle tokens and the default shingle size: 2
- ShingleMatrixFilter - Class in org.apache.lucene.analysis.shingle
-
Deprecated.Will be removed in Lucene 4.0. This filter is unmaintained and might not behave correctly if used with custom Attributes, i.e. Attributes other than the ones located in
org.apache.lucene.analysis.tokenattributes
. It also uses hardcoded payload encoders which makes it not easily adaptable to other use-cases. - ShingleMatrixFilter(ShingleMatrixFilter.Matrix, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.Creates a shingle filter based on a user defined matrix.
- ShingleMatrixFilter(TokenStream, int, int) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.Creates a shingle filter using default settings.
- ShingleMatrixFilter(TokenStream, int, int, Character, boolean) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.Creates a shingle filter using the default
ShingleMatrixFilter.TokenSettingsCodec
. - ShingleMatrixFilter(TokenStream, int, int, Character, boolean, ShingleMatrixFilter.TokenSettingsCodec) - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.Creates a shingle filter with ad hoc parameter settings.
- ShingleMatrixFilter.Matrix - Class in org.apache.lucene.analysis.shingle
-
Deprecated.A column focused matrix in three dimensions:
- ShingleMatrixFilter.Matrix.Column - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.Matrix.Column.Row - Class in org.apache.lucene.analysis.shingle
-
Deprecated.
- ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.Using this codec makes a
ShingleMatrixFilter
act likeShingleFilter
. - ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.A full featured codec not to be used for something serious.
- ShingleMatrixFilter.TokenPositioner - Class in org.apache.lucene.analysis.shingle
-
Deprecated.Used to describe how a
Token
is to be inserted to aShingleMatrixFilter.Matrix
. - ShingleMatrixFilter.TokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.Strategy used to code and decode meta data of the tokens from the input stream regarding how to position the tokens in the matrix, set and retreive weight, et c.
- ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec - Class in org.apache.lucene.analysis.shingle
-
Deprecated.A codec that creates a two dimensional matrix by treating tokens from the input stream with 0 position increment as new rows to the current column.
- SimpleThreeDimensionalTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec
-
Deprecated.
- SINGLE_TYPE - Static variable in class org.apache.lucene.analysis.cjk.CJKBigramFilter
-
when we emit a unigram, its then marked as this type
- SingleTokenTokenStream - Class in org.apache.lucene.analysis.miscellaneous
-
A
TokenStream
containing a single token. - SingleTokenTokenStream(Token) - Constructor for class org.apache.lucene.analysis.miscellaneous.SingleTokenTokenStream
- size() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- size() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- slice_check() - Method in class org.tartarus.snowball.SnowballProgram
- slice_del() - Method in class org.tartarus.snowball.SnowballProgram
- slice_from(CharSequence) - Method in class org.tartarus.snowball.SnowballProgram
- slice_from(String) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- slice_from(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
-
Deprecated.for binary back compat. Will be removed in Lucene 4.0
- slice_to(StringBuilder) - Method in class org.tartarus.snowball.SnowballProgram
- SnowballAnalyzer - Class in org.apache.lucene.analysis.snowball
-
Deprecated.Use the language-specific analyzer in contrib/analyzers instead. This analyzer will be removed in Lucene 5.0
- SnowballAnalyzer(Version, String) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.Builds the named analyzer with no stop words.
- SnowballAnalyzer(Version, String, String[]) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.Use
SnowballAnalyzer(Version, String, Set)
instead. - SnowballAnalyzer(Version, String, Set<?>) - Constructor for class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.Builds the named analyzer with the given stop words.
- SnowballFilter - Class in org.apache.lucene.analysis.snowball
-
A filter that stems words using a Snowball-generated stemmer.
- SnowballFilter(TokenStream, String) - Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
-
Construct the named stemming filter.
- SnowballFilter(TokenStream, SnowballProgram) - Constructor for class org.apache.lucene.analysis.snowball.SnowballFilter
- SnowballProgram - Class in org.tartarus.snowball
-
This is the rev 502 of the Snowball SVN trunk, but modified: made abstract and introduced abstract method stem to avoid expensive reflection in filter class.
- SnowballProgram() - Constructor for class org.tartarus.snowball.SnowballProgram
- SolrSynonymParser - Class in org.apache.lucene.analysis.synonym
-
Parser for the Solr synonyms format.
- SolrSynonymParser(boolean, boolean, Analyzer) - Constructor for class org.apache.lucene.analysis.synonym.SolrSynonymParser
- SpanishAnalyzer - Class in org.apache.lucene.analysis.es
-
Analyzer
for Spanish. - SpanishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the default stop words:
SpanishAnalyzer.DEFAULT_STOPWORD_FILE
. - SpanishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the given stop words.
- SpanishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.es.SpanishAnalyzer
-
Builds an analyzer with the given stop words.
- SpanishLightStemFilter - Class in org.apache.lucene.analysis.es
-
A
TokenFilter
that appliesSpanishLightStemmer
to stem Spanish words. - SpanishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemFilter
- SpanishLightStemmer - Class in org.apache.lucene.analysis.es
-
Light Stemmer for Spanish
- SpanishLightStemmer() - Constructor for class org.apache.lucene.analysis.es.SpanishLightStemmer
- SpanishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- SpanishStemmer() - Constructor for class org.tartarus.snowball.ext.SpanishStemmer
- START_OF_HEADING_MARKER - Static variable in class org.apache.lucene.analysis.reverse.ReverseStringFilter
-
Example marker character: U+0001 (START OF HEADING)
- startElement(String, String, String, Attributes) - Method in class org.apache.lucene.analysis.compound.hyphenation.PatternParser
- startOffset - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
- startsWith(char[], int, String) - Static method in class org.apache.lucene.analysis.util.StemmerUtil
-
Returns true if the character array starts with the suffix.
- stem() - Method in class org.tartarus.snowball.ext.ArmenianStemmer
- stem() - Method in class org.tartarus.snowball.ext.BasqueStemmer
- stem() - Method in class org.tartarus.snowball.ext.CatalanStemmer
- stem() - Method in class org.tartarus.snowball.ext.DanishStemmer
- stem() - Method in class org.tartarus.snowball.ext.DutchStemmer
- stem() - Method in class org.tartarus.snowball.ext.EnglishStemmer
- stem() - Method in class org.tartarus.snowball.ext.FinnishStemmer
- stem() - Method in class org.tartarus.snowball.ext.FrenchStemmer
- stem() - Method in class org.tartarus.snowball.ext.German2Stemmer
- stem() - Method in class org.tartarus.snowball.ext.GermanStemmer
- stem() - Method in class org.tartarus.snowball.ext.HungarianStemmer
- stem() - Method in class org.tartarus.snowball.ext.IrishStemmer
- stem() - Method in class org.tartarus.snowball.ext.ItalianStemmer
- stem() - Method in class org.tartarus.snowball.ext.KpStemmer
- stem() - Method in class org.tartarus.snowball.ext.LovinsStemmer
- stem() - Method in class org.tartarus.snowball.ext.NorwegianStemmer
- stem() - Method in class org.tartarus.snowball.ext.PorterStemmer
- stem() - Method in class org.tartarus.snowball.ext.PortugueseStemmer
- stem() - Method in class org.tartarus.snowball.ext.RomanianStemmer
- stem() - Method in class org.tartarus.snowball.ext.RussianStemmer
- stem() - Method in class org.tartarus.snowball.ext.SpanishStemmer
- stem() - Method in class org.tartarus.snowball.ext.SwedishStemmer
- stem() - Method in class org.tartarus.snowball.ext.TurkishStemmer
- stem() - Method in class org.tartarus.snowball.SnowballProgram
- stem(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem an input buffer of Arabic text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.bg.BulgarianStemmer
-
Stem an input buffer of Bulgarian text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.cz.CzechStemmer
-
Stem an input buffer of Czech text.
- stem(char[], int) - Method in class org.apache.lucene.analysis.de.GermanLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.de.GermanMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.el.GreekStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.en.EnglishMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.es.SpanishLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.fi.FinnishLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.fr.FrenchLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.fr.FrenchMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.gl.GalicianMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.gl.GalicianStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.hi.HindiStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.hu.HungarianLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Find the stem(s) of the provided word
- stem(char[], int) - Method in class org.apache.lucene.analysis.it.ItalianLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.lv.LatvianStemmer
-
Stem a latvian word.
- stem(char[], int) - Method in class org.apache.lucene.analysis.no.NorwegianLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.no.NorwegianMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseMinimalStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.pt.PortugueseStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.ru.RussianLightStemmer
- stem(char[], int) - Method in class org.apache.lucene.analysis.sv.SwedishLightStemmer
- stem(char[], int, boolean) - Method in class org.apache.lucene.analysis.id.IndonesianStemmer
-
Stem a term (returning its new length).
- stem(String) - Method in class org.apache.lucene.analysis.br.BrazilianStemmer
-
Stems the given term to an unique discriminator.
- stem(String) - Method in class org.apache.lucene.analysis.de.GermanStemmer
-
Stemms the given term to an unique discriminator.
- stem(String) - Method in class org.apache.lucene.analysis.fr.FrenchStemmer
-
Deprecated.Stems the given term to a unique discriminator.
- stem(String) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Find the stem(s) of the provided word
- stem(String) - Method in class org.apache.lucene.analysis.nl.DutchStemmer
-
Deprecated.
- Stem(char[], int) - Constructor for class org.apache.lucene.analysis.hunspell.HunspellStemmer.Stem
-
Creates a new Stem wrapping the given word stem
- StemmerOverrideFilter - Class in org.apache.lucene.analysis.miscellaneous
-
Provides the ability to override any
KeywordAttribute
aware stemmer with custom dictionary-based stemming. - StemmerOverrideFilter(Version, TokenStream, Map<?, String>) - Constructor for class org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter
-
Create a new StemmerOverrideFilter, performing dictionary-based stemming with the provided
dictionary
. - StemmerUtil - Class in org.apache.lucene.analysis.util
-
Some commonly-used stemming functions
- StemmerUtil() - Constructor for class org.apache.lucene.analysis.util.StemmerUtil
- stemPrefix(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem a prefix off an Arabic word.
- stemSuffix(char[], int) - Method in class org.apache.lucene.analysis.ar.ArabicStemmer
-
Stem suffix(es) off an Arabic word.
- Step(String, RSLPStemmerBase.Rule[], int, String[]) - Constructor for class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
-
Create a new step
- STOP_WORDS - Static variable in class org.apache.lucene.analysis.cjk.CJKAnalyzer
-
Deprecated.use
CJKAnalyzer.getDefaultStopSet()
instead - STOP_WORDS - Static variable in class org.apache.lucene.analysis.cn.ChineseFilter
-
Deprecated.
- stoplist - Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
This map stores hyphenation exceptions
- STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.ar.ArabicAnalyzer
-
Deprecated.use
WordlistLoader.getWordSet(Reader, String, Version)
directly - STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.bg.BulgarianAnalyzer
-
Deprecated.use
WordlistLoader.getWordSet(Reader, String, Version)
directly - STOPWORDS_COMMENT - Static variable in class org.apache.lucene.analysis.fa.PersianAnalyzer
-
The comment character in the stopwords file.
- strcmp(char[], int, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Compares 2 null terminated char arrays
- strcmp(String, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Compares a string with null terminated char array
- strcpy(char[], int, char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- strlen(char[]) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- strlen(char[], int) - Static method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
- SUB_HEADING - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- SUB_HEADING_ID - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
- subSequence(int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- substring_i - Variable in class org.tartarus.snowball.Among
- suffix - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Rule
- suffixes - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- suffixes - Variable in class org.apache.lucene.analysis.pt.RSLPStemmerBase.Step
- SUKUN - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- SwedishAnalyzer - Class in org.apache.lucene.analysis.sv
-
Analyzer
for Swedish. - SwedishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the default stop words:
SwedishAnalyzer.DEFAULT_STOPWORD_FILE
. - SwedishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the given stop words.
- SwedishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.sv.SwedishAnalyzer
-
Builds an analyzer with the given stop words.
- SwedishLightStemFilter - Class in org.apache.lucene.analysis.sv
-
A
TokenFilter
that appliesSwedishLightStemmer
to stem Swedish words. - SwedishLightStemFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemFilter
- SwedishLightStemmer - Class in org.apache.lucene.analysis.sv
-
Light Stemmer for Swedish.
- SwedishLightStemmer() - Constructor for class org.apache.lucene.analysis.sv.SwedishLightStemmer
- SwedishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- SwedishStemmer() - Constructor for class org.tartarus.snowball.ext.SwedishStemmer
- SynonymFilter - Class in org.apache.lucene.analysis.synonym
-
Matches single or multi word synonyms in a token stream.
- SynonymFilter(TokenStream, SynonymMap, boolean) - Constructor for class org.apache.lucene.analysis.synonym.SynonymFilter
- SynonymMap - Class in org.apache.lucene.analysis.synonym
-
A map of synonyms, keys and values are phrases.
- SynonymMap(FST<BytesRef>, BytesRefHash, int) - Constructor for class org.apache.lucene.analysis.synonym.SynonymMap
- SynonymMap.Builder - Class in org.apache.lucene.analysis.synonym
-
Builds an FSTSynonymMap.
T
- TATWEEL - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- TEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- TEH_MARBUTA - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- TEH_MARBUTA - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- termAtt - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- termAtt - Variable in class org.apache.lucene.analysis.sinks.DateRecognizerSinkFilter
- TernaryTree - Class in org.apache.lucene.analysis.compound.hyphenation
-
Ternary Search Tree.
- TernaryTree.Iterator - Class in org.apache.lucene.analysis.compound.hyphenation
- TestApp - Class in org.tartarus.snowball
- TestApp() - Constructor for class org.tartarus.snowball.TestApp
- ThaiAnalyzer - Class in org.apache.lucene.analysis.th
-
Analyzer
for Thai language. - ThaiAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.th.ThaiAnalyzer
-
Builds an analyzer with the default stop words.
- ThaiAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.th.ThaiAnalyzer
-
Builds an analyzer with the given stop words.
- ThaiWordFilter - Class in org.apache.lucene.analysis.th
-
TokenFilter
that useBreakIterator
to break each Token that is Thai into separate Token(s) for each Thai word. - ThaiWordFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.th.ThaiWordFilter
-
Deprecated.Use the ctor with
matchVersion
instead! - ThaiWordFilter(Version, TokenStream) - Constructor for class org.apache.lucene.analysis.th.ThaiWordFilter
-
Creates a new ThaiWordFilter with the specified match version.
- toCharArray() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- TOKEN_SEPARATOR - Static variable in class org.apache.lucene.analysis.shingle.ShingleFilter
-
The default string to use when joining adjacent tokens to form a shingle
- TOKEN_TYPES - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
String token types that correspond to token type int constants
- TokenOffsetPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
-
Adds the
Token.setStartOffset(int)
andToken.setEndOffset(int)
First 4 bytes are the start - TokenOffsetPayloadTokenFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.payloads.TokenOffsetPayloadTokenFilter
- TokenRangeSinkFilter - Class in org.apache.lucene.analysis.sinks
-
Counts the tokens as they go by and saves to the internal list those between the range of lower and upper, exclusive of upper
- TokenRangeSinkFilter(int, int) - Constructor for class org.apache.lucene.analysis.sinks.TokenRangeSinkFilter
- tokens - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase
- TOKENS_ONLY - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Only output tokens
- TokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TokenSettingsCodec
-
Deprecated.
- tokenStream(String, Reader) - Method in class org.apache.lucene.analysis.query.QueryAutoStopWordAnalyzer
- tokenStream(String, Reader) - Method in class org.apache.lucene.analysis.shingle.ShingleAnalyzerWrapper
- tokenStream(String, Reader) - Method in class org.apache.lucene.analysis.snowball.SnowballAnalyzer
-
Deprecated.Constructs a
StandardTokenizer
filtered by aStandardFilter
, aLowerCaseFilter
, aStopFilter
, and aSnowballFilter
- TokenTypeSinkFilter - Class in org.apache.lucene.analysis.sinks
-
Adds a token to the sink if it has a specific type.
- TokenTypeSinkFilter(String) - Constructor for class org.apache.lucene.analysis.sinks.TokenTypeSinkFilter
- toString() - Method in class org.apache.lucene.analysis.compound.hyphenation.Hyphen
- toString() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column.Row
-
Deprecated.
- toString() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix.Column
-
Deprecated.
- toString() - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix
-
Deprecated.
- toString() - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- trimToSize() - Method in class org.apache.lucene.analysis.compound.hyphenation.ByteVector
- trimToSize() - Method in class org.apache.lucene.analysis.compound.hyphenation.CharVector
- trimToSize() - Method in class org.apache.lucene.analysis.compound.hyphenation.TernaryTree
-
Each node stores a character (splitchar) which is part of some key(s).
- TurkishAnalyzer - Class in org.apache.lucene.analysis.tr
-
Analyzer
for Turkish. - TurkishAnalyzer(Version) - Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Builds an analyzer with the default stop words:
TurkishAnalyzer.DEFAULT_STOPWORD_FILE
. - TurkishAnalyzer(Version, Set<?>) - Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Builds an analyzer with the given stop words.
- TurkishAnalyzer(Version, Set<?>, Set<?>) - Constructor for class org.apache.lucene.analysis.tr.TurkishAnalyzer
-
Builds an analyzer with the given stop words.
- TurkishLowerCaseFilter - Class in org.apache.lucene.analysis.tr
-
Normalizes Turkish token text to lower case.
- TurkishLowerCaseFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.tr.TurkishLowerCaseFilter
-
Create a new TurkishLowerCaseFilter, that normalizes Turkish token text to lower case.
- TurkishStemmer - Class in org.tartarus.snowball.ext
-
Generated class implementing code defined by a snowball script.
- TurkishStemmer() - Constructor for class org.tartarus.snowball.ext.TurkishStemmer
- TwoDimensionalNonWeightedSynonymTokenSettingsCodec() - Constructor for class org.apache.lucene.analysis.shingle.ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec
-
Deprecated.
- txt - Variable in class org.apache.lucene.analysis.compound.CompoundWordTokenFilterBase.CompoundToken
- TYPE_SYNONYM - Static variable in class org.apache.lucene.analysis.synonym.SynonymFilter
- TypeAsPayloadTokenFilter - Class in org.apache.lucene.analysis.payloads
-
Makes the
Token.type()
a payload. - TypeAsPayloadTokenFilter(TokenStream) - Constructor for class org.apache.lucene.analysis.payloads.TypeAsPayloadTokenFilter
U
- uniqueStems(char[], int) - Method in class org.apache.lucene.analysis.hunspell.HunspellStemmer
-
Find the unique stem(s) of the provided word
- unpackValues(int) - Method in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
- unsafeWrite(char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- unsafeWrite(char[], int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- unsafeWrite(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- UNTOKENIZED_ONLY - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Only output untokenized tokens, which are tokens that would normally be split into several tokens
- UNTOKENIZED_TOKEN_FLAG - Static variable in class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
This flag is used to indicate that the produced "Token" would, if
WikipediaTokenizer.TOKENS_ONLY
was used, produce multiple tokens. - updateInputToken(Token, Token) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
- updateSuffixToken(Token, Token) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAndSuffixAwareTokenFilter
- updateSuffixToken(Token, Token) - Method in class org.apache.lucene.analysis.miscellaneous.PrefixAwareTokenFilter
-
The default implementation adds last prefix token end offset to the suffix token start and end offsets.
- updateToken(Token, List<Token>, int, List<ShingleMatrixFilter.Matrix.Column.Row>, List<Token>) - Method in class org.apache.lucene.analysis.shingle.ShingleMatrixFilter
-
Deprecated.Final touch of a shingle token before it is passed on to the consumer from method
ShingleMatrixFilter.incrementToken()
.
V
- valueOf(String) - Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenFilter.Side
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.apache.lucene.analysis.ngram.EdgeNGramTokenizer.Side
-
Returns an array containing the constants of this enum type, in the order they are declared.
- vspace - Variable in class org.apache.lucene.analysis.compound.hyphenation.HyphenationTree
-
value space: stores the interletter values
W
- WAW - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- WHITESPACE_PATTERN - Static variable in class org.apache.lucene.analysis.miscellaneous.PatternAnalyzer
-
"\\s+"
; Divides text at whitespaces (Character.isWhitespace(c)) - WikipediaTokenizer - Class in org.apache.lucene.analysis.wikipedia
-
Extension of StandardTokenizer that is aware of Wikipedia syntax.
- WikipediaTokenizer(Reader) - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Creates a new instance of the
WikipediaTokenizer
. - WikipediaTokenizer(Reader, int, Set<String>) - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Creates a new instance of the
WikipediaTokenizer
. - WikipediaTokenizer(AttributeSource.AttributeFactory, Reader, int, Set<String>) - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Creates a new instance of the
WikipediaTokenizer
. - WikipediaTokenizer(AttributeSource, Reader, int, Set<String>) - Constructor for class org.apache.lucene.analysis.wikipedia.WikipediaTokenizer
-
Creates a new instance of the
WikipediaTokenizer
. - WORD_SEPARATOR - Static variable in class org.apache.lucene.analysis.synonym.SynonymMap
-
for multiword support, you must separate words with this separator
- WordnetSynonymParser - Class in org.apache.lucene.analysis.synonym
-
Parser for wordnet prolog format
- WordnetSynonymParser(boolean, boolean, Analyzer) - Constructor for class org.apache.lucene.analysis.synonym.WordnetSynonymParser
- words - Variable in class org.apache.lucene.analysis.synonym.SynonymMap
-
map
- write(char) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(char[]) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(char[], int, int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(int) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(String) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
- write(OpenStringBuilder) - Method in class org.apache.lucene.analysis.util.OpenStringBuilder
Y
- YEH - Static variable in class org.apache.lucene.analysis.ar.ArabicNormalizer
- YEH - Static variable in class org.apache.lucene.analysis.ar.ArabicStemmer
- YEH - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
- YEH_BARREE - Static variable in class org.apache.lucene.analysis.fa.PersianNormalizer
All Classes All Packages