Package org.apache.lucene.analysis.miscellaneous
Miscellaneous TokenStreams
-
Class Summary Class Description EmptyTokenStream An always exhausted token stream.PatternAnalyzer Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than aReader
, that can flexibly separate text into terms via a regular expressionPattern
(with behaviour identical toString.split(String)
), and that combines the functionality ofLetterTokenizer
,LowerCaseTokenizer
,WhitespaceTokenizer
,StopFilter
into a single efficient multi-purpose class.PrefixAndSuffixAwareTokenFilter Links twoPrefixAwareTokenFilter
.PrefixAwareTokenFilter Joins two token streams and leaves the last token of the first stream available to be used when updating the token values in the second stream based on that token.SingleTokenTokenStream ATokenStream
containing a single token.StemmerOverrideFilter Provides the ability to override anyKeywordAttribute
aware stemmer with custom dictionary-based stemming.