Class ArabicLetterTokenizer

    • Nested Class Summary

      • Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource

        org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State
    • Field Summary

      • Fields inherited from class org.apache.lucene.analysis.Tokenizer

        input
    • Method Summary

      All Methods Instance Methods Concrete Methods Deprecated Methods 
      Modifier and Type Method Description
      protected boolean isTokenChar​(int c)
      Deprecated.
      Allows for Letter category or NonspacingMark category
      • Methods inherited from class org.apache.lucene.analysis.CharTokenizer

        end, incrementToken, isTokenChar, normalize, normalize, reset
      • Methods inherited from class org.apache.lucene.analysis.Tokenizer

        close, correctOffset
      • Methods inherited from class org.apache.lucene.analysis.TokenStream

        reset
      • Methods inherited from class org.apache.lucene.util.AttributeSource

        addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
    • Constructor Detail

      • ArabicLetterTokenizer

        public ArabicLetterTokenizer​(org.apache.lucene.util.Version matchVersion,
                                     Reader in)
        Deprecated.
        Construct a new ArabicLetterTokenizer.
        Parameters:
        matchVersion - Lucene version to match See {@link above}
        in - the input to split up into tokens
      • ArabicLetterTokenizer

        public ArabicLetterTokenizer​(org.apache.lucene.util.Version matchVersion,
                                     org.apache.lucene.util.AttributeSource source,
                                     Reader in)
        Deprecated.
        Construct a new ArabicLetterTokenizer using a given AttributeSource.
        Parameters:
        matchVersion - Lucene version to match See {@link above}
        source - the attribute source to use for this Tokenizer
        in - the input to split up into tokens
      • ArabicLetterTokenizer

        public ArabicLetterTokenizer​(org.apache.lucene.util.Version matchVersion,
                                     org.apache.lucene.util.AttributeSource.AttributeFactory factory,
                                     Reader in)
        Deprecated.
        Construct a new ArabicLetterTokenizer using a given AttributeSource.AttributeFactory. * @param matchVersion Lucene version to match See {@link above}
        Parameters:
        factory - the attribute factory to use for this Tokenizer
        in - the input to split up into tokens
    • Method Detail

      • isTokenChar

        protected boolean isTokenChar​(int c)
        Deprecated.
        Allows for Letter category or NonspacingMark category
        Overrides:
        isTokenChar in class org.apache.lucene.analysis.LetterTokenizer
        See Also:
        LetterTokenizer.isTokenChar(int)