Uses of Class
org.apache.lucene.analysis.TokenStream
-
Packages that use TokenStream Package Description org.apache.lucene.analysis Text analysis.org.apache.lucene.analysis.ar Analyzer for Arabic.org.apache.lucene.analysis.bg Analyzer for Bulgarian.org.apache.lucene.analysis.bn Analyzer for Bengali Language.org.apache.lucene.analysis.boost Provides various convenience classes for creating boosts on Tokens.org.apache.lucene.analysis.br Analyzer for Brazilian Portuguese.org.apache.lucene.analysis.ca Analyzer for Catalan.org.apache.lucene.analysis.cjk Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.org.apache.lucene.analysis.ckb Analyzer for Sorani Kurdish.org.apache.lucene.analysis.cn.smart Analyzer for Simplified Chinese, which indexes words.org.apache.lucene.analysis.commongrams Construct n-grams for frequently occurring terms and phrases.org.apache.lucene.analysis.compound A filter that decomposes compound words you find in many Germanic languages into the word parts.org.apache.lucene.analysis.core Basic, general-purpose analysis components.org.apache.lucene.analysis.custom A general-purpose Analyzer that can be created with a builder-style API.org.apache.lucene.analysis.cz Analyzer for Czech.org.apache.lucene.analysis.da Analyzer for Danish.org.apache.lucene.analysis.de Analyzer for German.org.apache.lucene.analysis.el Analyzer for Greek.org.apache.lucene.analysis.en Analyzer for English.org.apache.lucene.analysis.es Analyzer for Spanish.org.apache.lucene.analysis.et Analyzer for Estonian.org.apache.lucene.analysis.eu Analyzer for Basque.org.apache.lucene.analysis.fa Analyzer for Persian.org.apache.lucene.analysis.fi Analyzer for Finnish.org.apache.lucene.analysis.fr Analyzer for French.org.apache.lucene.analysis.ga Analyzer for Irish.org.apache.lucene.analysis.gl Analyzer for Galician.org.apache.lucene.analysis.hi Analyzer for Hindi.org.apache.lucene.analysis.hu Analyzer for Hungarian.org.apache.lucene.analysis.hunspell Stemming TokenFilter using a Java implementation of the Hunspell stemming algorithm.org.apache.lucene.analysis.hy Analyzer for Armenian.org.apache.lucene.analysis.icu Analysis components based on ICUorg.apache.lucene.analysis.icu.segmentation Tokenizer that breaks text into words with the Unicode Text Segmentation algorithm.org.apache.lucene.analysis.id Analyzer for Indonesian.org.apache.lucene.analysis.in Analyzer for Indian languages.org.apache.lucene.analysis.it Analyzer for Italian.org.apache.lucene.analysis.ja Analyzer for Japanese.org.apache.lucene.analysis.ko Analyzer for Korean.org.apache.lucene.analysis.lt Analyzer for Lithuanian.org.apache.lucene.analysis.lv Analyzer for Latvian.org.apache.lucene.analysis.minhash MinHash filtering (for LSH).org.apache.lucene.analysis.miscellaneous Miscellaneous Tokenstreams.org.apache.lucene.analysis.ngram Character n-gram tokenizers and filters.org.apache.lucene.analysis.nl Analyzer for Dutch.org.apache.lucene.analysis.no Analyzer for Norwegian.org.apache.lucene.analysis.path Analysis components for path-like strings such as filenames.org.apache.lucene.analysis.pattern Set of components for pattern-based (regex) analysis.org.apache.lucene.analysis.payloads Provides various convenience classes for creating payloads on Tokens.org.apache.lucene.analysis.phonetic Analysis components for phonetic search.org.apache.lucene.analysis.pl Analyzer for Polish.org.apache.lucene.analysis.pt Analyzer for Portuguese.org.apache.lucene.analysis.reverse Filter to reverse token text.org.apache.lucene.analysis.ro Analyzer for Romanian.org.apache.lucene.analysis.ru Analyzer for Russian.org.apache.lucene.analysis.shingle Word n-gram filters.org.apache.lucene.analysis.sinks org.apache.lucene.analysis.snowball TokenFilterandAnalyzerimplementations that use Snowball stemmers.org.apache.lucene.analysis.sr Analyzer for Serbian.org.apache.lucene.analysis.standard Fast, general-purpose grammar-based tokenizerStandardTokenizerimplements the Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.org.apache.lucene.analysis.stempel Stempel: Algorithmic Stemmerorg.apache.lucene.analysis.sv Analyzer for Swedish.org.apache.lucene.analysis.synonym Analysis components for Synonyms.org.apache.lucene.analysis.th Analyzer for Thai.org.apache.lucene.analysis.tr Analyzer for Turkish.org.apache.lucene.analysis.util Utility functions for text analysis.org.apache.lucene.analysis.wikipedia Tokenizer that is aware of Wikipedia syntax.org.apache.lucene.classification.document Uses already seen data (the indexed documents) to classify new documents.org.apache.lucene.codecs Codecs API: API for customization of the encoding and structure of the index.org.apache.lucene.document The logical representation of aDocumentfor indexing and searching.org.apache.lucene.index Code to maintain and access indices.org.apache.lucene.index.memory High-performance single-document main memory Apache Lucene fulltext search index.org.apache.lucene.monitor Monitoring frameworkorg.apache.lucene.search Code to search indices.org.apache.lucene.search.highlight Highlighting search terms.org.apache.lucene.search.suggest.analyzing Analyzer based autosuggest.org.apache.lucene.search.suggest.document Support for document suggestionorg.apache.lucene.search.uhighlight The UnifiedHighlighter -- a flexible highlighter that can get offsets from postings, term vectors, or analysis.org.apache.lucene.util Some utility classes.org.apache.lucene.util.graph Utility classes for working with token streams as graphs. -
-
Uses of TokenStream in org.apache.lucene.analysis
Subclasses of TokenStream in org.apache.lucene.analysis Modifier and Type Class Description private static classAnalyzer.StringTokenStreamclassCachingTokenFilterThis class can be used if the token attributes of a TokenStream are intended to be consumed more than once.classFilteringTokenFilterAbstract base class for TokenFilters that may remove tokens.classGraphTokenFilterAn abstract TokenFilter that exposes its input stream as a graph CallGraphTokenFilter.incrementBaseToken()to move the root of the graph to the next position in the TokenStream,GraphTokenFilter.incrementGraphToken()to move along the current graph, andGraphTokenFilter.incrementGraph()to reset to the next graph based at the current root.classLowerCaseFilterNormalizes token text to lower case.classStopFilterRemoves stop words from a token stream.classTokenFilterA TokenFilter is a TokenStream whose input is another TokenStream.classTokenizerA Tokenizer is a TokenStream whose input is a Reader.Fields in org.apache.lucene.analysis declared as TokenStream Modifier and Type Field Description protected TokenStreamTokenFilter. inputThe source of tokens for this filter.protected TokenStreamAnalyzer.TokenStreamComponents. sinkSink tokenstream, such as the outer tokenfilter decorating the chain.Methods in org.apache.lucene.analysis that return TokenStream Modifier and Type Method Description TokenStreamAnalyzer.TokenStreamComponents. getTokenStream()Returns the sinkTokenStreamprotected TokenStreamAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Wrap the givenTokenStreamin order to apply normalization filters.protected TokenStreamAnalyzerWrapper. normalize(java.lang.String fieldName, TokenStream in)TokenStreamAnalyzer. tokenStream(java.lang.String fieldName, java.io.Reader reader)Returns a TokenStream suitable forfieldName, tokenizing the contents ofreader.TokenStreamAnalyzer. tokenStream(java.lang.String fieldName, java.lang.String text)Returns a TokenStream suitable forfieldName, tokenizing the contents oftext.protected TokenStreamAnalyzerWrapper. wrapTokenStreamForNormalization(java.lang.String fieldName, TokenStream in)Wraps / alters the given TokenStream for normalization purposes, taken from the wrapped Analyzer, to form new components.protected TokenStreamDelegatingAnalyzerWrapper. wrapTokenStreamForNormalization(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Wrap the givenTokenStreamin order to apply normalization filters.protected TokenStreamAnalyzerWrapper. normalize(java.lang.String fieldName, TokenStream in)AutomatonTokenStreamToAutomaton. toAutomaton(TokenStream in)Pulls the graph (includingPositionLengthAttribute) from the providedTokenStream, and creates the corresponding automaton where arcs are bytes (or Unicode code points if unicodeArcs = true) from each term.protected TokenStreamAnalyzerWrapper. wrapTokenStreamForNormalization(java.lang.String fieldName, TokenStream in)Wraps / alters the given TokenStream for normalization purposes, taken from the wrapped Analyzer, to form new components.protected TokenStreamDelegatingAnalyzerWrapper. wrapTokenStreamForNormalization(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis with parameters of type TokenStream Constructor Description CachingTokenFilter(TokenStream input)Create a new CachingTokenFilter aroundinput.FilteringTokenFilter(TokenStream in)Create a newFilteringTokenFilter.GraphTokenFilter(TokenStream input)Create a new GraphTokenFilterLowerCaseFilter(TokenStream in)Create a new LowerCaseFilter, that normalizes token text to lower case.StopFilter(TokenStream in, CharArraySet stopWords)Constructs a filter which removes words from the input TokenStream that are named in the Set.TokenFilter(TokenStream input)Construct a token stream filtering the given input.TokenStreamComponents(java.util.function.Consumer<java.io.Reader> source, TokenStream result)Creates a newAnalyzer.TokenStreamComponentsinstance.TokenStreamComponents(Tokenizer tokenizer, TokenStream result)Creates a newAnalyzer.TokenStreamComponentsinstance -
Uses of TokenStream in org.apache.lucene.analysis.ar
Subclasses of TokenStream in org.apache.lucene.analysis.ar Modifier and Type Class Description classArabicNormalizationFilterATokenFilterthat appliesArabicNormalizerto normalize the orthography.classArabicStemFilterATokenFilterthat appliesArabicStemmerto stem Arabic words..Methods in org.apache.lucene.analysis.ar that return TokenStream Modifier and Type Method Description TokenStreamArabicNormalizationFilterFactory. create(TokenStream input)protected TokenStreamArabicAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamArabicNormalizationFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.ar with parameters of type TokenStream Modifier and Type Method Description TokenStreamArabicNormalizationFilterFactory. create(TokenStream input)ArabicStemFilterArabicStemFilterFactory. create(TokenStream input)protected TokenStreamArabicAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamArabicNormalizationFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.ar with parameters of type TokenStream Constructor Description ArabicNormalizationFilter(TokenStream input)ArabicStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.bg
Subclasses of TokenStream in org.apache.lucene.analysis.bg Modifier and Type Class Description classBulgarianStemFilterATokenFilterthat appliesBulgarianStemmerto stem Bulgarian words.Methods in org.apache.lucene.analysis.bg that return TokenStream Modifier and Type Method Description TokenStreamBulgarianStemFilterFactory. create(TokenStream input)protected TokenStreamBulgarianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.bg with parameters of type TokenStream Modifier and Type Method Description TokenStreamBulgarianStemFilterFactory. create(TokenStream input)protected TokenStreamBulgarianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.bg with parameters of type TokenStream Constructor Description BulgarianStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.bn
Subclasses of TokenStream in org.apache.lucene.analysis.bn Modifier and Type Class Description classBengaliNormalizationFilterATokenFilterthat appliesBengaliNormalizerto normalize the orthography.classBengaliStemFilterATokenFilterthat appliesBengaliStemmerto stem Bengali words.Methods in org.apache.lucene.analysis.bn that return TokenStream Modifier and Type Method Description TokenStreamBengaliNormalizationFilterFactory. create(TokenStream input)TokenStreamBengaliStemFilterFactory. create(TokenStream input)protected TokenStreamBengaliAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamBengaliNormalizationFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.bn with parameters of type TokenStream Modifier and Type Method Description TokenStreamBengaliNormalizationFilterFactory. create(TokenStream input)TokenStreamBengaliStemFilterFactory. create(TokenStream input)protected TokenStreamBengaliAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamBengaliNormalizationFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.bn with parameters of type TokenStream Constructor Description BengaliNormalizationFilter(TokenStream input)BengaliStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.boost
Subclasses of TokenStream in org.apache.lucene.analysis.boost Modifier and Type Class Description classDelimitedBoostTokenFilterCharacters before the delimiter are the "token", those after are the boost.Methods in org.apache.lucene.analysis.boost with parameters of type TokenStream Modifier and Type Method Description DelimitedBoostTokenFilterDelimitedBoostTokenFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.boost with parameters of type TokenStream Constructor Description DelimitedBoostTokenFilter(TokenStream input, char delimiter) -
Uses of TokenStream in org.apache.lucene.analysis.br
Subclasses of TokenStream in org.apache.lucene.analysis.br Modifier and Type Class Description classBrazilianStemFilterATokenFilterthat appliesBrazilianStemmer.Methods in org.apache.lucene.analysis.br that return TokenStream Modifier and Type Method Description protected TokenStreamBrazilianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.br with parameters of type TokenStream Modifier and Type Method Description BrazilianStemFilterBrazilianStemFilterFactory. create(TokenStream in)protected TokenStreamBrazilianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.br with parameters of type TokenStream Constructor Description BrazilianStemFilter(TokenStream in)Creates a new BrazilianStemFilter -
Uses of TokenStream in org.apache.lucene.analysis.ca
Methods in org.apache.lucene.analysis.ca that return TokenStream Modifier and Type Method Description protected TokenStreamCatalanAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.ca with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamCatalanAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.cjk
Subclasses of TokenStream in org.apache.lucene.analysis.cjk Modifier and Type Class Description classCJKBigramFilterForms bigrams of CJK terms that are generated from StandardTokenizer or ICUTokenizer.classCJKWidthFilterATokenFilterthat normalizes CJK width differences: Folds fullwidth ASCII variants into the equivalent basic latin Folds halfwidth Katakana variants into the equivalent kanaMethods in org.apache.lucene.analysis.cjk that return TokenStream Modifier and Type Method Description TokenStreamCJKBigramFilterFactory. create(TokenStream input)TokenStreamCJKWidthFilterFactory. create(TokenStream input)protected TokenStreamCJKAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamCJKWidthFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.cjk with parameters of type TokenStream Modifier and Type Method Description TokenStreamCJKBigramFilterFactory. create(TokenStream input)TokenStreamCJKWidthFilterFactory. create(TokenStream input)protected TokenStreamCJKAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamCJKWidthFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.cjk with parameters of type TokenStream Constructor Description CJKBigramFilter(TokenStream in)CJKBigramFilter(TokenStream in, int flags)CJKBigramFilter(TokenStream in, int flags, boolean outputUnigrams)Create a new CJKBigramFilter, specifying which writing systems should be bigrammed, and whether or not unigrams should also be output.CJKWidthFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.ckb
Subclasses of TokenStream in org.apache.lucene.analysis.ckb Modifier and Type Class Description classSoraniNormalizationFilterATokenFilterthat appliesSoraniNormalizerto normalize the orthography.classSoraniStemFilterATokenFilterthat appliesSoraniStemmerto stem Sorani words.Methods in org.apache.lucene.analysis.ckb that return TokenStream Modifier and Type Method Description TokenStreamSoraniNormalizationFilterFactory. create(TokenStream input)protected TokenStreamSoraniAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamSoraniNormalizationFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.ckb with parameters of type TokenStream Modifier and Type Method Description TokenStreamSoraniNormalizationFilterFactory. create(TokenStream input)SoraniStemFilterSoraniStemFilterFactory. create(TokenStream input)protected TokenStreamSoraniAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamSoraniNormalizationFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.ckb with parameters of type TokenStream Constructor Description SoraniNormalizationFilter(TokenStream input)SoraniStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.cn.smart
Subclasses of TokenStream in org.apache.lucene.analysis.cn.smart Modifier and Type Class Description classHMMChineseTokenizerTokenizer for Chinese or mixed Chinese-English text.Methods in org.apache.lucene.analysis.cn.smart that return TokenStream Modifier and Type Method Description protected TokenStreamSmartChineseAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.cn.smart with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamSmartChineseAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.commongrams
Subclasses of TokenStream in org.apache.lucene.analysis.commongrams Modifier and Type Class Description classCommonGramsFilterConstruct bigrams for frequently occurring terms while indexing.classCommonGramsQueryFilterWrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram.Methods in org.apache.lucene.analysis.commongrams with parameters of type TokenStream Modifier and Type Method Description TokenFilterCommonGramsFilterFactory. create(TokenStream input)TokenFilterCommonGramsQueryFilterFactory. create(TokenStream input)Create a CommonGramsFilter and wrap it with a CommonGramsQueryFilterConstructors in org.apache.lucene.analysis.commongrams with parameters of type TokenStream Constructor Description CommonGramsFilter(TokenStream input, CharArraySet commonWords)Construct a token stream filtering the given input using a Set of common words to create bigrams. -
Uses of TokenStream in org.apache.lucene.analysis.compound
Subclasses of TokenStream in org.apache.lucene.analysis.compound Modifier and Type Class Description classCompoundWordTokenFilterBaseBase class for decomposition token filters.classDictionaryCompoundWordTokenFilterATokenFilterthat decomposes compound words found in many Germanic languages.classHyphenationCompoundWordTokenFilterATokenFilterthat decomposes compound words found in many Germanic languages.Methods in org.apache.lucene.analysis.compound that return TokenStream Modifier and Type Method Description TokenStreamDictionaryCompoundWordTokenFilterFactory. create(TokenStream input)Methods in org.apache.lucene.analysis.compound with parameters of type TokenStream Modifier and Type Method Description TokenStreamDictionaryCompoundWordTokenFilterFactory. create(TokenStream input)TokenFilterHyphenationCompoundWordTokenFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.compound with parameters of type TokenStream Constructor Description CompoundWordTokenFilterBase(TokenStream input, CharArraySet dictionary)CompoundWordTokenFilterBase(TokenStream input, CharArraySet dictionary, boolean onlyLongestMatch)CompoundWordTokenFilterBase(TokenStream input, CharArraySet dictionary, int minWordSize, int minSubwordSize, int maxSubwordSize, boolean onlyLongestMatch)DictionaryCompoundWordTokenFilter(TokenStream input, CharArraySet dictionary)Creates a newDictionaryCompoundWordTokenFilterDictionaryCompoundWordTokenFilter(TokenStream input, CharArraySet dictionary, int minWordSize, int minSubwordSize, int maxSubwordSize, boolean onlyLongestMatch)Creates a newDictionaryCompoundWordTokenFilterHyphenationCompoundWordTokenFilter(TokenStream input, HyphenationTree hyphenator)Create a HyphenationCompoundWordTokenFilter with no dictionary.HyphenationCompoundWordTokenFilter(TokenStream input, HyphenationTree hyphenator, int minWordSize, int minSubwordSize, int maxSubwordSize)Create a HyphenationCompoundWordTokenFilter with no dictionary.HyphenationCompoundWordTokenFilter(TokenStream input, HyphenationTree hyphenator, CharArraySet dictionary)Creates a newHyphenationCompoundWordTokenFilterinstance.HyphenationCompoundWordTokenFilter(TokenStream input, HyphenationTree hyphenator, CharArraySet dictionary, int minWordSize, int minSubwordSize, int maxSubwordSize, boolean onlyLongestMatch)Creates a newHyphenationCompoundWordTokenFilterinstance. -
Uses of TokenStream in org.apache.lucene.analysis.core
Subclasses of TokenStream in org.apache.lucene.analysis.core Modifier and Type Class Description classDecimalDigitFilterFolds all Unicode digits in[:General_Category=Decimal_Number:]to Basic Latin digits (0-9).classFlattenGraphFilterConverts an incoming graph token stream, such as one fromSynonymGraphFilter, into a flat form so that all nodes form a single linear chain with no side paths.classKeywordTokenizerEmits the entire input as a single token.classLetterTokenizerA LetterTokenizer is a tokenizer that divides text at non-letters.classLowerCaseFilterNormalizes token text to lower case.classStopFilterRemoves stop words from a token stream.classTypeTokenFilterRemoves tokens whose types appear in a set of blocked types from a token stream.classUnicodeWhitespaceTokenizerA UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace.classUpperCaseFilterNormalizes token text to UPPER CASE.classWhitespaceTokenizerA tokenizer that divides text at whitespace characters as defined byCharacter.isWhitespace(int).Methods in org.apache.lucene.analysis.core that return TokenStream Modifier and Type Method Description TokenStreamDecimalDigitFilterFactory. create(TokenStream input)TokenStreamFlattenGraphFilterFactory. create(TokenStream input)TokenStreamLowerCaseFilterFactory. create(TokenStream input)TokenStreamStopFilterFactory. create(TokenStream input)TokenStreamTypeTokenFilterFactory. create(TokenStream input)TokenStreamUpperCaseFilterFactory. create(TokenStream input)TokenStreamDecimalDigitFilterFactory. normalize(TokenStream input)TokenStreamLowerCaseFilterFactory. normalize(TokenStream input)protected TokenStreamSimpleAnalyzer. normalize(java.lang.String fieldName, TokenStream in)protected TokenStreamStopAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamUpperCaseFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.core with parameters of type TokenStream Modifier and Type Method Description TokenStreamDecimalDigitFilterFactory. create(TokenStream input)TokenStreamFlattenGraphFilterFactory. create(TokenStream input)TokenStreamLowerCaseFilterFactory. create(TokenStream input)TokenStreamStopFilterFactory. create(TokenStream input)TokenStreamTypeTokenFilterFactory. create(TokenStream input)TokenStreamUpperCaseFilterFactory. create(TokenStream input)TokenStreamDecimalDigitFilterFactory. normalize(TokenStream input)TokenStreamLowerCaseFilterFactory. normalize(TokenStream input)protected TokenStreamSimpleAnalyzer. normalize(java.lang.String fieldName, TokenStream in)protected TokenStreamStopAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamUpperCaseFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.core with parameters of type TokenStream Constructor Description DecimalDigitFilter(TokenStream input)Creates a new DecimalDigitFilter overinputFlattenGraphFilter(TokenStream in)LowerCaseFilter(TokenStream in)Create a new LowerCaseFilter, that normalizes token text to lower case.StopFilter(TokenStream in, CharArraySet stopWords)Constructs a filter which removes words from the input TokenStream that are named in the Set.TypeTokenFilter(TokenStream input, java.util.Set<java.lang.String> stopTypes)Create a newTypeTokenFilterthat filters tokens out (useWhiteList=false).TypeTokenFilter(TokenStream input, java.util.Set<java.lang.String> stopTypes, boolean useWhiteList)Create a newTypeTokenFilter.UpperCaseFilter(TokenStream in)Create a new UpperCaseFilter, that normalizes token text to upper case. -
Uses of TokenStream in org.apache.lucene.analysis.custom
Methods in org.apache.lucene.analysis.custom that return TokenStream Modifier and Type Method Description protected TokenStreamCustomAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.custom with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamCustomAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.cz
Subclasses of TokenStream in org.apache.lucene.analysis.cz Modifier and Type Class Description classCzechStemFilterATokenFilterthat appliesCzechStemmerto stem Czech words.Methods in org.apache.lucene.analysis.cz that return TokenStream Modifier and Type Method Description TokenStreamCzechStemFilterFactory. create(TokenStream input)protected TokenStreamCzechAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.cz with parameters of type TokenStream Modifier and Type Method Description TokenStreamCzechStemFilterFactory. create(TokenStream input)protected TokenStreamCzechAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.cz with parameters of type TokenStream Constructor Description CzechStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.da
Methods in org.apache.lucene.analysis.da that return TokenStream Modifier and Type Method Description protected TokenStreamDanishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.da with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamDanishAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.de
Subclasses of TokenStream in org.apache.lucene.analysis.de Modifier and Type Class Description classGermanLightStemFilterATokenFilterthat appliesGermanLightStemmerto stem German words.classGermanMinimalStemFilterATokenFilterthat appliesGermanMinimalStemmerto stem German words.classGermanNormalizationFilterNormalizes German characters according to the heuristics of the German2 snowball algorithm.classGermanStemFilterATokenFilterthat stems German words.Methods in org.apache.lucene.analysis.de that return TokenStream Modifier and Type Method Description TokenStreamGermanLightStemFilterFactory. create(TokenStream input)TokenStreamGermanMinimalStemFilterFactory. create(TokenStream input)TokenStreamGermanNormalizationFilterFactory. create(TokenStream input)protected TokenStreamGermanAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamGermanNormalizationFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.de with parameters of type TokenStream Modifier and Type Method Description TokenStreamGermanLightStemFilterFactory. create(TokenStream input)TokenStreamGermanMinimalStemFilterFactory. create(TokenStream input)TokenStreamGermanNormalizationFilterFactory. create(TokenStream input)GermanStemFilterGermanStemFilterFactory. create(TokenStream in)protected TokenStreamGermanAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamGermanNormalizationFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.de with parameters of type TokenStream Constructor Description GermanLightStemFilter(TokenStream input)GermanMinimalStemFilter(TokenStream input)GermanNormalizationFilter(TokenStream input)GermanStemFilter(TokenStream in)Creates aGermanStemFilterinstance -
Uses of TokenStream in org.apache.lucene.analysis.el
Subclasses of TokenStream in org.apache.lucene.analysis.el Modifier and Type Class Description classGreekLowerCaseFilterNormalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.classGreekStemFilterATokenFilterthat appliesGreekStemmerto stem Greek words.Methods in org.apache.lucene.analysis.el that return TokenStream Modifier and Type Method Description TokenStreamGreekLowerCaseFilterFactory. create(TokenStream in)TokenStreamGreekStemFilterFactory. create(TokenStream input)protected TokenStreamGreekAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamGreekLowerCaseFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.el with parameters of type TokenStream Modifier and Type Method Description TokenStreamGreekLowerCaseFilterFactory. create(TokenStream in)TokenStreamGreekStemFilterFactory. create(TokenStream input)protected TokenStreamGreekAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamGreekLowerCaseFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.el with parameters of type TokenStream Constructor Description GreekLowerCaseFilter(TokenStream in)Create a GreekLowerCaseFilter that normalizes Greek token text.GreekStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.en
Subclasses of TokenStream in org.apache.lucene.analysis.en Modifier and Type Class Description classEnglishMinimalStemFilterATokenFilterthat appliesEnglishMinimalStemmerto stem English words.classEnglishPossessiveFilterTokenFilter that removes possessives (trailing 's) from words.classKStemFilterA high-performance kstem filter for english.classPorterStemFilterTransforms the token stream as per the Porter stemming algorithm.Methods in org.apache.lucene.analysis.en that return TokenStream Modifier and Type Method Description TokenStreamEnglishMinimalStemFilterFactory. create(TokenStream input)TokenStreamEnglishPossessiveFilterFactory. create(TokenStream input)protected TokenStreamEnglishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.en with parameters of type TokenStream Modifier and Type Method Description TokenStreamEnglishMinimalStemFilterFactory. create(TokenStream input)TokenStreamEnglishPossessiveFilterFactory. create(TokenStream input)TokenFilterKStemFilterFactory. create(TokenStream input)PorterStemFilterPorterStemFilterFactory. create(TokenStream input)protected TokenStreamEnglishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.en with parameters of type TokenStream Constructor Description EnglishMinimalStemFilter(TokenStream input)EnglishPossessiveFilter(TokenStream input)KStemFilter(TokenStream in)PorterStemFilter(TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.es
Subclasses of TokenStream in org.apache.lucene.analysis.es Modifier and Type Class Description classSpanishLightStemFilterATokenFilterthat appliesSpanishLightStemmerto stem Spanish words.classSpanishMinimalStemFilterATokenFilterthat appliesSpanishMinimalStemmerto stem Spanish words.Methods in org.apache.lucene.analysis.es that return TokenStream Modifier and Type Method Description TokenStreamSpanishLightStemFilterFactory. create(TokenStream input)TokenStreamSpanishMinimalStemFilterFactory. create(TokenStream input)protected TokenStreamSpanishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.es with parameters of type TokenStream Modifier and Type Method Description TokenStreamSpanishLightStemFilterFactory. create(TokenStream input)TokenStreamSpanishMinimalStemFilterFactory. create(TokenStream input)protected TokenStreamSpanishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.es with parameters of type TokenStream Constructor Description SpanishLightStemFilter(TokenStream input)SpanishMinimalStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.et
Methods in org.apache.lucene.analysis.et that return TokenStream Modifier and Type Method Description protected TokenStreamEstonianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.et with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamEstonianAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.eu
Methods in org.apache.lucene.analysis.eu that return TokenStream Modifier and Type Method Description protected TokenStreamBasqueAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.eu with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamBasqueAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.fa
Subclasses of TokenStream in org.apache.lucene.analysis.fa Modifier and Type Class Description classPersianNormalizationFilterATokenFilterthat appliesPersianNormalizerto normalize the orthography.Methods in org.apache.lucene.analysis.fa that return TokenStream Modifier and Type Method Description TokenStreamPersianNormalizationFilterFactory. create(TokenStream input)protected TokenStreamPersianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamPersianNormalizationFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.fa with parameters of type TokenStream Modifier and Type Method Description TokenStreamPersianNormalizationFilterFactory. create(TokenStream input)protected TokenStreamPersianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamPersianNormalizationFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.fa with parameters of type TokenStream Constructor Description PersianNormalizationFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.fi
Subclasses of TokenStream in org.apache.lucene.analysis.fi Modifier and Type Class Description classFinnishLightStemFilterATokenFilterthat appliesFinnishLightStemmerto stem Finnish words.Methods in org.apache.lucene.analysis.fi that return TokenStream Modifier and Type Method Description TokenStreamFinnishLightStemFilterFactory. create(TokenStream input)protected TokenStreamFinnishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.fi with parameters of type TokenStream Modifier and Type Method Description TokenStreamFinnishLightStemFilterFactory. create(TokenStream input)protected TokenStreamFinnishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.fi with parameters of type TokenStream Constructor Description FinnishLightStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.fr
Subclasses of TokenStream in org.apache.lucene.analysis.fr Modifier and Type Class Description classFrenchLightStemFilterATokenFilterthat appliesFrenchLightStemmerto stem French words.classFrenchMinimalStemFilterATokenFilterthat appliesFrenchMinimalStemmerto stem French words.Methods in org.apache.lucene.analysis.fr that return TokenStream Modifier and Type Method Description TokenStreamFrenchLightStemFilterFactory. create(TokenStream input)TokenStreamFrenchMinimalStemFilterFactory. create(TokenStream input)protected TokenStreamFrenchAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.fr with parameters of type TokenStream Modifier and Type Method Description TokenStreamFrenchLightStemFilterFactory. create(TokenStream input)TokenStreamFrenchMinimalStemFilterFactory. create(TokenStream input)protected TokenStreamFrenchAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.fr with parameters of type TokenStream Constructor Description FrenchLightStemFilter(TokenStream input)FrenchMinimalStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.ga
Subclasses of TokenStream in org.apache.lucene.analysis.ga Modifier and Type Class Description classIrishLowerCaseFilterNormalises token text to lower case, handling t-prothesis and n-eclipsis (i.e., that 'nAthair' should become 'n-athair')Methods in org.apache.lucene.analysis.ga that return TokenStream Modifier and Type Method Description TokenStreamIrishLowerCaseFilterFactory. create(TokenStream input)protected TokenStreamIrishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamIrishLowerCaseFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.ga with parameters of type TokenStream Modifier and Type Method Description TokenStreamIrishLowerCaseFilterFactory. create(TokenStream input)protected TokenStreamIrishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamIrishLowerCaseFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.ga with parameters of type TokenStream Constructor Description IrishLowerCaseFilter(TokenStream in)Create an IrishLowerCaseFilter that normalises Irish token text. -
Uses of TokenStream in org.apache.lucene.analysis.gl
Subclasses of TokenStream in org.apache.lucene.analysis.gl Modifier and Type Class Description classGalicianMinimalStemFilterATokenFilterthat appliesGalicianMinimalStemmerto stem Galician words.classGalicianStemFilterATokenFilterthat appliesGalicianStemmerto stem Galician words.Methods in org.apache.lucene.analysis.gl that return TokenStream Modifier and Type Method Description TokenStreamGalicianMinimalStemFilterFactory. create(TokenStream input)TokenStreamGalicianStemFilterFactory. create(TokenStream input)protected TokenStreamGalicianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.gl with parameters of type TokenStream Modifier and Type Method Description TokenStreamGalicianMinimalStemFilterFactory. create(TokenStream input)TokenStreamGalicianStemFilterFactory. create(TokenStream input)protected TokenStreamGalicianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.gl with parameters of type TokenStream Constructor Description GalicianMinimalStemFilter(TokenStream input)GalicianStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.hi
Subclasses of TokenStream in org.apache.lucene.analysis.hi Modifier and Type Class Description classHindiNormalizationFilterATokenFilterthat appliesHindiNormalizerto normalize the orthography.classHindiStemFilterATokenFilterthat appliesHindiStemmerto stem Hindi words.Methods in org.apache.lucene.analysis.hi that return TokenStream Modifier and Type Method Description TokenStreamHindiNormalizationFilterFactory. create(TokenStream input)TokenStreamHindiStemFilterFactory. create(TokenStream input)protected TokenStreamHindiAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamHindiNormalizationFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.hi with parameters of type TokenStream Modifier and Type Method Description TokenStreamHindiNormalizationFilterFactory. create(TokenStream input)TokenStreamHindiStemFilterFactory. create(TokenStream input)protected TokenStreamHindiAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamHindiNormalizationFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.hi with parameters of type TokenStream Constructor Description HindiNormalizationFilter(TokenStream input)HindiStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.hu
Subclasses of TokenStream in org.apache.lucene.analysis.hu Modifier and Type Class Description classHungarianLightStemFilterATokenFilterthat appliesHungarianLightStemmerto stem Hungarian words.Methods in org.apache.lucene.analysis.hu that return TokenStream Modifier and Type Method Description TokenStreamHungarianLightStemFilterFactory. create(TokenStream input)protected TokenStreamHungarianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.hu with parameters of type TokenStream Modifier and Type Method Description TokenStreamHungarianLightStemFilterFactory. create(TokenStream input)protected TokenStreamHungarianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.hu with parameters of type TokenStream Constructor Description HungarianLightStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.hunspell
Subclasses of TokenStream in org.apache.lucene.analysis.hunspell Modifier and Type Class Description classHunspellStemFilterTokenFilter that uses hunspell affix rules and words to stem tokens.Methods in org.apache.lucene.analysis.hunspell that return TokenStream Modifier and Type Method Description TokenStreamHunspellStemFilterFactory. create(TokenStream tokenStream)Methods in org.apache.lucene.analysis.hunspell with parameters of type TokenStream Modifier and Type Method Description TokenStreamHunspellStemFilterFactory. create(TokenStream tokenStream)Constructors in org.apache.lucene.analysis.hunspell with parameters of type TokenStream Constructor Description HunspellStemFilter(TokenStream input, Dictionary dictionary)Create aHunspellStemFilteroutputting all possible stems.HunspellStemFilter(TokenStream input, Dictionary dictionary, boolean dedup)Create aHunspellStemFilteroutputting all possible stems.HunspellStemFilter(TokenStream input, Dictionary dictionary, boolean dedup, boolean longestOnly)Creates a new HunspellStemFilter that will stem tokens from the given TokenStream using affix rules in the provided Dictionary -
Uses of TokenStream in org.apache.lucene.analysis.hy
Methods in org.apache.lucene.analysis.hy that return TokenStream Modifier and Type Method Description protected TokenStreamArmenianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.hy with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamArmenianAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.icu
Subclasses of TokenStream in org.apache.lucene.analysis.icu Modifier and Type Class Description classICUFoldingFilterA TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.classICUNormalizer2FilterNormalize token text with ICU'sNormalizer2classICUTransformFilterATokenFilterthat transforms text with ICU.Methods in org.apache.lucene.analysis.icu that return TokenStream Modifier and Type Method Description TokenStreamICUFoldingFilterFactory. create(TokenStream input)TokenStreamICUNormalizer2FilterFactory. create(TokenStream input)TokenStreamICUTransformFilterFactory. create(TokenStream input)TokenStreamICUFoldingFilterFactory. normalize(TokenStream input)TokenStreamICUNormalizer2FilterFactory. normalize(TokenStream input)TokenStreamICUTransformFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.icu with parameters of type TokenStream Modifier and Type Method Description TokenStreamICUFoldingFilterFactory. create(TokenStream input)TokenStreamICUNormalizer2FilterFactory. create(TokenStream input)TokenStreamICUTransformFilterFactory. create(TokenStream input)TokenStreamICUFoldingFilterFactory. normalize(TokenStream input)TokenStreamICUNormalizer2FilterFactory. normalize(TokenStream input)TokenStreamICUTransformFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.icu with parameters of type TokenStream Constructor Description ICUFoldingFilter(TokenStream input)Create a new ICUFoldingFilter on the specified inputICUFoldingFilter(TokenStream input, com.ibm.icu.text.Normalizer2 normalizer)Create a new ICUFoldingFilter on the specified input with the specified normalizerICUNormalizer2Filter(TokenStream input)Create a new Normalizer2Filter that combines NFKC normalization, Case Folding, and removes Default Ignorables (NFKC_Casefold)ICUNormalizer2Filter(TokenStream input, com.ibm.icu.text.Normalizer2 normalizer)Create a new Normalizer2Filter with the specified Normalizer2ICUTransformFilter(TokenStream input, com.ibm.icu.text.Transliterator transform)Create a new ICUTransformFilter that transforms text on the given stream. -
Uses of TokenStream in org.apache.lucene.analysis.icu.segmentation
Subclasses of TokenStream in org.apache.lucene.analysis.icu.segmentation Modifier and Type Class Description classICUTokenizerBreaks text into words according to UAX #29: Unicode Text Segmentation (http://www.unicode.org/reports/tr29/) -
Uses of TokenStream in org.apache.lucene.analysis.id
Subclasses of TokenStream in org.apache.lucene.analysis.id Modifier and Type Class Description classIndonesianStemFilterATokenFilterthat appliesIndonesianStemmerto stem Indonesian words.Methods in org.apache.lucene.analysis.id that return TokenStream Modifier and Type Method Description TokenStreamIndonesianStemFilterFactory. create(TokenStream input)protected TokenStreamIndonesianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.id with parameters of type TokenStream Modifier and Type Method Description TokenStreamIndonesianStemFilterFactory. create(TokenStream input)protected TokenStreamIndonesianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.id with parameters of type TokenStream Constructor Description IndonesianStemFilter(TokenStream input)IndonesianStemFilter(TokenStream input, boolean stemDerivational)Create a new IndonesianStemFilter. -
Uses of TokenStream in org.apache.lucene.analysis.in
Subclasses of TokenStream in org.apache.lucene.analysis.in Modifier and Type Class Description classIndicNormalizationFilterATokenFilterthat appliesIndicNormalizerto normalize text in Indian Languages.Methods in org.apache.lucene.analysis.in that return TokenStream Modifier and Type Method Description TokenStreamIndicNormalizationFilterFactory. create(TokenStream input)TokenStreamIndicNormalizationFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.in with parameters of type TokenStream Modifier and Type Method Description TokenStreamIndicNormalizationFilterFactory. create(TokenStream input)TokenStreamIndicNormalizationFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.in with parameters of type TokenStream Constructor Description IndicNormalizationFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.it
Subclasses of TokenStream in org.apache.lucene.analysis.it Modifier and Type Class Description classItalianLightStemFilterATokenFilterthat appliesItalianLightStemmerto stem Italian words.Methods in org.apache.lucene.analysis.it that return TokenStream Modifier and Type Method Description TokenStreamItalianLightStemFilterFactory. create(TokenStream input)protected TokenStreamItalianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.it with parameters of type TokenStream Modifier and Type Method Description TokenStreamItalianLightStemFilterFactory. create(TokenStream input)protected TokenStreamItalianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.it with parameters of type TokenStream Constructor Description ItalianLightStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.ja
Subclasses of TokenStream in org.apache.lucene.analysis.ja Modifier and Type Class Description classJapaneseBaseFormFilterReplaces term text with theBaseFormAttribute.classJapaneseKatakanaStemFilterATokenFilterthat normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).classJapaneseNumberFilterATokenFilterthat normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.classJapanesePartOfSpeechStopFilterRemoves tokens that match a set of part-of-speech tags.classJapaneseReadingFormFilterATokenFilterthat replaces the term attribute with the reading of a token in either katakana or romaji form.classJapaneseTokenizerTokenizer for Japanese that uses morphological analysis.Methods in org.apache.lucene.analysis.ja that return TokenStream Modifier and Type Method Description TokenStreamJapaneseBaseFormFilterFactory. create(TokenStream input)TokenStreamJapaneseKatakanaStemFilterFactory. create(TokenStream input)TokenStreamJapaneseNumberFilterFactory. create(TokenStream input)TokenStreamJapanesePartOfSpeechStopFilterFactory. create(TokenStream stream)TokenStreamJapaneseReadingFormFilterFactory. create(TokenStream input)protected TokenStreamJapaneseAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.ja with parameters of type TokenStream Modifier and Type Method Description TokenStreamJapaneseBaseFormFilterFactory. create(TokenStream input)TokenStreamJapaneseKatakanaStemFilterFactory. create(TokenStream input)TokenStreamJapaneseNumberFilterFactory. create(TokenStream input)TokenStreamJapanesePartOfSpeechStopFilterFactory. create(TokenStream stream)TokenStreamJapaneseReadingFormFilterFactory. create(TokenStream input)protected TokenStreamJapaneseAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.ja with parameters of type TokenStream Constructor Description JapaneseBaseFormFilter(TokenStream input)JapaneseKatakanaStemFilter(TokenStream input)JapaneseKatakanaStemFilter(TokenStream input, int minimumLength)JapaneseNumberFilter(TokenStream input)JapanesePartOfSpeechStopFilter(TokenStream input, java.util.Set<java.lang.String> stopTags)Create a newJapanesePartOfSpeechStopFilter.JapaneseReadingFormFilter(TokenStream input)JapaneseReadingFormFilter(TokenStream input, boolean useRomaji) -
Uses of TokenStream in org.apache.lucene.analysis.ko
Subclasses of TokenStream in org.apache.lucene.analysis.ko Modifier and Type Class Description classKoreanNumberFilterATokenFilterthat normalizes Korean numbers to regular Arabic decimal numbers in half-width characters.classKoreanPartOfSpeechStopFilterRemoves tokens that match a set of part-of-speech tags.classKoreanReadingFormFilterReplaces term text with theReadingAttributewhich is the Hangul transcription of Hanja characters.classKoreanTokenizerTokenizer for Korean that uses morphological analysis.Methods in org.apache.lucene.analysis.ko that return TokenStream Modifier and Type Method Description TokenStreamKoreanNumberFilterFactory. create(TokenStream input)TokenStreamKoreanPartOfSpeechStopFilterFactory. create(TokenStream stream)TokenStreamKoreanReadingFormFilterFactory. create(TokenStream input)protected TokenStreamKoreanAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.ko with parameters of type TokenStream Modifier and Type Method Description TokenStreamKoreanNumberFilterFactory. create(TokenStream input)TokenStreamKoreanPartOfSpeechStopFilterFactory. create(TokenStream stream)TokenStreamKoreanReadingFormFilterFactory. create(TokenStream input)protected TokenStreamKoreanAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.ko with parameters of type TokenStream Constructor Description KoreanNumberFilter(TokenStream input)KoreanPartOfSpeechStopFilter(TokenStream input)Create a newKoreanPartOfSpeechStopFilterwith the default list of stop tagsKoreanPartOfSpeechStopFilter.DEFAULT_STOP_TAGS.KoreanPartOfSpeechStopFilter(TokenStream input, java.util.Set<POS.Tag> stopTags)Create a newKoreanPartOfSpeechStopFilter.KoreanReadingFormFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.lt
Methods in org.apache.lucene.analysis.lt that return TokenStream Modifier and Type Method Description protected TokenStreamLithuanianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.lt with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamLithuanianAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.lv
Subclasses of TokenStream in org.apache.lucene.analysis.lv Modifier and Type Class Description classLatvianStemFilterATokenFilterthat appliesLatvianStemmerto stem Latvian words.Methods in org.apache.lucene.analysis.lv that return TokenStream Modifier and Type Method Description TokenStreamLatvianStemFilterFactory. create(TokenStream input)protected TokenStreamLatvianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.lv with parameters of type TokenStream Modifier and Type Method Description TokenStreamLatvianStemFilterFactory. create(TokenStream input)protected TokenStreamLatvianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.lv with parameters of type TokenStream Constructor Description LatvianStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.minhash
Subclasses of TokenStream in org.apache.lucene.analysis.minhash Modifier and Type Class Description classMinHashFilterGenerate min hash tokens from an incoming stream of tokens.Methods in org.apache.lucene.analysis.minhash that return TokenStream Modifier and Type Method Description TokenStreamMinHashFilterFactory. create(TokenStream input)Methods in org.apache.lucene.analysis.minhash with parameters of type TokenStream Modifier and Type Method Description TokenStreamMinHashFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.minhash with parameters of type TokenStream Constructor Description MinHashFilter(TokenStream input, int hashCount, int bucketCount, int hashSetSize, boolean withRotation)create a MinHash filter -
Uses of TokenStream in org.apache.lucene.analysis.miscellaneous
Subclasses of TokenStream in org.apache.lucene.analysis.miscellaneous Modifier and Type Class Description classASCIIFoldingFilterThis class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists.classCapitalizationFilterA filter to apply normal capitalization rules to Tokens.classCodepointCountFilterRemoves words that are too long or too short from the stream.classConcatenateGraphFilterConcatenates/Joins every incoming token with a separator into one output token for every path through the token stream (which is a graph).classConcatenatingTokenStreamA TokenStream that takes an array of input TokenStreams as sources, and concatenates them together.classConditionalTokenFilterAllows skipping TokenFilters based on the current set of attributes.private classConditionalTokenFilter.OneTimeWrapperclassDateRecognizerFilterFilters all tokens that cannot be parsed to a date, using the providedDateFormat.classDelimitedTermFrequencyTokenFilterCharacters before the delimiter are the "token", the textual integer after is the term frequency.classEmptyTokenStreamAn always exhausted token stream.classFingerprintFilterFilter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens.classFixBrokenOffsetsFilterDeprecated.Fix the token filters that create broken offsets in the first place.classHyphenatedWordsFilterWhen the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.classKeepWordFilterA TokenFilter that only keeps tokens with text contained in the required words.classKeywordMarkerFilterMarks terms as keywords via theKeywordAttribute.classKeywordRepeatFilterThis TokenFilter emits each incoming token twice once as keyword and once non-keyword, in other words once withKeywordAttribute.setKeyword(boolean)set totrueand once set tofalse.classLengthFilterRemoves words that are too long or too short from the stream.classLimitTokenCountFilterThis TokenFilter limits the number of tokens while indexing.classLimitTokenOffsetFilterLets all tokens pass through until it sees one with a start offset <= a configured limit, which won't pass and ends the stream.classLimitTokenPositionFilterThis TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit.classPatternKeywordMarkerFilterMarks terms as keywords via theKeywordAttribute.classProtectedTermFilterA ConditionalTokenFilter that only applies its wrapped filters to tokens that are not contained in a protected set.classRemoveDuplicatesTokenFilterA TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.classScandinavianFoldingFilterThis filter folds Scandinavian characters åÅäæÄÆ->a and öÖøØ->o.classScandinavianNormalizationFilterThis filter normalize use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ.classSetKeywordMarkerFilterMarks terms as keywords via theKeywordAttribute.classStemmerOverrideFilterProvides the ability to override anyKeywordAttributeaware stemmer with custom dictionary-based stemming.classTrimFilterTrims leading and trailing whitespace from Tokens in the stream.classTruncateTokenFilterA token filter for truncating the terms into a specific length.classTypeAsSynonymFilterAdds theTypeAttribute.type()as a synonym, i.e.classWordDelimiterFilterDeprecated.UseWordDelimiterGraphFilterinstead: it produces a correct token graph so that e.g.classWordDelimiterGraphFilterSplits words into subwords and performs optional transformations on subword groups, producing a correct token graph so that e.g.Fields in org.apache.lucene.analysis.miscellaneous declared as TokenStream Modifier and Type Field Description private TokenStreamConditionalTokenFilter. delegateprivate TokenStreamConcatenateGraphFilter. inputTokenStreamprivate TokenStream[]ConcatenatingTokenStream. sourcesMethods in org.apache.lucene.analysis.miscellaneous that return TokenStream Modifier and Type Method Description TokenStreamASCIIFoldingFilterFactory. create(TokenStream input)TokenStreamConcatenateGraphFilterFactory. create(TokenStream input)TokenStreamConditionalTokenFilterFactory. create(TokenStream input)TokenStreamDateRecognizerFilterFactory. create(TokenStream input)TokenStreamFingerprintFilterFactory. create(TokenStream input)TokenStreamFixBrokenOffsetsFilterFactory. create(TokenStream input)TokenStreamKeepWordFilterFactory. create(TokenStream input)TokenStreamKeywordMarkerFilterFactory. create(TokenStream input)TokenStreamKeywordRepeatFilterFactory. create(TokenStream input)TokenStreamLimitTokenCountFilterFactory. create(TokenStream input)TokenStreamLimitTokenOffsetFilterFactory. create(TokenStream input)TokenStreamLimitTokenPositionFilterFactory. create(TokenStream input)TokenStreamScandinavianFoldingFilterFactory. create(TokenStream input)TokenStreamStemmerOverrideFilterFactory. create(TokenStream input)TokenStreamTrimFilterFactory. create(TokenStream input)TokenStreamTruncateTokenFilterFactory. create(TokenStream input)TokenStreamTypeAsSynonymFilterFactory. create(TokenStream input)TokenStreamASCIIFoldingFilterFactory. normalize(TokenStream input)TokenStreamScandinavianFoldingFilterFactory. normalize(TokenStream input)TokenStreamScandinavianNormalizationFilterFactory. normalize(TokenStream input)TokenStreamTrimFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.miscellaneous with parameters of type TokenStream Modifier and Type Method Description private static AttributeSourceConcatenatingTokenStream. combineSources(TokenStream... sources)TokenStreamASCIIFoldingFilterFactory. create(TokenStream input)CapitalizationFilterCapitalizationFilterFactory. create(TokenStream input)CodepointCountFilterCodepointCountFilterFactory. create(TokenStream input)TokenStreamConcatenateGraphFilterFactory. create(TokenStream input)TokenStreamConditionalTokenFilterFactory. create(TokenStream input)protected abstract ConditionalTokenFilterConditionalTokenFilterFactory. create(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inner)Modify the incomingTokenStreamwith aConditionalTokenFilterTokenStreamDateRecognizerFilterFactory. create(TokenStream input)DelimitedTermFrequencyTokenFilterDelimitedTermFrequencyTokenFilterFactory. create(TokenStream input)TokenStreamFingerprintFilterFactory. create(TokenStream input)TokenStreamFixBrokenOffsetsFilterFactory. create(TokenStream input)HyphenatedWordsFilterHyphenatedWordsFilterFactory. create(TokenStream input)TokenStreamKeepWordFilterFactory. create(TokenStream input)TokenStreamKeywordMarkerFilterFactory. create(TokenStream input)TokenStreamKeywordRepeatFilterFactory. create(TokenStream input)LengthFilterLengthFilterFactory. create(TokenStream input)TokenStreamLimitTokenCountFilterFactory. create(TokenStream input)TokenStreamLimitTokenOffsetFilterFactory. create(TokenStream input)TokenStreamLimitTokenPositionFilterFactory. create(TokenStream input)protected ConditionalTokenFilterProtectedTermFilterFactory. create(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inner)RemoveDuplicatesTokenFilterRemoveDuplicatesTokenFilterFactory. create(TokenStream input)TokenStreamScandinavianFoldingFilterFactory. create(TokenStream input)ScandinavianNormalizationFilterScandinavianNormalizationFilterFactory. create(TokenStream input)TokenStreamStemmerOverrideFilterFactory. create(TokenStream input)TokenStreamTrimFilterFactory. create(TokenStream input)TokenStreamTruncateTokenFilterFactory. create(TokenStream input)TokenStreamTypeAsSynonymFilterFactory. create(TokenStream input)TokenFilterWordDelimiterFilterFactory. create(TokenStream input)Deprecated.TokenFilterWordDelimiterGraphFilterFactory. create(TokenStream input)TokenStreamASCIIFoldingFilterFactory. normalize(TokenStream input)TokenStreamScandinavianFoldingFilterFactory. normalize(TokenStream input)TokenStreamScandinavianNormalizationFilterFactory. normalize(TokenStream input)TokenStreamTrimFilterFactory. normalize(TokenStream input)Method parameters in org.apache.lucene.analysis.miscellaneous with type arguments of type TokenStream Modifier and Type Method Description protected abstract ConditionalTokenFilterConditionalTokenFilterFactory. create(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inner)Modify the incomingTokenStreamwith aConditionalTokenFilterprotected abstract ConditionalTokenFilterConditionalTokenFilterFactory. create(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inner)Modify the incomingTokenStreamwith aConditionalTokenFilterprotected ConditionalTokenFilterProtectedTermFilterFactory. create(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inner)protected ConditionalTokenFilterProtectedTermFilterFactory. create(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inner)Constructors in org.apache.lucene.analysis.miscellaneous with parameters of type TokenStream Constructor Description ASCIIFoldingFilter(TokenStream input)ASCIIFoldingFilter(TokenStream input, boolean preserveOriginal)Create a newASCIIFoldingFilter.CapitalizationFilter(TokenStream in)Creates a CapitalizationFilter with the default parameters.CapitalizationFilter(TokenStream in, boolean onlyFirstWord, CharArraySet keep, boolean forceFirstLetter, java.util.Collection<char[]> okPrefix, int minWordLength, int maxWordCount, int maxTokenLength)Creates a CapitalizationFilter with the specified parameters.CodepointCountFilter(TokenStream in, int min, int max)Create a newCodepointCountFilter.ConcatenateGraphFilter(TokenStream inputTokenStream)Creates a token stream to convertinputto a token stream of accepted strings by its token stream graph.ConcatenateGraphFilter(TokenStream inputTokenStream, boolean preserveSep, boolean preservePositionIncrements, int maxGraphExpansions)ConcatenateGraphFilter(TokenStream inputTokenStream, java.lang.Character tokenSeparator, boolean preservePositionIncrements, int maxGraphExpansions)Creates a token stream to convertinputto a token stream of accepted strings by its token stream graph.ConcatenatingTokenStream(TokenStream... sources)Create a new ConcatenatingTokenStream from a set of inputsConditionalTokenFilter(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inputFactory)Create a new ConditionalTokenFilterDateRecognizerFilter(TokenStream input)UsesDateFormat.DEFAULTandLocale.ENGLISHto create aDateFormatinstance.DateRecognizerFilter(TokenStream input, java.text.DateFormat dateFormat)DelimitedTermFrequencyTokenFilter(TokenStream input)DelimitedTermFrequencyTokenFilter(TokenStream input, char delimiter)FingerprintFilter(TokenStream input)Create a new FingerprintFilter with default settingsFingerprintFilter(TokenStream input, int maxOutputTokenSize, char separator)Create a new FingerprintFilter with control over all settingsFixBrokenOffsetsFilter(TokenStream in)Deprecated.HyphenatedWordsFilter(TokenStream in)Creates a new HyphenatedWordsFilterKeepWordFilter(TokenStream in, CharArraySet words)Create a newKeepWordFilter.KeywordMarkerFilter(TokenStream in)Creates a newKeywordMarkerFilterKeywordRepeatFilter(TokenStream input)Construct a token stream filtering the given input.LengthFilter(TokenStream in, int min, int max)Create a newLengthFilter.LimitTokenCountFilter(TokenStream in, int maxTokenCount)Build a filter that only accepts tokens up to a maximum number.LimitTokenCountFilter(TokenStream in, int maxTokenCount, boolean consumeAllTokens)Build an filter that limits the maximum number of tokens per field.LimitTokenOffsetFilter(TokenStream input, int maxStartOffset)Lets all tokens pass through until it sees one with a start offset <=maxStartOffsetwhich won't pass and ends the stream.LimitTokenOffsetFilter(TokenStream input, int maxStartOffset, boolean consumeAllTokens)LimitTokenPositionFilter(TokenStream in, int maxTokenPosition)Build a filter that only accepts tokens up to and including the given maximum position.LimitTokenPositionFilter(TokenStream in, int maxTokenPosition, boolean consumeAllTokens)Build a filter that limits the maximum position of tokens to emit.PatternKeywordMarkerFilter(TokenStream in, java.util.regex.Pattern pattern)Create a newPatternKeywordMarkerFilter, that marks the current token as a keyword if the tokens term buffer matches the providedPatternvia theKeywordAttribute.ProtectedTermFilter(CharArraySet protectedTerms, TokenStream input, java.util.function.Function<TokenStream,TokenStream> inputFactory)Creates a new ProtectedTermFilterRemoveDuplicatesTokenFilter(TokenStream in)Creates a new RemoveDuplicatesTokenFilterScandinavianFoldingFilter(TokenStream input)ScandinavianNormalizationFilter(TokenStream input)SetKeywordMarkerFilter(TokenStream in, CharArraySet keywordSet)Create a new KeywordSetMarkerFilter, that marks the current token as a keyword if the tokens term buffer is contained in the given set via theKeywordAttribute.StemmerOverrideFilter(TokenStream input, StemmerOverrideFilter.StemmerOverrideMap stemmerOverrideMap)Create a new StemmerOverrideFilter, performing dictionary-based stemming with the provideddictionary.TrimFilter(TokenStream in)Create a newTrimFilter.TruncateTokenFilter(TokenStream input, int length)TypeAsSynonymFilter(TokenStream input)TypeAsSynonymFilter(TokenStream input, java.lang.String prefix)WordDelimiterFilter(TokenStream in, byte[] charTypeTable, int configurationFlags, CharArraySet protWords)Deprecated.Creates a new WordDelimiterFilterWordDelimiterFilter(TokenStream in, int configurationFlags, CharArraySet protWords)Deprecated.Creates a new WordDelimiterFilter usingWordDelimiterIterator.DEFAULT_WORD_DELIM_TABLEas its charTypeTableWordDelimiterGraphFilter(TokenStream in, boolean adjustInternalOffsets, byte[] charTypeTable, int configurationFlags, CharArraySet protWords)Creates a new WordDelimiterGraphFilterWordDelimiterGraphFilter(TokenStream in, int configurationFlags, CharArraySet protWords)Creates a new WordDelimiterGraphFilter usingWordDelimiterIterator.DEFAULT_WORD_DELIM_TABLEas its charTypeTableConstructor parameters in org.apache.lucene.analysis.miscellaneous with type arguments of type TokenStream Constructor Description ConditionalTokenFilter(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inputFactory)Create a new ConditionalTokenFilterConditionalTokenFilter(TokenStream input, java.util.function.Function<TokenStream,TokenStream> inputFactory)Create a new ConditionalTokenFilterProtectedTermFilter(CharArraySet protectedTerms, TokenStream input, java.util.function.Function<TokenStream,TokenStream> inputFactory)Creates a new ProtectedTermFilterProtectedTermFilter(CharArraySet protectedTerms, TokenStream input, java.util.function.Function<TokenStream,TokenStream> inputFactory)Creates a new ProtectedTermFilter -
Uses of TokenStream in org.apache.lucene.analysis.ngram
Subclasses of TokenStream in org.apache.lucene.analysis.ngram Modifier and Type Class Description classEdgeNGramTokenFilterTokenizes the given token into n-grams of given size(s).classEdgeNGramTokenizerTokenizes the input from an edge into n-grams of given size(s).classNGramTokenFilterTokenizes the input into n-grams of the given size(s).classNGramTokenizerTokenizes the input into n-grams of the given size(s).Methods in org.apache.lucene.analysis.ngram with parameters of type TokenStream Modifier and Type Method Description TokenFilterEdgeNGramFilterFactory. create(TokenStream input)TokenFilterNGramFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.ngram with parameters of type TokenStream Constructor Description EdgeNGramTokenFilter(TokenStream input, int gramSize)Creates an EdgeNGramTokenFilter that produces edge n-grams of the given size.EdgeNGramTokenFilter(TokenStream input, int minGram, int maxGram, boolean preserveOriginal)Creates an EdgeNGramTokenFilter that, for a given input term, produces all edge n-grams with lengths >= minGram and <= maxGram.NGramTokenFilter(TokenStream input, int gramSize)Creates an NGramTokenFilter that produces n-grams of the indicated size.NGramTokenFilter(TokenStream input, int minGram, int maxGram, boolean preserveOriginal)Creates an NGramTokenFilter that, for a given input term, produces all contained n-grams with lengths >= minGram and <= maxGram. -
Uses of TokenStream in org.apache.lucene.analysis.nl
Methods in org.apache.lucene.analysis.nl that return TokenStream Modifier and Type Method Description protected TokenStreamDutchAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.nl with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamDutchAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.no
Subclasses of TokenStream in org.apache.lucene.analysis.no Modifier and Type Class Description classNorwegianLightStemFilterATokenFilterthat appliesNorwegianLightStemmerto stem Norwegian words.classNorwegianMinimalStemFilterATokenFilterthat appliesNorwegianMinimalStemmerto stem Norwegian words.Methods in org.apache.lucene.analysis.no that return TokenStream Modifier and Type Method Description TokenStreamNorwegianLightStemFilterFactory. create(TokenStream input)TokenStreamNorwegianMinimalStemFilterFactory. create(TokenStream input)protected TokenStreamNorwegianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.no with parameters of type TokenStream Modifier and Type Method Description TokenStreamNorwegianLightStemFilterFactory. create(TokenStream input)TokenStreamNorwegianMinimalStemFilterFactory. create(TokenStream input)protected TokenStreamNorwegianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.no with parameters of type TokenStream Constructor Description NorwegianLightStemFilter(TokenStream input)NorwegianLightStemFilter(TokenStream input, int flags)Creates a new NorwegianLightStemFilterNorwegianMinimalStemFilter(TokenStream input)NorwegianMinimalStemFilter(TokenStream input, int flags)Creates a new NorwegianLightStemFilter -
Uses of TokenStream in org.apache.lucene.analysis.path
Subclasses of TokenStream in org.apache.lucene.analysis.path Modifier and Type Class Description classPathHierarchyTokenizerTokenizer for path-like hierarchies.classReversePathHierarchyTokenizerTokenizer for domain-like hierarchies. -
Uses of TokenStream in org.apache.lucene.analysis.pattern
Subclasses of TokenStream in org.apache.lucene.analysis.pattern Modifier and Type Class Description classPatternCaptureGroupTokenFilterCaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns.classPatternReplaceFilterA TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string.classPatternTokenizerThis tokenizer uses regex pattern matching to construct distinct tokens for the input stream.classSimplePatternSplitTokenizerclassSimplePatternTokenizerMethods in org.apache.lucene.analysis.pattern with parameters of type TokenStream Modifier and Type Method Description PatternCaptureGroupTokenFilterPatternCaptureGroupFilterFactory. create(TokenStream input)PatternReplaceFilterPatternReplaceFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.pattern with parameters of type TokenStream Constructor Description PatternCaptureGroupTokenFilter(TokenStream input, boolean preserveOriginal, java.util.regex.Pattern... patterns)PatternReplaceFilter(TokenStream in, java.util.regex.Pattern p, java.lang.String replacement, boolean all)Constructs an instance to replace either the first, or all occurrences -
Uses of TokenStream in org.apache.lucene.analysis.payloads
Subclasses of TokenStream in org.apache.lucene.analysis.payloads Modifier and Type Class Description classDelimitedPayloadTokenFilterCharacters before the delimiter are the "token", those after are the payload.classNumericPayloadTokenFilterAssigns a payload to a token based on theTypeAttributeclassTokenOffsetPayloadTokenFilterAdds theOffsetAttribute.startOffset()andOffsetAttribute.endOffset()First 4 bytes are the startclassTypeAsPayloadTokenFilterMakes theTypeAttributea payload.Methods in org.apache.lucene.analysis.payloads with parameters of type TokenStream Modifier and Type Method Description DelimitedPayloadTokenFilterDelimitedPayloadTokenFilterFactory. create(TokenStream input)NumericPayloadTokenFilterNumericPayloadTokenFilterFactory. create(TokenStream input)TokenOffsetPayloadTokenFilterTokenOffsetPayloadTokenFilterFactory. create(TokenStream input)TypeAsPayloadTokenFilterTypeAsPayloadTokenFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.payloads with parameters of type TokenStream Constructor Description DelimitedPayloadTokenFilter(TokenStream input, char delimiter, PayloadEncoder encoder)NumericPayloadTokenFilter(TokenStream input, float payload, java.lang.String typeMatch)TokenOffsetPayloadTokenFilter(TokenStream input)TypeAsPayloadTokenFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.phonetic
Subclasses of TokenStream in org.apache.lucene.analysis.phonetic Modifier and Type Class Description classBeiderMorseFilterTokenFilter for Beider-Morse phonetic encoding.classDaitchMokotoffSoundexFilterCreate tokens for phonetic matches based on Daitch–Mokotoff Soundex.classDoubleMetaphoneFilterFilter for DoubleMetaphone (supporting secondary codes)classPhoneticFilterCreate tokens for phonetic matches.Methods in org.apache.lucene.analysis.phonetic that return TokenStream Modifier and Type Method Description TokenStreamBeiderMorseFilterFactory. create(TokenStream input)Methods in org.apache.lucene.analysis.phonetic with parameters of type TokenStream Modifier and Type Method Description TokenStreamBeiderMorseFilterFactory. create(TokenStream input)DaitchMokotoffSoundexFilterDaitchMokotoffSoundexFilterFactory. create(TokenStream input)DoubleMetaphoneFilterDoubleMetaphoneFilterFactory. create(TokenStream input)PhoneticFilterPhoneticFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.phonetic with parameters of type TokenStream Constructor Description BeiderMorseFilter(TokenStream input, org.apache.commons.codec.language.bm.PhoneticEngine engine)BeiderMorseFilter(TokenStream input, org.apache.commons.codec.language.bm.PhoneticEngine engine, org.apache.commons.codec.language.bm.Languages.LanguageSet languages)Create a new BeiderMorseFilterDaitchMokotoffSoundexFilter(TokenStream in, boolean inject)Creates a DaitchMokotoffSoundexFilter by either adding encoded forms as synonyms (inject=true) or replacing them.DoubleMetaphoneFilter(TokenStream input, int maxCodeLength, boolean inject)Creates a DoubleMetaphoneFilter with the specified maximum code length, and either adding encoded forms as synonyms (inject=true) or replacing them.PhoneticFilter(TokenStream in, org.apache.commons.codec.Encoder encoder, boolean inject)Creates a PhoneticFilter with the specified encoder, and either adding encoded forms as synonyms (inject=true) or replacing them. -
Uses of TokenStream in org.apache.lucene.analysis.pl
Methods in org.apache.lucene.analysis.pl that return TokenStream Modifier and Type Method Description protected TokenStreamPolishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.pl with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamPolishAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.pt
Subclasses of TokenStream in org.apache.lucene.analysis.pt Modifier and Type Class Description classPortugueseLightStemFilterATokenFilterthat appliesPortugueseLightStemmerto stem Portuguese words.classPortugueseMinimalStemFilterATokenFilterthat appliesPortugueseMinimalStemmerto stem Portuguese words.classPortugueseStemFilterATokenFilterthat appliesPortugueseStemmerto stem Portuguese words.Methods in org.apache.lucene.analysis.pt that return TokenStream Modifier and Type Method Description TokenStreamPortugueseLightStemFilterFactory. create(TokenStream input)TokenStreamPortugueseMinimalStemFilterFactory. create(TokenStream input)TokenStreamPortugueseStemFilterFactory. create(TokenStream input)protected TokenStreamPortugueseAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.pt with parameters of type TokenStream Modifier and Type Method Description TokenStreamPortugueseLightStemFilterFactory. create(TokenStream input)TokenStreamPortugueseMinimalStemFilterFactory. create(TokenStream input)TokenStreamPortugueseStemFilterFactory. create(TokenStream input)protected TokenStreamPortugueseAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.pt with parameters of type TokenStream Constructor Description PortugueseLightStemFilter(TokenStream input)PortugueseMinimalStemFilter(TokenStream input)PortugueseStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.reverse
Subclasses of TokenStream in org.apache.lucene.analysis.reverse Modifier and Type Class Description classReverseStringFilterReverse token string, for example "country" => "yrtnuoc".Methods in org.apache.lucene.analysis.reverse with parameters of type TokenStream Modifier and Type Method Description ReverseStringFilterReverseStringFilterFactory. create(TokenStream in)Constructors in org.apache.lucene.analysis.reverse with parameters of type TokenStream Constructor Description ReverseStringFilter(TokenStream in)Create a new ReverseStringFilter that reverses all tokens in the suppliedTokenStream.ReverseStringFilter(TokenStream in, char marker)Create a new ReverseStringFilter that reverses and marks all tokens in the suppliedTokenStream. -
Uses of TokenStream in org.apache.lucene.analysis.ro
Methods in org.apache.lucene.analysis.ro that return TokenStream Modifier and Type Method Description protected TokenStreamRomanianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.ro with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamRomanianAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.ru
Subclasses of TokenStream in org.apache.lucene.analysis.ru Modifier and Type Class Description classRussianLightStemFilterATokenFilterthat appliesRussianLightStemmerto stem Russian words.Methods in org.apache.lucene.analysis.ru that return TokenStream Modifier and Type Method Description TokenStreamRussianLightStemFilterFactory. create(TokenStream input)protected TokenStreamRussianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.ru with parameters of type TokenStream Modifier and Type Method Description TokenStreamRussianLightStemFilterFactory. create(TokenStream input)protected TokenStreamRussianAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.ru with parameters of type TokenStream Constructor Description RussianLightStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.shingle
Subclasses of TokenStream in org.apache.lucene.analysis.shingle Modifier and Type Class Description classFixedShingleFilterA FixedShingleFilter constructs shingles (token n-grams) from a token stream.classShingleFilterA ShingleFilter constructs shingles (token n-grams) from a token stream.Methods in org.apache.lucene.analysis.shingle that return TokenStream Modifier and Type Method Description TokenStreamFixedShingleFilterFactory. create(TokenStream input)Methods in org.apache.lucene.analysis.shingle with parameters of type TokenStream Modifier and Type Method Description TokenStreamFixedShingleFilterFactory. create(TokenStream input)ShingleFilterShingleFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.shingle with parameters of type TokenStream Constructor Description FixedShingleFilter(TokenStream input, int shingleSize)Creates a FixedShingleFilter over an input token streamFixedShingleFilter(TokenStream input, int shingleSize, java.lang.String tokenSeparator, java.lang.String fillerToken)Creates a FixedShingleFilter over an input token streamShingleFilter(TokenStream input)Construct a ShingleFilter with default shingle size: 2.ShingleFilter(TokenStream input, int maxShingleSize)Constructs a ShingleFilter with the specified shingle size from theTokenStreaminputShingleFilter(TokenStream input, int minShingleSize, int maxShingleSize)Constructs a ShingleFilter with the specified shingle size from theTokenStreaminputShingleFilter(TokenStream input, java.lang.String tokenType)Construct a ShingleFilter with the specified token type for shingle tokens and the default shingle size: 2 -
Uses of TokenStream in org.apache.lucene.analysis.sinks
Subclasses of TokenStream in org.apache.lucene.analysis.sinks Modifier and Type Class Description classTeeSinkTokenFilterThis TokenFilter provides the ability to set aside attribute states that have already been analyzed.static classTeeSinkTokenFilter.SinkTokenStreamTokenStream output from a tee.Methods in org.apache.lucene.analysis.sinks that return TokenStream Modifier and Type Method Description TokenStreamTeeSinkTokenFilter. newSinkTokenStream()Returns a newTeeSinkTokenFilter.SinkTokenStreamthat receives all tokens consumed by this stream.Constructors in org.apache.lucene.analysis.sinks with parameters of type TokenStream Constructor Description TeeSinkTokenFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.snowball
Subclasses of TokenStream in org.apache.lucene.analysis.snowball Modifier and Type Class Description classSnowballFilterA filter that stems words using a Snowball-generated stemmer.Methods in org.apache.lucene.analysis.snowball with parameters of type TokenStream Modifier and Type Method Description TokenFilterSnowballPorterFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.snowball with parameters of type TokenStream Constructor Description SnowballFilter(TokenStream in, java.lang.String name)Construct the named stemming filter.SnowballFilter(TokenStream input, SnowballProgram stemmer) -
Uses of TokenStream in org.apache.lucene.analysis.sr
Subclasses of TokenStream in org.apache.lucene.analysis.sr Modifier and Type Class Description classSerbianNormalizationFilterNormalizes Serbian Cyrillic and Latin characters to "bald" Latin.classSerbianNormalizationRegularFilterNormalizes Serbian Cyrillic to Latin.Methods in org.apache.lucene.analysis.sr that return TokenStream Modifier and Type Method Description TokenStreamSerbianNormalizationFilterFactory. create(TokenStream input)TokenStreamSerbianNormalizationFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.sr with parameters of type TokenStream Modifier and Type Method Description TokenStreamSerbianNormalizationFilterFactory. create(TokenStream input)TokenStreamSerbianNormalizationFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.sr with parameters of type TokenStream Constructor Description SerbianNormalizationFilter(TokenStream input)SerbianNormalizationRegularFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.standard
Subclasses of TokenStream in org.apache.lucene.analysis.standard Modifier and Type Class Description classClassicFilterNormalizes tokens extracted withClassicTokenizer.classClassicTokenizerA grammar-based tokenizer constructed with JFlexclassStandardTokenizerA grammar-based tokenizer constructed with JFlex.classUAX29URLEmailTokenizerThis class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29 URLs and email addresses are also tokenized according to the relevant RFCs.Methods in org.apache.lucene.analysis.standard that return TokenStream Modifier and Type Method Description protected TokenStreamClassicAnalyzer. normalize(java.lang.String fieldName, TokenStream in)protected TokenStreamStandardAnalyzer. normalize(java.lang.String fieldName, TokenStream in)protected TokenStreamUAX29URLEmailAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.standard with parameters of type TokenStream Modifier and Type Method Description TokenFilterClassicFilterFactory. create(TokenStream input)protected TokenStreamClassicAnalyzer. normalize(java.lang.String fieldName, TokenStream in)protected TokenStreamStandardAnalyzer. normalize(java.lang.String fieldName, TokenStream in)protected TokenStreamUAX29URLEmailAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.standard with parameters of type TokenStream Constructor Description ClassicFilter(TokenStream in)Construct filtering in. -
Uses of TokenStream in org.apache.lucene.analysis.stempel
Subclasses of TokenStream in org.apache.lucene.analysis.stempel Modifier and Type Class Description classStempelFilterTransforms the token stream as per the stemming algorithm.Methods in org.apache.lucene.analysis.stempel that return TokenStream Modifier and Type Method Description TokenStreamStempelPolishStemFilterFactory. create(TokenStream input)Methods in org.apache.lucene.analysis.stempel with parameters of type TokenStream Modifier and Type Method Description TokenStreamStempelPolishStemFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.stempel with parameters of type TokenStream Constructor Description StempelFilter(TokenStream in, StempelStemmer stemmer)Create filter using the supplied stemming table.StempelFilter(TokenStream in, StempelStemmer stemmer, int minLength)Create filter using the supplied stemming table. -
Uses of TokenStream in org.apache.lucene.analysis.sv
Subclasses of TokenStream in org.apache.lucene.analysis.sv Modifier and Type Class Description classSwedishLightStemFilterATokenFilterthat appliesSwedishLightStemmerto stem Swedish words.Methods in org.apache.lucene.analysis.sv that return TokenStream Modifier and Type Method Description TokenStreamSwedishLightStemFilterFactory. create(TokenStream input)protected TokenStreamSwedishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.sv with parameters of type TokenStream Modifier and Type Method Description TokenStreamSwedishLightStemFilterFactory. create(TokenStream input)protected TokenStreamSwedishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Constructors in org.apache.lucene.analysis.sv with parameters of type TokenStream Constructor Description SwedishLightStemFilter(TokenStream input) -
Uses of TokenStream in org.apache.lucene.analysis.synonym
Subclasses of TokenStream in org.apache.lucene.analysis.synonym Modifier and Type Class Description classSynonymFilterDeprecated.UseSynonymGraphFilterinstead, but be sure to also useFlattenGraphFilterat index time (not at search time) as well.classSynonymGraphFilterApplies single- or multi-token synonyms from aSynonymMapto an incomingTokenStream, producing a fully correct graph output.Methods in org.apache.lucene.analysis.synonym that return TokenStream Modifier and Type Method Description TokenStreamSynonymFilterFactory. create(TokenStream input)Deprecated.TokenStreamSynonymGraphFilterFactory. create(TokenStream input)Methods in org.apache.lucene.analysis.synonym with parameters of type TokenStream Modifier and Type Method Description TokenStreamSynonymFilterFactory. create(TokenStream input)Deprecated.TokenStreamSynonymGraphFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.analysis.synonym with parameters of type TokenStream Constructor Description SynonymFilter(TokenStream input, SynonymMap synonyms, boolean ignoreCase)Deprecated.SynonymGraphFilter(TokenStream input, SynonymMap synonyms, boolean ignoreCase)Apply previously built synonyms to incoming tokens. -
Uses of TokenStream in org.apache.lucene.analysis.th
Subclasses of TokenStream in org.apache.lucene.analysis.th Modifier and Type Class Description classThaiTokenizerTokenizer that useBreakIteratorto tokenize Thai text.Methods in org.apache.lucene.analysis.th that return TokenStream Modifier and Type Method Description protected TokenStreamThaiAnalyzer. normalize(java.lang.String fieldName, TokenStream in)Methods in org.apache.lucene.analysis.th with parameters of type TokenStream Modifier and Type Method Description protected TokenStreamThaiAnalyzer. normalize(java.lang.String fieldName, TokenStream in) -
Uses of TokenStream in org.apache.lucene.analysis.tr
Subclasses of TokenStream in org.apache.lucene.analysis.tr Modifier and Type Class Description classApostropheFilterStrips all characters after an apostrophe (including the apostrophe itself).classTurkishLowerCaseFilterNormalizes Turkish token text to lower case.Methods in org.apache.lucene.analysis.tr that return TokenStream Modifier and Type Method Description TokenStreamApostropheFilterFactory. create(TokenStream input)TokenStreamTurkishLowerCaseFilterFactory. create(TokenStream input)protected TokenStreamTurkishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamTurkishLowerCaseFilterFactory. normalize(TokenStream input)Methods in org.apache.lucene.analysis.tr with parameters of type TokenStream Modifier and Type Method Description TokenStreamApostropheFilterFactory. create(TokenStream input)TokenStreamTurkishLowerCaseFilterFactory. create(TokenStream input)protected TokenStreamTurkishAnalyzer. normalize(java.lang.String fieldName, TokenStream in)TokenStreamTurkishLowerCaseFilterFactory. normalize(TokenStream input)Constructors in org.apache.lucene.analysis.tr with parameters of type TokenStream Constructor Description ApostropheFilter(TokenStream in)TurkishLowerCaseFilter(TokenStream in)Create a new TurkishLowerCaseFilter, that normalizes Turkish token text to lower case. -
Uses of TokenStream in org.apache.lucene.analysis.util
Subclasses of TokenStream in org.apache.lucene.analysis.util Modifier and Type Class Description classCharTokenizerAn abstract base class for simple, character-oriented tokenizers.classElisionFilterRemoves elisions from aTokenStream.classSegmentingTokenizerBaseBreaks text into sentences with aBreakIteratorand allows subclasses to decompose these sentences into words.Methods in org.apache.lucene.analysis.util that return TokenStream Modifier and Type Method Description TokenStreamElisionFilterFactory. create(TokenStream input)abstract TokenStreamTokenFilterFactory. create(TokenStream input)Transform the specified input TokenStreamTokenStreamElisionFilterFactory. normalize(TokenStream input)TokenStreamTokenFilterFactory. normalize(TokenStream input)Normalize the specified input TokenStream While the default implementation returns input unchanged, filters that should be applied at normalization time can delegate tocreatemethod.Methods in org.apache.lucene.analysis.util with parameters of type TokenStream Modifier and Type Method Description TokenStreamElisionFilterFactory. create(TokenStream input)abstract TokenStreamTokenFilterFactory. create(TokenStream input)Transform the specified input TokenStreamTokenStreamElisionFilterFactory. normalize(TokenStream input)TokenStreamTokenFilterFactory. normalize(TokenStream input)Normalize the specified input TokenStream While the default implementation returns input unchanged, filters that should be applied at normalization time can delegate tocreatemethod.Constructors in org.apache.lucene.analysis.util with parameters of type TokenStream Constructor Description ElisionFilter(TokenStream input, CharArraySet articles)Constructs an elision filter with a Set of stop words -
Uses of TokenStream in org.apache.lucene.analysis.wikipedia
Subclasses of TokenStream in org.apache.lucene.analysis.wikipedia Modifier and Type Class Description classWikipediaTokenizerExtension of StandardTokenizer that is aware of Wikipedia syntax. -
Uses of TokenStream in org.apache.lucene.classification.document
Methods in org.apache.lucene.classification.document with parameters of type TokenStream Modifier and Type Method Description protected java.lang.String[]SimpleNaiveBayesDocumentClassifier. getTokenArray(TokenStream tokenizedText)Returns a token array from theTokenStreamin input -
Uses of TokenStream in org.apache.lucene.codecs
Methods in org.apache.lucene.codecs that return TokenStream Modifier and Type Method Description TokenStreamStoredFieldsWriter.MergeVisitor. tokenStream(Analyzer analyzer, TokenStream reuse)Methods in org.apache.lucene.codecs with parameters of type TokenStream Modifier and Type Method Description TokenStreamStoredFieldsWriter.MergeVisitor. tokenStream(Analyzer analyzer, TokenStream reuse) -
Uses of TokenStream in org.apache.lucene.document
Subclasses of TokenStream in org.apache.lucene.document Modifier and Type Class Description private static classFeatureField.FeatureTokenStreamprivate static classField.BinaryTokenStreamprivate static classField.StringTokenStreamFields in org.apache.lucene.document declared as TokenStream Modifier and Type Field Description protected TokenStreamField. tokenStreamPre-analyzed tokenStream for indexed fields; this is separate from fieldsData because you are allowed to have both; eg maybe field has a String value but you customize how it's tokenizedMethods in org.apache.lucene.document that return TokenStream Modifier and Type Method Description TokenStreamFeatureField. tokenStream(Analyzer analyzer, TokenStream reuse)TokenStreamField. tokenStream(Analyzer analyzer, TokenStream reuse)TokenStreamLazyDocument.LazyField. tokenStream(Analyzer analyzer, TokenStream reuse)TokenStreamField. tokenStreamValue()The TokenStream for this field to be used when indexing, or null.Methods in org.apache.lucene.document with parameters of type TokenStream Modifier and Type Method Description voidField. setTokenStream(TokenStream tokenStream)Expert: sets the token stream to be used for indexing and causes isIndexed() and isTokenized() to return true.TokenStreamFeatureField. tokenStream(Analyzer analyzer, TokenStream reuse)TokenStreamField. tokenStream(Analyzer analyzer, TokenStream reuse)TokenStreamLazyDocument.LazyField. tokenStream(Analyzer analyzer, TokenStream reuse)Constructors in org.apache.lucene.document with parameters of type TokenStream Constructor Description Field(java.lang.String name, TokenStream tokenStream, IndexableFieldType type)Create field with TokenStream value.TextField(java.lang.String name, TokenStream stream)Creates a new un-stored TextField with TokenStream value. -
Uses of TokenStream in org.apache.lucene.index
Fields in org.apache.lucene.index declared as TokenStream Modifier and Type Field Description (package private) TokenStreamDefaultIndexingChain.PerField. tokenStreamMethods in org.apache.lucene.index that return TokenStream Modifier and Type Method Description TokenStreamIndexableField. tokenStream(Analyzer analyzer, TokenStream reuse)Creates the TokenStream used for indexing this field.TokenStreamSortingStoredFieldsConsumer.CopyVisitor. tokenStream(Analyzer analyzer, TokenStream reuse)Methods in org.apache.lucene.index with parameters of type TokenStream Modifier and Type Method Description TokenStreamIndexableField. tokenStream(Analyzer analyzer, TokenStream reuse)Creates the TokenStream used for indexing this field.TokenStreamSortingStoredFieldsConsumer.CopyVisitor. tokenStream(Analyzer analyzer, TokenStream reuse) -
Uses of TokenStream in org.apache.lucene.index.memory
Methods in org.apache.lucene.index.memory that return TokenStream Modifier and Type Method Description <T> TokenStreamMemoryIndex. keywordTokenStream(java.util.Collection<T> keywords)Convenience method; Creates and returns a token stream that generates a token for each keyword in the given collection, "as is", without any transforming text analysis.Methods in org.apache.lucene.index.memory with parameters of type TokenStream Modifier and Type Method Description voidMemoryIndex. addField(java.lang.String fieldName, TokenStream stream)Iterates over the given token stream and adds the resulting terms to the index; Equivalent to adding a tokenized, indexed, termVectorStored, unstored, LuceneField.voidMemoryIndex. addField(java.lang.String fieldName, TokenStream stream, int positionIncrementGap)Iterates over the given token stream and adds the resulting terms to the index; Equivalent to adding a tokenized, indexed, termVectorStored, unstored, LuceneField.voidMemoryIndex. addField(java.lang.String fieldName, TokenStream tokenStream, int positionIncrementGap, int offsetGap)Iterates over the given token stream and adds the resulting terms to the index; Equivalent to adding a tokenized, indexed, termVectorStored, unstored, LuceneField.private voidMemoryIndex. storeTerms(MemoryIndex.Info info, TokenStream tokenStream, int positionIncrementGap, int offsetGap) -
Uses of TokenStream in org.apache.lucene.monitor
Subclasses of TokenStream in org.apache.lucene.monitor Modifier and Type Class Description (package private) classSuffixingNGramTokenFilter(package private) classTermsEnumTokenStreamA TokenStream created from aTermsEnumMethods in org.apache.lucene.monitor that return TokenStream Modifier and Type Method Description default TokenStreamCustomQueryHandler. wrapTermStream(java.lang.String field, TokenStream in)Adds additional processing to theTokenStreamover a document's terms indexTokenStreamRegexpQueryHandler. wrapTermStream(java.lang.String field, TokenStream ts)Methods in org.apache.lucene.monitor with parameters of type TokenStream Modifier and Type Method Description default TokenStreamCustomQueryHandler. wrapTermStream(java.lang.String field, TokenStream in)Adds additional processing to theTokenStreamover a document's terms indexTokenStreamRegexpQueryHandler. wrapTermStream(java.lang.String field, TokenStream ts)Constructors in org.apache.lucene.monitor with parameters of type TokenStream Constructor Description SuffixingNGramTokenFilter(TokenStream input, java.lang.String suffix, java.lang.String wildcardToken, int maxTokenLength)Creates SuffixingNGramTokenFilter. -
Uses of TokenStream in org.apache.lucene.search
Methods in org.apache.lucene.search with parameters of type TokenStream Modifier and Type Method Description TermAutomatonQueryTokenStreamToTermAutomatonQuery. toQuery(java.lang.String field, TokenStream in)Pulls the graph (includingPositionLengthAttribute) from the providedTokenStream, and creates the corresponding automaton where arcs are bytes (or Unicode code points if unicodeArcs = true) from each term. -
Uses of TokenStream in org.apache.lucene.search.highlight
Subclasses of TokenStream in org.apache.lucene.search.highlight Modifier and Type Class Description (package private) classLimitTokenOffsetFilterThis is a simplified version of org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilter to prevent a dependency on analyzers-common.jar.classOffsetLimitTokenFilterThis TokenFilter limits the number of tokens while indexing by adding up the current offset.classTokenStreamFromTermVectorTokenStream created from a term vector field.Fields in org.apache.lucene.search.highlight declared as TokenStream Modifier and Type Field Description private TokenStreamWeightedSpanTermExtractor. tokenStreamMethods in org.apache.lucene.search.highlight that return TokenStream Modifier and Type Method Description static TokenStreamTokenSources. getAnyTokenStream(IndexReader reader, int docId, java.lang.String field, Analyzer analyzer)Deprecated.static TokenStreamTokenSources. getAnyTokenStream(IndexReader reader, int docId, java.lang.String field, Document document, Analyzer analyzer)Deprecated.static TokenStreamTokenSources. getTermVectorTokenStreamOrNull(java.lang.String field, Fields tvFields, int maxStartOffset)Get a token stream by un-inverting the term vector.static TokenStreamTokenSources. getTokenStream(java.lang.String field, java.lang.String contents, Analyzer analyzer)Deprecated.static TokenStreamTokenSources. getTokenStream(java.lang.String field, Fields tvFields, java.lang.String text, Analyzer analyzer, int maxStartOffset)Get a token stream from either un-inverting a term vector if possible, or by analyzing the text.static TokenStreamTokenSources. getTokenStream(Document doc, java.lang.String field, Analyzer analyzer)Deprecated.static TokenStreamTokenSources. getTokenStream(IndexReader reader, int docId, java.lang.String field, Analyzer analyzer)Deprecated.static TokenStreamTokenSources. getTokenStream(Terms tpv)Deprecated.static TokenStreamTokenSources. getTokenStream(Terms vector, boolean tokenPositionsGuaranteedContiguous)Deprecated.TokenStreamWeightedSpanTermExtractor. getTokenStream()Returns the tokenStream which may have been wrapped in a CachingTokenFilter.static TokenStreamTokenSources. getTokenStreamWithOffsets(IndexReader reader, int docId, java.lang.String field)Deprecated.TokenStreamQueryScorer. init(TokenStream tokenStream)TokenStreamQueryTermScorer. init(TokenStream tokenStream)TokenStreamScorer. init(TokenStream tokenStream)Called to init the Scorer with aTokenStream.private TokenStreamQueryScorer. initExtractor(TokenStream tokenStream)Methods in org.apache.lucene.search.highlight with parameters of type TokenStream Modifier and Type Method Description java.lang.StringHighlighter. getBestFragment(TokenStream tokenStream, java.lang.String text)Highlights chosen terms in a text, extracting the most relevant section.java.lang.String[]Highlighter. getBestFragments(TokenStream tokenStream, java.lang.String text, int maxNumFragments)Highlights chosen terms in a text, extracting the most relevant sections.java.lang.StringHighlighter. getBestFragments(TokenStream tokenStream, java.lang.String text, int maxNumFragments, java.lang.String separator)Highlights terms in the text , extracting the most relevant sections and concatenating the chosen fragments with a separator (typically "...").TextFragment[]Highlighter. getBestTextFragments(TokenStream tokenStream, java.lang.String text, boolean mergeContiguousFragments, int maxNumFragments)Low level api to get the most relevant (formatted) sections of the document.java.util.Map<java.lang.String,WeightedSpanTerm>WeightedSpanTermExtractor. getWeightedSpanTerms(Query query, float boost, TokenStream tokenStream)Creates a Map ofWeightedSpanTermsfrom the givenQueryandTokenStream.java.util.Map<java.lang.String,WeightedSpanTerm>WeightedSpanTermExtractor. getWeightedSpanTerms(Query query, float boost, TokenStream tokenStream, java.lang.String fieldName)Creates a Map ofWeightedSpanTermsfrom the givenQueryandTokenStream.java.util.Map<java.lang.String,WeightedSpanTerm>WeightedSpanTermExtractor. getWeightedSpanTermsWithScores(Query query, float boost, TokenStream tokenStream, java.lang.String fieldName, IndexReader reader)Creates a Map ofWeightedSpanTermsfrom the givenQueryandTokenStream.TokenStreamQueryScorer. init(TokenStream tokenStream)TokenStreamQueryTermScorer. init(TokenStream tokenStream)TokenStreamScorer. init(TokenStream tokenStream)Called to init the Scorer with aTokenStream.private TokenStreamQueryScorer. initExtractor(TokenStream tokenStream)voidFragmenter. start(java.lang.String originalText, TokenStream tokenStream)Initializes the Fragmenter.voidNullFragmenter. start(java.lang.String s, TokenStream tokenStream)voidSimpleFragmenter. start(java.lang.String originalText, TokenStream stream)voidSimpleSpanFragmenter. start(java.lang.String originalText, TokenStream tokenStream)Constructors in org.apache.lucene.search.highlight with parameters of type TokenStream Constructor Description LimitTokenOffsetFilter(TokenStream input, int maxStartOffset)OffsetLimitTokenFilter(TokenStream input, int offsetLimit)TokenGroup(TokenStream tokenStream) -
Uses of TokenStream in org.apache.lucene.search.suggest.analyzing
Subclasses of TokenStream in org.apache.lucene.search.suggest.analyzing Modifier and Type Class Description classSuggestStopFilterLikeStopFilterexcept it will not remove the last token if that token was not followed by some token separator.Methods in org.apache.lucene.search.suggest.analyzing that return TokenStream Modifier and Type Method Description TokenStreamSuggestStopFilterFactory. create(TokenStream input)Methods in org.apache.lucene.search.suggest.analyzing with parameters of type TokenStream Modifier and Type Method Description TokenStreamSuggestStopFilterFactory. create(TokenStream input)Constructors in org.apache.lucene.search.suggest.analyzing with parameters of type TokenStream Constructor Description SuggestStopFilter(TokenStream input, CharArraySet stopWords)Sole constructor. -
Uses of TokenStream in org.apache.lucene.search.suggest.document
Subclasses of TokenStream in org.apache.lucene.search.suggest.document Modifier and Type Class Description classCompletionTokenStreamAConcatenateGraphFilterbut we can set the payload and provide access to config options.private static classContextSuggestField.PrefixTokenFilterTheContextSuggestField.PrefixTokenFilterwraps aTokenStreamand adds a set prefixes ahead.Fields in org.apache.lucene.search.suggest.document declared as TokenStream Modifier and Type Field Description (package private) TokenStreamCompletionTokenStream. inputTokenStreamMethods in org.apache.lucene.search.suggest.document that return TokenStream Modifier and Type Method Description TokenStreamSuggestField. tokenStream(Analyzer analyzer, TokenStream reuse)Methods in org.apache.lucene.search.suggest.document with parameters of type TokenStream Modifier and Type Method Description TokenStreamSuggestField. tokenStream(Analyzer analyzer, TokenStream reuse)protected CompletionTokenStreamContextSuggestField. wrapTokenStream(TokenStream stream)protected CompletionTokenStreamSuggestField. wrapTokenStream(TokenStream stream)Wraps astreamwith a CompletionTokenStream.Constructors in org.apache.lucene.search.suggest.document with parameters of type TokenStream Constructor Description CompletionTokenStream(TokenStream inputTokenStream)CompletionTokenStream(TokenStream inputTokenStream, boolean preserveSep, boolean preservePositionIncrements, int maxGraphExpansions)PrefixTokenFilter(TokenStream input, char separator, java.lang.Iterable<java.lang.CharSequence> prefixes)Create a newContextSuggestField.PrefixTokenFilter -
Uses of TokenStream in org.apache.lucene.search.uhighlight
Subclasses of TokenStream in org.apache.lucene.search.uhighlight Modifier and Type Class Description private static classAnalysisOffsetStrategy.MultiValueTokenStreamWraps anAnalyzerand string text that represents multiple values delimited by a specified character.Fields in org.apache.lucene.search.uhighlight declared as TokenStream Modifier and Type Field Description (package private) TokenStreamTokenStreamOffsetStrategy.TokenStreamOffsetsEnum. streamMethods in org.apache.lucene.search.uhighlight that return TokenStream Modifier and Type Method Description protected TokenStreamAnalysisOffsetStrategy. tokenStream(java.lang.String content)Methods in org.apache.lucene.search.uhighlight with parameters of type TokenStream Modifier and Type Method Description private static FilteringTokenFilterMemoryIndexOffsetStrategy. newKeepWordFilter(TokenStream tokenStream, CharArrayMatcher matcher)Constructors in org.apache.lucene.search.uhighlight with parameters of type TokenStream Constructor Description MultiValueTokenStream(TokenStream subTokenStream, java.lang.String fieldName, Analyzer indexAnalyzer, java.lang.String content, char splitChar, int splitCharIdx)TokenStreamOffsetsEnum(TokenStream ts, CharArrayMatcher[] matchers) -
Uses of TokenStream in org.apache.lucene.util
Methods in org.apache.lucene.util with parameters of type TokenStream Modifier and Type Method Description protected QueryQueryBuilder. analyzeBoolean(java.lang.String field, TokenStream stream)Creates simple boolean query from the cached tokenstream contentsprotected QueryQueryBuilder. analyzeGraphBoolean(java.lang.String field, TokenStream source, BooleanClause.Occur operator)Creates a boolean query from a graph token stream.protected QueryQueryBuilder. analyzeGraphPhrase(TokenStream source, java.lang.String field, int phraseSlop)Creates graph phrase query from the tokenstream contentsprotected QueryQueryBuilder. analyzeMultiBoolean(java.lang.String field, TokenStream stream, BooleanClause.Occur operator)Creates complex boolean query from the cached tokenstream contentsprotected QueryQueryBuilder. analyzeMultiPhrase(java.lang.String field, TokenStream stream, int slop)Creates complex phrase query from the cached tokenstream contentsprotected QueryQueryBuilder. analyzePhrase(java.lang.String field, TokenStream stream, int slop)Creates simple phrase query from the cached tokenstream contentsprotected QueryQueryBuilder. analyzeTerm(java.lang.String field, TokenStream stream)Creates simple term query from the cached tokenstream contentsprotected QueryQueryBuilder. createFieldQuery(TokenStream source, BooleanClause.Occur operator, java.lang.String field, boolean quoted, int phraseSlop)Creates a query from a token stream.protected SpanQueryQueryBuilder. createSpanQuery(TokenStream in, java.lang.String field)Creates a span query from the tokenstream. -
Uses of TokenStream in org.apache.lucene.util.graph
Subclasses of TokenStream in org.apache.lucene.util.graph Modifier and Type Class Description private classGraphTokenStreamFiniteStrings.FiniteStringsTokenStreamMethods in org.apache.lucene.util.graph that return types with arguments of type TokenStream Modifier and Type Method Description java.util.Iterator<TokenStream>GraphTokenStreamFiniteStrings. getFiniteStrings()Get all finite strings from the automaton.java.util.Iterator<TokenStream>GraphTokenStreamFiniteStrings. getFiniteStrings(int startState, int endState)Get all finite strings that start atstartStateand end atendState.Methods in org.apache.lucene.util.graph with parameters of type TokenStream Modifier and Type Method Description private AutomatonGraphTokenStreamFiniteStrings. build(TokenStream in)Build an automaton from the providedTokenStream.Constructors in org.apache.lucene.util.graph with parameters of type TokenStream Constructor Description GraphTokenStreamFiniteStrings(TokenStream in)
-