Uses of Class
org.apache.lucene.analysis.TokenFilter
Packages that use TokenFilter
Package | Description
org.apache.lucene.analysis | Text analysis.
org.apache.lucene.analysis.ar | Analyzer for Arabic.
org.apache.lucene.analysis.bg | Analyzer for Bulgarian.
org.apache.lucene.analysis.bn | Analyzer for Bengali Language.
org.apache.lucene.analysis.boost | Provides various convenience classes for creating boosts on Tokens.
org.apache.lucene.analysis.br | Analyzer for Brazilian Portuguese.
org.apache.lucene.analysis.cjk | Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.
org.apache.lucene.analysis.ckb | Analyzer for Sorani Kurdish.
org.apache.lucene.analysis.commongrams | Construct n-grams for frequently occurring terms and phrases.
org.apache.lucene.analysis.compound | A filter that decomposes compound words you find in many Germanic languages into the word parts.
org.apache.lucene.analysis.core | Basic, general-purpose analysis components.
org.apache.lucene.analysis.cz | Analyzer for Czech.
org.apache.lucene.analysis.de | Analyzer for German.
org.apache.lucene.analysis.el | Analyzer for Greek.
org.apache.lucene.analysis.en | Analyzer for English.
org.apache.lucene.analysis.es | Analyzer for Spanish.
org.apache.lucene.analysis.fa | Analyzer for Persian.
org.apache.lucene.analysis.fi | Analyzer for Finnish.
org.apache.lucene.analysis.fr | Analyzer for French.
org.apache.lucene.analysis.ga | Analyzer for Irish.
org.apache.lucene.analysis.gl | Analyzer for Galician.
org.apache.lucene.analysis.hi | Analyzer for Hindi.
org.apache.lucene.analysis.hu | Analyzer for Hungarian.
org.apache.lucene.analysis.hunspell | Stemming TokenFilter using a Java implementation of the Hunspell stemming algorithm.
org.apache.lucene.analysis.icu | Analysis components based on ICU.
org.apache.lucene.analysis.id | Analyzer for Indonesian.
org.apache.lucene.analysis.in | Analyzer for Indian languages.
org.apache.lucene.analysis.it | Analyzer for Italian.
org.apache.lucene.analysis.ja | Analyzer for Japanese.
org.apache.lucene.analysis.ko | Analyzer for Korean.
org.apache.lucene.analysis.lv | Analyzer for Latvian.
org.apache.lucene.analysis.minhash | MinHash filtering (for LSH).
org.apache.lucene.analysis.miscellaneous | Miscellaneous TokenStreams.
org.apache.lucene.analysis.ngram | Character n-gram tokenizers and filters.
org.apache.lucene.analysis.no | Analyzer for Norwegian.
org.apache.lucene.analysis.pattern | Set of components for pattern-based (regex) analysis.
org.apache.lucene.analysis.payloads | Provides various convenience classes for creating payloads on Tokens.
org.apache.lucene.analysis.phonetic | Analysis components for phonetic search.
org.apache.lucene.analysis.pt | Analyzer for Portuguese.
org.apache.lucene.analysis.reverse | Filter to reverse token text.
org.apache.lucene.analysis.ru | Analyzer for Russian.
org.apache.lucene.analysis.shingle | Word n-gram filters.
org.apache.lucene.analysis.sinks |
org.apache.lucene.analysis.snowball | TokenFilter and Analyzer implementations that use Snowball stemmers.
org.apache.lucene.analysis.sr | Analyzer for Serbian.
org.apache.lucene.analysis.standard | Fast, general-purpose grammar-based tokenizer. StandardTokenizer implements the Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
org.apache.lucene.analysis.stempel | Stempel: Algorithmic Stemmer.
org.apache.lucene.analysis.sv | Analyzer for Swedish.
org.apache.lucene.analysis.synonym | Analysis components for Synonyms.
org.apache.lucene.analysis.tr | Analyzer for Turkish.
org.apache.lucene.analysis.util | Utility functions for text analysis.
org.apache.lucene.monitor | Monitoring framework.
org.apache.lucene.search.highlight | Highlighting search terms.
org.apache.lucene.search.suggest.analyzing | Analyzer based autosuggest.
org.apache.lucene.search.suggest.document | Support for document suggestion.
org.apache.lucene.search.uhighlight | The UnifiedHighlighter, a flexible highlighter that can get offsets from postings, term vectors, or analysis.
Uses of TokenFilter in org.apache.lucene.analysis
Subclasses of TokenFilter in org.apache.lucene.analysis
Modifier and Type | Class | Description
class | CachingTokenFilter | This class can be used if the token attributes of a TokenStream are intended to be consumed more than once.
class | FilteringTokenFilter | Abstract base class for TokenFilters that may remove tokens.
class | GraphTokenFilter | An abstract TokenFilter that exposes its input stream as a graph. Call GraphTokenFilter.incrementBaseToken() to move the root of the graph to the next position in the TokenStream, GraphTokenFilter.incrementGraphToken() to move along the current graph, and GraphTokenFilter.incrementGraph() to reset to the next graph based at the current root.
class | LowerCaseFilter | Normalizes token text to lower case.
class | StopFilter | Removes stop words from a token stream.
Uses of TokenFilter in org.apache.lucene.analysis.ar
Subclasses of TokenFilter in org.apache.lucene.analysis.ar
Modifier and Type | Class | Description
class | ArabicNormalizationFilter | A TokenFilter that applies ArabicNormalizer to normalize the orthography.
class | ArabicStemFilter | A TokenFilter that applies ArabicStemmer to stem Arabic words.
Uses of TokenFilter in org.apache.lucene.analysis.bg
Subclasses of TokenFilter in org.apache.lucene.analysis.bg
Modifier and Type | Class | Description
class | BulgarianStemFilter | A TokenFilter that applies BulgarianStemmer to stem Bulgarian words.
Uses of TokenFilter in org.apache.lucene.analysis.bn
Subclasses of TokenFilter in org.apache.lucene.analysis.bn
Modifier and Type | Class | Description
class | BengaliNormalizationFilter | A TokenFilter that applies BengaliNormalizer to normalize the orthography.
class | BengaliStemFilter | A TokenFilter that applies BengaliStemmer to stem Bengali words.
Uses of TokenFilter in org.apache.lucene.analysis.boost
Subclasses of TokenFilter in org.apache.lucene.analysis.boost
Modifier and Type | Class | Description
class | DelimitedBoostTokenFilter | Characters before the delimiter are the "token", those after are the boost.
Uses of TokenFilter in org.apache.lucene.analysis.br
Subclasses of TokenFilter in org.apache.lucene.analysis.br
Modifier and Type | Class | Description
class | BrazilianStemFilter | A TokenFilter that applies BrazilianStemmer.
Uses of TokenFilter in org.apache.lucene.analysis.cjk
Subclasses of TokenFilter in org.apache.lucene.analysis.cjk
Modifier and Type | Class | Description
class | CJKBigramFilter | Forms bigrams of CJK terms that are generated from StandardTokenizer or ICUTokenizer.
class | CJKWidthFilter | A TokenFilter that normalizes CJK width differences: folds fullwidth ASCII variants into the equivalent Basic Latin; folds halfwidth Katakana variants into the equivalent kana.
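To make the idea of "forming bigrams of CJK terms" concrete, here is a minimal sketch in plain Java. It does not use the Lucene API: the class name BigramSketch and the method bigrams are hypothetical, and the sketch only shows the overlapping two-character window, not the token types, offsets, or unigram fallback that a real filter would have to handle.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: illustrates overlapping bigrams.
final class BigramSketch {
    // Returns every overlapping window of two code points in the input.
    // Inputs shorter than two code points yield an empty list (an assumption
    // of this sketch).
    static List<String> bigrams(String text) {
        int[] cps = text.codePoints().toArray();
        List<String> out = new ArrayList<>();
        for (int i = 0; i + 1 < cps.length; i++) {
            out.add(new String(cps, i, 2));
        }
        return out;
    }
}
```

Under these assumptions, BigramSketch.bigrams("東京都") yields the two bigrams 東京 and 京都.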
Uses of TokenFilter in org.apache.lucene.analysis.ckb
Subclasses of TokenFilter in org.apache.lucene.analysis.ckb
Modifier and Type | Class | Description
class | SoraniNormalizationFilter | A TokenFilter that applies SoraniNormalizer to normalize the orthography.
class | SoraniStemFilter | A TokenFilter that applies SoraniStemmer to stem Sorani words.
Uses of TokenFilter in org.apache.lucene.analysis.commongrams
Subclasses of TokenFilter in org.apache.lucene.analysis.commongrams
Modifier and Type | Class | Description
class | CommonGramsFilter | Construct bigrams for frequently occurring terms while indexing.
class | CommonGramsQueryFilter | Wrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram.

Methods in org.apache.lucene.analysis.commongrams that return TokenFilter
Modifier and Type | Method | Description
TokenFilter | CommonGramsFilterFactory.create(TokenStream input) |
TokenFilter | CommonGramsQueryFilterFactory.create(TokenStream input) | Create a CommonGramsFilter and wrap it with a CommonGramsQueryFilter.
Uses of TokenFilter in org.apache.lucene.analysis.compound
Subclasses of TokenFilter in org.apache.lucene.analysis.compound
Modifier and Type | Class | Description
class | CompoundWordTokenFilterBase | Base class for decomposition token filters.
class | DictionaryCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages.
class | HyphenationCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages.

Methods in org.apache.lucene.analysis.compound that return TokenFilter
Modifier and Type | Method | Description
TokenFilter | HyphenationCompoundWordTokenFilterFactory.create(TokenStream input) |
Uses of TokenFilter in org.apache.lucene.analysis.core
Subclasses of TokenFilter in org.apache.lucene.analysis.core
Modifier and Type | Class | Description
class | DecimalDigitFilter | Folds all Unicode digits in [:General_Category=Decimal_Number:] to Basic Latin digits (0-9).
class | FlattenGraphFilter | Converts an incoming graph token stream, such as one from SynonymGraphFilter, into a flat form so that all nodes form a single linear chain with no side paths.
class | LowerCaseFilter | Normalizes token text to lower case.
class | StopFilter | Removes stop words from a token stream.
class | TypeTokenFilter | Removes tokens whose types appear in a set of blocked types from a token stream.
class | UpperCaseFilter | Normalizes token text to UPPER CASE.
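The digit folding described for DecimalDigitFilter can be approximated with the JDK alone. The following is a sketch under that assumption, not the Lucene implementation: DigitFoldSketch and foldDigits are hypothetical names, and the code relies on java.lang.Character's view of the Decimal_Number category, which may differ from the Unicode version a given Lucene release uses.

```java
// Hypothetical helper, not part of Lucene: folds Unicode decimal digits to 0-9.
final class DigitFoldSketch {
    static String foldDigits(String text) {
        StringBuilder sb = new StringBuilder(text.length());
        text.codePoints().forEach(cp -> {
            if (Character.getType(cp) == Character.DECIMAL_DIGIT_NUMBER) {
                // Character.digit returns the numeric value 0..9 for such code points.
                sb.append((char) ('0' + Character.digit(cp, 10)));
            } else {
                // Everything else is copied through unchanged.
                sb.appendCodePoint(cp);
            }
        });
        return sb.toString();
    }
}
```

For example, the Arabic-Indic digits U+0661 U+0662 U+0663 are folded to "123", while ASCII text passes through untouched.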
Uses of TokenFilter in org.apache.lucene.analysis.cz
Subclasses of TokenFilter in org.apache.lucene.analysis.cz
Modifier and Type | Class | Description
class | CzechStemFilter | A TokenFilter that applies CzechStemmer to stem Czech words.
Uses of TokenFilter in org.apache.lucene.analysis.de
Subclasses of TokenFilter in org.apache.lucene.analysis.de
Modifier and Type | Class | Description
class | GermanLightStemFilter | A TokenFilter that applies GermanLightStemmer to stem German words.
class | GermanMinimalStemFilter | A TokenFilter that applies GermanMinimalStemmer to stem German words.
class | GermanNormalizationFilter | Normalizes German characters according to the heuristics of the German2 snowball algorithm.
class | GermanStemFilter | A TokenFilter that stems German words.
Uses of TokenFilter in org.apache.lucene.analysis.el
Subclasses of TokenFilter in org.apache.lucene.analysis.el
Modifier and Type | Class | Description
class | GreekLowerCaseFilter | Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.
class | GreekStemFilter | A TokenFilter that applies GreekStemmer to stem Greek words.
Uses of TokenFilter in org.apache.lucene.analysis.en
Subclasses of TokenFilter in org.apache.lucene.analysis.en
Modifier and Type | Class | Description
class | EnglishMinimalStemFilter | A TokenFilter that applies EnglishMinimalStemmer to stem English words.
class | EnglishPossessiveFilter | TokenFilter that removes possessives (trailing 's) from words.
class | KStemFilter | A high-performance kstem filter for English.
class | PorterStemFilter | Transforms the token stream as per the Porter stemming algorithm.

Methods in org.apache.lucene.analysis.en that return TokenFilter
Modifier and Type | Method | Description
TokenFilter | KStemFilterFactory.create(TokenStream input) |
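As an illustration of what "removes possessives (trailing 's)" means for a single term, here is a small plain-Java sketch. PossessiveSketch and stripPossessive are hypothetical names, and the set of apostrophe characters handled (ASCII ' and U+2019) is an assumption of this sketch rather than a statement about the Lucene filter.

```java
// Hypothetical helper, not part of Lucene: strips a trailing possessive 's.
final class PossessiveSketch {
    static String stripPossessive(String term) {
        int n = term.length();
        if (n >= 2) {
            char last = term.charAt(n - 1);
            char apos = term.charAt(n - 2);
            boolean isS = last == 's' || last == 'S';
            boolean isApostrophe = apos == '\'' || apos == '\u2019';
            if (isS && isApostrophe) {
                // Drop both the apostrophe and the s.
                return term.substring(0, n - 2);
            }
        }
        return term;
    }
}
```

With this sketch, "dog's" becomes "dog", while "dogs" and "it's-like" are left unchanged because they do not end in an apostrophe followed by s.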
Uses of TokenFilter in org.apache.lucene.analysis.es
Subclasses of TokenFilter in org.apache.lucene.analysis.es
Modifier and Type | Class | Description
class | SpanishLightStemFilter | A TokenFilter that applies SpanishLightStemmer to stem Spanish words.
class | SpanishMinimalStemFilter | A TokenFilter that applies SpanishMinimalStemmer to stem Spanish words.
Uses of TokenFilter in org.apache.lucene.analysis.fa
Subclasses of TokenFilter in org.apache.lucene.analysis.fa
Modifier and Type | Class | Description
class | PersianNormalizationFilter | A TokenFilter that applies PersianNormalizer to normalize the orthography.
Uses of TokenFilter in org.apache.lucene.analysis.fi
Subclasses of TokenFilter in org.apache.lucene.analysis.fi
Modifier and Type | Class | Description
class | FinnishLightStemFilter | A TokenFilter that applies FinnishLightStemmer to stem Finnish words.
Uses of TokenFilter in org.apache.lucene.analysis.fr
Subclasses of TokenFilter in org.apache.lucene.analysis.fr
Modifier and Type | Class | Description
class | FrenchLightStemFilter | A TokenFilter that applies FrenchLightStemmer to stem French words.
class | FrenchMinimalStemFilter | A TokenFilter that applies FrenchMinimalStemmer to stem French words.
Uses of TokenFilter in org.apache.lucene.analysis.ga
Subclasses of TokenFilter in org.apache.lucene.analysis.ga
Modifier and Type | Class | Description
class | IrishLowerCaseFilter | Normalises token text to lower case, handling t-prothesis and n-eclipsis (i.e., that 'nAthair' should become 'n-athair').
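The 'nAthair' to 'n-athair' example can be reproduced with a short plain-Java sketch. This is an illustration of the rule as described above, not the Lucene source: IrishLowerSketch, irishLowerCase, and isUpperVowel are hypothetical names, and the vowel set (A, E, I, O, U and their acute-accented forms) is an assumption.

```java
import java.util.Locale;

// Hypothetical helper, not part of Lucene: lower-casing with a hyphen inserted
// after a leading n or t that precedes an upper-case vowel.
final class IrishLowerSketch {
    static String irishLowerCase(String term) {
        if (term.length() > 1
                && (term.charAt(0) == 'n' || term.charAt(0) == 't')
                && isUpperVowel(term.charAt(1))) {
            // Separate the prefixed consonant from the capitalized word.
            term = term.charAt(0) + "-" + term.substring(1);
        }
        return term.toLowerCase(Locale.ROOT);
    }

    // Assumed vowel set: plain and acute-accented upper-case vowels.
    static boolean isUpperVowel(char c) {
        return "AEIOU\u00C1\u00C9\u00CD\u00D3\u00DA".indexOf(c) >= 0;
    }
}
```

Under these assumptions, "nAthair" becomes "n-athair" and "Athair" becomes "athair".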
Uses of TokenFilter in org.apache.lucene.analysis.gl
Subclasses of TokenFilter in org.apache.lucene.analysis.gl
Modifier and Type | Class | Description
class | GalicianMinimalStemFilter | A TokenFilter that applies GalicianMinimalStemmer to stem Galician words.
class | GalicianStemFilter | A TokenFilter that applies GalicianStemmer to stem Galician words.
Uses of TokenFilter in org.apache.lucene.analysis.hi
Subclasses of TokenFilter in org.apache.lucene.analysis.hi
Modifier and Type | Class | Description
class | HindiNormalizationFilter | A TokenFilter that applies HindiNormalizer to normalize the orthography.
class | HindiStemFilter | A TokenFilter that applies HindiStemmer to stem Hindi words.
Uses of TokenFilter in org.apache.lucene.analysis.hu
Subclasses of TokenFilter in org.apache.lucene.analysis.hu
Modifier and Type | Class | Description
class | HungarianLightStemFilter | A TokenFilter that applies HungarianLightStemmer to stem Hungarian words.
Uses of TokenFilter in org.apache.lucene.analysis.hunspell
Subclasses of TokenFilter in org.apache.lucene.analysis.hunspell
Modifier and Type | Class | Description
class | HunspellStemFilter | TokenFilter that uses hunspell affix rules and words to stem tokens.
Uses of TokenFilter in org.apache.lucene.analysis.icu
Subclasses of TokenFilter in org.apache.lucene.analysis.icu
Modifier and Type | Class | Description
class | ICUFoldingFilter | A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.
class | ICUNormalizer2Filter | Normalize token text with ICU's Normalizer2.
class | ICUTransformFilter | A TokenFilter that transforms text with ICU.
Uses of TokenFilter in org.apache.lucene.analysis.id
Subclasses of TokenFilter in org.apache.lucene.analysis.id
Modifier and Type | Class | Description
class | IndonesianStemFilter | A TokenFilter that applies IndonesianStemmer to stem Indonesian words.
Uses of TokenFilter in org.apache.lucene.analysis.in
Subclasses of TokenFilter in org.apache.lucene.analysis.in
Modifier and Type | Class | Description
class | IndicNormalizationFilter | A TokenFilter that applies IndicNormalizer to normalize text in Indian Languages.
Uses of TokenFilter in org.apache.lucene.analysis.it
Subclasses of TokenFilter in org.apache.lucene.analysis.it
Modifier and Type | Class | Description
class | ItalianLightStemFilter | A TokenFilter that applies ItalianLightStemmer to stem Italian words.
Uses of TokenFilter in org.apache.lucene.analysis.ja
Subclasses of TokenFilter in org.apache.lucene.analysis.ja
Modifier and Type | Class | Description
class | JapaneseBaseFormFilter | Replaces term text with the BaseFormAttribute.
class | JapaneseKatakanaStemFilter | A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).
class | JapaneseNumberFilter | A TokenFilter that normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.
class | JapanesePartOfSpeechStopFilter | Removes tokens that match a set of part-of-speech tags.
class | JapaneseReadingFormFilter | A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.
Uses of TokenFilter in org.apache.lucene.analysis.ko
Subclasses of TokenFilter in org.apache.lucene.analysis.ko
Modifier and Type | Class | Description
class | KoreanNumberFilter | A TokenFilter that normalizes Korean numbers to regular Arabic decimal numbers in half-width characters.
class | KoreanPartOfSpeechStopFilter | Removes tokens that match a set of part-of-speech tags.
class | KoreanReadingFormFilter | Replaces term text with the ReadingAttribute, which is the Hangul transcription of Hanja characters.
Uses of TokenFilter in org.apache.lucene.analysis.lv
Subclasses of TokenFilter in org.apache.lucene.analysis.lv
Modifier and Type | Class | Description
class | LatvianStemFilter | A TokenFilter that applies LatvianStemmer to stem Latvian words.
Uses of TokenFilter in org.apache.lucene.analysis.minhash
Subclasses of TokenFilter in org.apache.lucene.analysis.minhash
Modifier and Type | Class | Description
class | MinHashFilter | Generate min hash tokens from an incoming stream of tokens.
Uses of TokenFilter in org.apache.lucene.analysis.miscellaneous
Subclasses of TokenFilter in org.apache.lucene.analysis.miscellaneous
Modifier and Type | Class | Description
class | ASCIIFoldingFilter | This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists.
class | CapitalizationFilter | A filter to apply normal capitalization rules to Tokens.
class | CodepointCountFilter | Removes words that are too long or too short from the stream.
class | ConditionalTokenFilter | Allows skipping TokenFilters based on the current set of attributes.
class | DateRecognizerFilter | Filters all tokens that cannot be parsed to a date, using the provided DateFormat.
class | DelimitedTermFrequencyTokenFilter | Characters before the delimiter are the "token", the textual integer after is the term frequency.
class | FingerprintFilter | Filter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens.
class | FixBrokenOffsetsFilter | Deprecated. Fix the token filters that create broken offsets in the first place.
class | HyphenatedWordsFilter | When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.
class | KeepWordFilter | A TokenFilter that only keeps tokens with text contained in the required words.
class | KeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute.
class | KeywordRepeatFilter | This TokenFilter emits each incoming token twice, once as keyword and once as non-keyword; in other words, once with KeywordAttribute.setKeyword(boolean) set to true and once set to false.
class | LengthFilter | Removes words that are too long or too short from the stream.
class | LimitTokenCountFilter | This TokenFilter limits the number of tokens while indexing.
class | LimitTokenOffsetFilter | Lets all tokens pass through until it sees one with a start offset greater than a configured limit, which won't pass and ends the stream.
class | LimitTokenPositionFilter | This TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit.
class | PatternKeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute.
class | ProtectedTermFilter | A ConditionalTokenFilter that only applies its wrapped filters to tokens that are not contained in a protected set.
class | RemoveDuplicatesTokenFilter | A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.
class | ScandinavianFoldingFilter | This filter folds Scandinavian characters åÅäæÄÆ->a and öÖøØ->o.
class | ScandinavianNormalizationFilter | This filter normalizes the use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ.
class | SetKeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute.
class | StemmerOverrideFilter | Provides the ability to override any KeywordAttribute aware stemmer with custom dictionary-based stemming.
class | TrimFilter | Trims leading and trailing whitespace from Tokens in the stream.
class | TruncateTokenFilter | A token filter for truncating the terms into a specific length.
class | TypeAsSynonymFilter | Adds the TypeAttribute.type() as a synonym, i.e.
class | WordDelimiterFilter | Deprecated. Use WordDelimiterGraphFilter instead: it produces a correct token graph so that e.g.
class | WordDelimiterGraphFilter | Splits words into subwords and performs optional transformations on subword groups, producing a correct token graph so that e.g.

Methods in org.apache.lucene.analysis.miscellaneous that return TokenFilter
Modifier and Type | Method | Description
TokenFilter | WordDelimiterFilterFactory.create(TokenStream input) | Deprecated.
TokenFilter | WordDelimiterGraphFilterFactory.create(TokenStream input) |
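The FingerprintFilter description above ("a concatenation of the sorted and de-duplicated set of input tokens") can be sketched in a few lines of plain Java. FingerprintSketch and fingerprint are hypothetical names, the separator is passed in explicitly, and the sort order is String's natural ordering; none of this is taken from the Lucene source.

```java
import java.util.List;
import java.util.TreeSet;

// Hypothetical helper, not part of Lucene: sorted, de-duplicated concatenation.
final class FingerprintSketch {
    static String fingerprint(List<String> tokens, String separator) {
        // TreeSet removes duplicates and iterates in natural String order.
        return String.join(separator, new TreeSet<>(tokens));
    }
}
```

For the input tokens the, quick, the, fox and a single space as separator, the sketch returns "fox quick the".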
Uses of TokenFilter in org.apache.lucene.analysis.ngram
Subclasses of TokenFilter in org.apache.lucene.analysis.ngram
Modifier and Type | Class | Description
class | EdgeNGramTokenFilter | Tokenizes the given token into n-grams of given size(s).
class | NGramTokenFilter | Tokenizes the input into n-grams of the given size(s).

Methods in org.apache.lucene.analysis.ngram that return TokenFilter
Modifier and Type | Method | Description
TokenFilter | EdgeNGramFilterFactory.create(TokenStream input) |
TokenFilter | NGramFilterFactory.create(TokenStream input) |
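To show what edge n-grams of "given size(s)" look like for one token, here is a plain-Java sketch. EdgeNGramSketch, edgeNGrams, minGram, and maxGram are names chosen for this illustration, and the sketch only produces front-anchored prefixes; it is not the Lucene filter and ignores offsets and positions.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: front-anchored character n-grams.
final class EdgeNGramSketch {
    static List<String> edgeNGrams(String term, int minGram, int maxGram) {
        List<String> out = new ArrayList<>();
        int len = term.codePointCount(0, term.length());
        // Emit prefixes of minGram..maxGram code points, capped at the term length.
        for (int n = minGram; n <= Math.min(maxGram, len); n++) {
            out.add(term.substring(0, term.offsetByCodePoints(0, n)));
        }
        return out;
    }
}
```

With minGram 1 and maxGram 3, the term "apple" yields a, ap, app.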
Uses of TokenFilter in org.apache.lucene.analysis.no
Subclasses of TokenFilter in org.apache.lucene.analysis.no
Modifier and Type | Class | Description
class | NorwegianLightStemFilter | A TokenFilter that applies NorwegianLightStemmer to stem Norwegian words.
class | NorwegianMinimalStemFilter | A TokenFilter that applies NorwegianMinimalStemmer to stem Norwegian words.
Uses of TokenFilter in org.apache.lucene.analysis.pattern
Subclasses of TokenFilter in org.apache.lucene.analysis.pattern
Modifier and Type | Class | Description
class | PatternCaptureGroupTokenFilter | CaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns.
class | PatternReplaceFilter | A TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string.
Uses of TokenFilter in org.apache.lucene.analysis.payloads
Subclasses of TokenFilter in org.apache.lucene.analysis.payloads
Modifier and Type | Class | Description
class | DelimitedPayloadTokenFilter | Characters before the delimiter are the "token", those after are the payload.
class | NumericPayloadTokenFilter | Assigns a payload to a token based on the TypeAttribute.
class | TokenOffsetPayloadTokenFilter | Adds the OffsetAttribute.startOffset() and OffsetAttribute.endOffset() First 4 bytes are the start
class | TypeAsPayloadTokenFilter | Makes the TypeAttribute a payload.
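The "characters before the delimiter are the token, those after are the payload" rule amounts to a single split. The plain-Java sketch below illustrates it; DelimitedSketch and split are hypothetical names, splitting at the first occurrence of the delimiter is an assumption of this sketch, and the payload is kept as raw text rather than encoded into bytes.

```java
// Hypothetical helper, not part of Lucene: splits "token|payload" style terms.
final class DelimitedSketch {
    // Returns {token, payload}; payload is null when no delimiter is present.
    static String[] split(String term, char delimiter) {
        int i = term.indexOf(delimiter);
        if (i < 0) {
            return new String[] { term, null };
        }
        return new String[] { term.substring(0, i), term.substring(i + 1) };
    }
}
```

For example, splitting "foo|1.5" on '|' gives the token "foo" and the payload text "1.5"; a term without the delimiter is returned unchanged with a null payload.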
Uses of TokenFilter in org.apache.lucene.analysis.phonetic
Subclasses of TokenFilter in org.apache.lucene.analysis.phonetic
Modifier and Type | Class | Description
class | BeiderMorseFilter | TokenFilter for Beider-Morse phonetic encoding.
class | DaitchMokotoffSoundexFilter | Create tokens for phonetic matches based on Daitch–Mokotoff Soundex.
class | DoubleMetaphoneFilter | Filter for DoubleMetaphone (supporting secondary codes).
class | PhoneticFilter | Create tokens for phonetic matches.
Uses of TokenFilter in org.apache.lucene.analysis.pt
Subclasses of TokenFilter in org.apache.lucene.analysis.pt
Modifier and Type | Class | Description
class | PortugueseLightStemFilter | A TokenFilter that applies PortugueseLightStemmer to stem Portuguese words.
class | PortugueseMinimalStemFilter | A TokenFilter that applies PortugueseMinimalStemmer to stem Portuguese words.
class | PortugueseStemFilter | A TokenFilter that applies PortugueseStemmer to stem Portuguese words.
Uses of TokenFilter in org.apache.lucene.analysis.reverse
Subclasses of TokenFilter in org.apache.lucene.analysis.reverse
Modifier and Type | Class | Description
class | ReverseStringFilter | Reverse token string, for example "country" => "yrtnuoc".
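The "country" => "yrtnuoc" example can be reproduced with the JDK alone. The sketch below is only an illustration (ReverseSketch and reverse are hypothetical names); it uses StringBuilder.reverse, which keeps surrogate pairs intact, and makes no claim about how the Lucene filter is implemented.

```java
// Hypothetical helper, not part of Lucene: reverses the text of one term.
final class ReverseSketch {
    static String reverse(String term) {
        // StringBuilder.reverse treats surrogate pairs as single units.
        return new StringBuilder(term).reverse().toString();
    }
}
```

Reversed terms are commonly indexed so that a leading-wildcard query can be rewritten as a prefix query on the reversed field; that use is mentioned here as general background, not as something stated in the table above.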
Uses of TokenFilter in org.apache.lucene.analysis.ru
Subclasses of TokenFilter in org.apache.lucene.analysis.ru
Modifier and Type | Class | Description
class | RussianLightStemFilter | A TokenFilter that applies RussianLightStemmer to stem Russian words.
Uses of TokenFilter in org.apache.lucene.analysis.shingle
Subclasses of TokenFilter in org.apache.lucene.analysis.shingle
Modifier and Type | Class | Description
class | FixedShingleFilter | A FixedShingleFilter constructs shingles (token n-grams) from a token stream.
class | ShingleFilter | A ShingleFilter constructs shingles (token n-grams) from a token stream.
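Shingles are n-grams over whole tokens rather than characters. The following plain-Java sketch shows fixed-size shingles over a list of tokens; ShingleSketch and shingles are hypothetical names, the single-space separator is an assumption, and the sketch omits the unigram output and filler tokens a real filter may emit.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: fixed-size token n-grams.
final class ShingleSketch {
    static List<String> shingles(List<String> tokens, int size) {
        List<String> out = new ArrayList<>();
        // Slide a window of `size` tokens across the list.
        for (int i = 0; i + size <= tokens.size(); i++) {
            out.add(String.join(" ", tokens.subList(i, i + size)));
        }
        return out;
    }
}
```

For the tokens please, divide, this, sentence and size 2, the sketch produces "please divide", "divide this", and "this sentence".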
Uses of TokenFilter in org.apache.lucene.analysis.sinks
Subclasses of TokenFilter in org.apache.lucene.analysis.sinks
Modifier and Type | Class | Description
class | TeeSinkTokenFilter | This TokenFilter provides the ability to set aside attribute states that have already been analyzed.
Uses of TokenFilter in org.apache.lucene.analysis.snowball
Subclasses of TokenFilter in org.apache.lucene.analysis.snowball
Modifier and Type | Class | Description
class | SnowballFilter | A filter that stems words using a Snowball-generated stemmer.

Methods in org.apache.lucene.analysis.snowball that return TokenFilter
Modifier and Type | Method | Description
TokenFilter | SnowballPorterFilterFactory.create(TokenStream input) |
Uses of TokenFilter in org.apache.lucene.analysis.sr
Subclasses of TokenFilter in org.apache.lucene.analysis.sr
Modifier and Type | Class | Description
class | SerbianNormalizationFilter | Normalizes Serbian Cyrillic and Latin characters to "bald" Latin.
class | SerbianNormalizationRegularFilter | Normalizes Serbian Cyrillic to Latin.
Uses of TokenFilter in org.apache.lucene.analysis.standard
Subclasses of TokenFilter in org.apache.lucene.analysis.standard
Modifier and Type | Class | Description
class | ClassicFilter | Normalizes tokens extracted with ClassicTokenizer.

Methods in org.apache.lucene.analysis.standard that return TokenFilter
Modifier and Type | Method | Description
TokenFilter | ClassicFilterFactory.create(TokenStream input) |
Uses of TokenFilter in org.apache.lucene.analysis.stempel
Subclasses of TokenFilter in org.apache.lucene.analysis.stempel
Modifier and Type | Class | Description
class | StempelFilter | Transforms the token stream as per the stemming algorithm.
Uses of TokenFilter in org.apache.lucene.analysis.sv
Subclasses of TokenFilter in org.apache.lucene.analysis.sv
Modifier and Type | Class | Description
class | SwedishLightStemFilter | A TokenFilter that applies SwedishLightStemmer to stem Swedish words.
Uses of TokenFilter in org.apache.lucene.analysis.synonym
Subclasses of TokenFilter in org.apache.lucene.analysis.synonym
Modifier and Type | Class | Description
class | SynonymFilter | Deprecated. Use SynonymGraphFilter instead, but be sure to also use FlattenGraphFilter at index time (not at search time) as well.
class | SynonymGraphFilter | Applies single- or multi-token synonyms from a SynonymMap to an incoming TokenStream, producing a fully correct graph output.
Uses of TokenFilter in org.apache.lucene.analysis.tr
Subclasses of TokenFilter in org.apache.lucene.analysis.tr
Modifier and Type | Class | Description
class | ApostropheFilter | Strips all characters after an apostrophe (including the apostrophe itself).
class | TurkishLowerCaseFilter | Normalizes Turkish token text to lower case.
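Stripping "all characters after an apostrophe (including the apostrophe itself)" is easy to picture with a one-term sketch in plain Java. ApostropheSketch and stripAfterApostrophe are hypothetical names, and treating both ' and U+2019 as apostrophes is an assumption of this sketch, not a documented property of the Lucene filter.

```java
// Hypothetical helper, not part of Lucene: truncates a term at its first apostrophe.
final class ApostropheSketch {
    static String stripAfterApostrophe(String term) {
        for (int i = 0; i < term.length(); i++) {
            char c = term.charAt(i);
            if (c == '\'' || c == '\u2019') {
                // Keep only the part before the apostrophe.
                return term.substring(0, i);
            }
        }
        return term;
    }
}
```

In Turkish, suffixes on proper nouns are separated by an apostrophe, so under these assumptions "Türkiye'de" is reduced to "Türkiye".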
Uses of TokenFilter in org.apache.lucene.analysis.util
Subclasses of TokenFilter in org.apache.lucene.analysis.util
Modifier and Type | Class | Description
class | ElisionFilter | Removes elisions from a TokenStream.
Uses of TokenFilter in org.apache.lucene.monitor
Subclasses of TokenFilter in org.apache.lucene.monitor
Modifier and Type | Class | Description
(package private) class | SuffixingNGramTokenFilter |
Uses of TokenFilter in org.apache.lucene.search.highlight
Subclasses of TokenFilter in org.apache.lucene.search.highlight
Modifier and Type | Class | Description
(package private) class | LimitTokenOffsetFilter | This is a simplified version of org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilter to prevent a dependency on analyzers-common.jar.
class | OffsetLimitTokenFilter | This TokenFilter limits the number of tokens while indexing by adding up the current offset.
Uses of TokenFilter in org.apache.lucene.search.suggest.analyzing
Subclasses of TokenFilter in org.apache.lucene.search.suggest.analyzing
Modifier and Type | Class | Description
class | SuggestStopFilter | Like StopFilter except it will not remove the last token if that token was not followed by some token separator.
Uses of TokenFilter in org.apache.lucene.search.suggest.document
Subclasses of TokenFilter in org.apache.lucene.search.suggest.document
Modifier and Type | Class | Description
class | CompletionTokenStream | A ConcatenateGraphFilter, but one that lets the payload be set and provides access to config options.
private static class | ContextSuggestField.PrefixTokenFilter | The ContextSuggestField.PrefixTokenFilter wraps a TokenStream and adds a set of prefixes ahead.
Uses of TokenFilter in org.apache.lucene.search.uhighlight
Subclasses of TokenFilter in org.apache.lucene.search.uhighlight
Modifier and Type | Class | Description
private static class | AnalysisOffsetStrategy.MultiValueTokenStream | Wraps an Analyzer and string text that represents multiple values delimited by a specified character.