Uses of Class
org.apache.lucene.util.AttributeSource
-
Packages that use AttributeSource Package Description org.apache.lucene.analysis Text analysis.org.apache.lucene.analysis.ar Analyzer for Arabic.org.apache.lucene.analysis.bg Analyzer for Bulgarian.org.apache.lucene.analysis.bn Analyzer for Bengali Language.org.apache.lucene.analysis.boost Provides various convenience classes for creating boosts on Tokens.org.apache.lucene.analysis.br Analyzer for Brazilian Portuguese.org.apache.lucene.analysis.cjk Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.org.apache.lucene.analysis.ckb Analyzer for Sorani Kurdish.org.apache.lucene.analysis.cn.smart Analyzer for Simplified Chinese, which indexes words.org.apache.lucene.analysis.commongrams Construct n-grams for frequently occurring terms and phrases.org.apache.lucene.analysis.compound A filter that decomposes compound words you find in many Germanic languages into the word parts.org.apache.lucene.analysis.core Basic, general-purpose analysis components.org.apache.lucene.analysis.cz Analyzer for Czech.org.apache.lucene.analysis.de Analyzer for German.org.apache.lucene.analysis.el Analyzer for Greek.org.apache.lucene.analysis.en Analyzer for English.org.apache.lucene.analysis.es Analyzer for Spanish.org.apache.lucene.analysis.fa Analyzer for Persian.org.apache.lucene.analysis.fi Analyzer for Finnish.org.apache.lucene.analysis.fr Analyzer for French.org.apache.lucene.analysis.ga Analyzer for Irish.org.apache.lucene.analysis.gl Analyzer for Galician.org.apache.lucene.analysis.hi Analyzer for Hindi.org.apache.lucene.analysis.hu Analyzer for Hungarian.org.apache.lucene.analysis.hunspell Stemming TokenFilter using a Java implementation of the Hunspell stemming algorithm.org.apache.lucene.analysis.icu Analysis components based on ICUorg.apache.lucene.analysis.icu.segmentation Tokenizer that breaks text into words with the Unicode Text Segmentation algorithm.org.apache.lucene.analysis.id Analyzer for Indonesian.org.apache.lucene.analysis.in Analyzer for Indian languages.org.apache.lucene.analysis.it Analyzer for Italian.org.apache.lucene.analysis.ja Analyzer for Japanese.org.apache.lucene.analysis.ko Analyzer for Korean.org.apache.lucene.analysis.lv Analyzer for Latvian.org.apache.lucene.analysis.minhash MinHash filtering (for LSH).org.apache.lucene.analysis.miscellaneous Miscellaneous Tokenstreams.org.apache.lucene.analysis.ngram Character n-gram tokenizers and filters.org.apache.lucene.analysis.no Analyzer for Norwegian.org.apache.lucene.analysis.path Analysis components for path-like strings such as filenames.org.apache.lucene.analysis.pattern Set of components for pattern-based (regex) analysis.org.apache.lucene.analysis.payloads Provides various convenience classes for creating payloads on Tokens.org.apache.lucene.analysis.phonetic Analysis components for phonetic search.org.apache.lucene.analysis.pt Analyzer for Portuguese.org.apache.lucene.analysis.reverse Filter to reverse token text.org.apache.lucene.analysis.ru Analyzer for Russian.org.apache.lucene.analysis.shingle Word n-gram filters.org.apache.lucene.analysis.sinks org.apache.lucene.analysis.snowball TokenFilterandAnalyzerimplementations that use Snowball stemmers.org.apache.lucene.analysis.sr Analyzer for Serbian.org.apache.lucene.analysis.standard Fast, general-purpose grammar-based tokenizerStandardTokenizerimplements the Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.org.apache.lucene.analysis.stempel Stempel: Algorithmic Stemmerorg.apache.lucene.analysis.sv Analyzer for Swedish.org.apache.lucene.analysis.synonym Analysis components for Synonyms.org.apache.lucene.analysis.th Analyzer for Thai.org.apache.lucene.analysis.tr Analyzer for Turkish.org.apache.lucene.analysis.util Utility functions for text analysis.org.apache.lucene.analysis.wikipedia Tokenizer that is aware of Wikipedia syntax.org.apache.lucene.codecs Codecs API: API for customization of the encoding and structure of the index.org.apache.lucene.codecs.uniformsplit.sharedterms Pluggable term index / block terms dictionary implementations.org.apache.lucene.document The logical representation of aDocumentfor indexing and searching.org.apache.lucene.index Code to maintain and access indices.org.apache.lucene.monitor Monitoring frameworkorg.apache.lucene.search Code to search indices.org.apache.lucene.search.highlight Highlighting search terms.org.apache.lucene.search.join Support for index-time and query-time joins.org.apache.lucene.search.suggest.analyzing Analyzer based autosuggest.org.apache.lucene.search.suggest.document Support for document suggestionorg.apache.lucene.search.uhighlight The UnifiedHighlighter -- a flexible highlighter that can get offsets from postings, term vectors, or analysis.org.apache.lucene.util Some utility classes.org.apache.lucene.util.graph Utility classes for working with token streams as graphs. -
-
Uses of AttributeSource in org.apache.lucene.analysis
Subclasses of AttributeSource in org.apache.lucene.analysis Modifier and Type Class Description private static classAnalyzer.StringTokenStreamclassCachingTokenFilterThis class can be used if the token attributes of a TokenStream are intended to be consumed more than once.classFilteringTokenFilterAbstract base class for TokenFilters that may remove tokens.classGraphTokenFilterAn abstract TokenFilter that exposes its input stream as a graph CallGraphTokenFilter.incrementBaseToken()to move the root of the graph to the next position in the TokenStream,GraphTokenFilter.incrementGraphToken()to move along the current graph, andGraphTokenFilter.incrementGraph()to reset to the next graph based at the current root.classLowerCaseFilterNormalizes token text to lower case.classStopFilterRemoves stop words from a token stream.classTokenFilterA TokenFilter is a TokenStream whose input is another TokenStream.classTokenizerA Tokenizer is a TokenStream whose input is a Reader.classTokenStreamFields in org.apache.lucene.analysis declared as AttributeSource Modifier and Type Field Description (package private) AttributeSourceGraphTokenFilter.Token. attSourceMethods in org.apache.lucene.analysis with parameters of type AttributeSource Modifier and Type Method Description (package private) voidGraphTokenFilter.Token. reset(AttributeSource attSource)Constructors in org.apache.lucene.analysis with parameters of type AttributeSource Constructor Description Token(AttributeSource attSource)TokenStream(AttributeSource input)A TokenStream that uses the same attributes as the supplied one. -
Uses of AttributeSource in org.apache.lucene.analysis.ar
Subclasses of AttributeSource in org.apache.lucene.analysis.ar Modifier and Type Class Description classArabicNormalizationFilterATokenFilterthat appliesArabicNormalizerto normalize the orthography.classArabicStemFilterATokenFilterthat appliesArabicStemmerto stem Arabic words.. -
Uses of AttributeSource in org.apache.lucene.analysis.bg
Subclasses of AttributeSource in org.apache.lucene.analysis.bg Modifier and Type Class Description classBulgarianStemFilterATokenFilterthat appliesBulgarianStemmerto stem Bulgarian words. -
Uses of AttributeSource in org.apache.lucene.analysis.bn
Subclasses of AttributeSource in org.apache.lucene.analysis.bn Modifier and Type Class Description classBengaliNormalizationFilterATokenFilterthat appliesBengaliNormalizerto normalize the orthography.classBengaliStemFilterATokenFilterthat appliesBengaliStemmerto stem Bengali words. -
Uses of AttributeSource in org.apache.lucene.analysis.boost
Subclasses of AttributeSource in org.apache.lucene.analysis.boost Modifier and Type Class Description classDelimitedBoostTokenFilterCharacters before the delimiter are the "token", those after are the boost. -
Uses of AttributeSource in org.apache.lucene.analysis.br
Subclasses of AttributeSource in org.apache.lucene.analysis.br Modifier and Type Class Description classBrazilianStemFilterATokenFilterthat appliesBrazilianStemmer. -
Uses of AttributeSource in org.apache.lucene.analysis.cjk
Subclasses of AttributeSource in org.apache.lucene.analysis.cjk Modifier and Type Class Description classCJKBigramFilterForms bigrams of CJK terms that are generated from StandardTokenizer or ICUTokenizer.classCJKWidthFilterATokenFilterthat normalizes CJK width differences: Folds fullwidth ASCII variants into the equivalent basic latin Folds halfwidth Katakana variants into the equivalent kana -
Uses of AttributeSource in org.apache.lucene.analysis.ckb
Subclasses of AttributeSource in org.apache.lucene.analysis.ckb Modifier and Type Class Description classSoraniNormalizationFilterATokenFilterthat appliesSoraniNormalizerto normalize the orthography.classSoraniStemFilterATokenFilterthat appliesSoraniStemmerto stem Sorani words. -
Uses of AttributeSource in org.apache.lucene.analysis.cn.smart
Subclasses of AttributeSource in org.apache.lucene.analysis.cn.smart Modifier and Type Class Description classHMMChineseTokenizerTokenizer for Chinese or mixed Chinese-English text. -
Uses of AttributeSource in org.apache.lucene.analysis.commongrams
Subclasses of AttributeSource in org.apache.lucene.analysis.commongrams Modifier and Type Class Description classCommonGramsFilterConstruct bigrams for frequently occurring terms while indexing.classCommonGramsQueryFilterWrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram. -
Uses of AttributeSource in org.apache.lucene.analysis.compound
Subclasses of AttributeSource in org.apache.lucene.analysis.compound Modifier and Type Class Description classCompoundWordTokenFilterBaseBase class for decomposition token filters.classDictionaryCompoundWordTokenFilterATokenFilterthat decomposes compound words found in many Germanic languages.classHyphenationCompoundWordTokenFilterATokenFilterthat decomposes compound words found in many Germanic languages. -
Uses of AttributeSource in org.apache.lucene.analysis.core
Subclasses of AttributeSource in org.apache.lucene.analysis.core Modifier and Type Class Description classDecimalDigitFilterFolds all Unicode digits in[:General_Category=Decimal_Number:]to Basic Latin digits (0-9).classFlattenGraphFilterConverts an incoming graph token stream, such as one fromSynonymGraphFilter, into a flat form so that all nodes form a single linear chain with no side paths.classKeywordTokenizerEmits the entire input as a single token.classLetterTokenizerA LetterTokenizer is a tokenizer that divides text at non-letters.classLowerCaseFilterNormalizes token text to lower case.classStopFilterRemoves stop words from a token stream.classTypeTokenFilterRemoves tokens whose types appear in a set of blocked types from a token stream.classUnicodeWhitespaceTokenizerA UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace.classUpperCaseFilterNormalizes token text to UPPER CASE.classWhitespaceTokenizerA tokenizer that divides text at whitespace characters as defined byCharacter.isWhitespace(int). -
Uses of AttributeSource in org.apache.lucene.analysis.cz
Subclasses of AttributeSource in org.apache.lucene.analysis.cz Modifier and Type Class Description classCzechStemFilterATokenFilterthat appliesCzechStemmerto stem Czech words. -
Uses of AttributeSource in org.apache.lucene.analysis.de
Subclasses of AttributeSource in org.apache.lucene.analysis.de Modifier and Type Class Description classGermanLightStemFilterATokenFilterthat appliesGermanLightStemmerto stem German words.classGermanMinimalStemFilterATokenFilterthat appliesGermanMinimalStemmerto stem German words.classGermanNormalizationFilterNormalizes German characters according to the heuristics of the German2 snowball algorithm.classGermanStemFilterATokenFilterthat stems German words. -
Uses of AttributeSource in org.apache.lucene.analysis.el
Subclasses of AttributeSource in org.apache.lucene.analysis.el Modifier and Type Class Description classGreekLowerCaseFilterNormalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.classGreekStemFilterATokenFilterthat appliesGreekStemmerto stem Greek words. -
Uses of AttributeSource in org.apache.lucene.analysis.en
Subclasses of AttributeSource in org.apache.lucene.analysis.en Modifier and Type Class Description classEnglishMinimalStemFilterATokenFilterthat appliesEnglishMinimalStemmerto stem English words.classEnglishPossessiveFilterTokenFilter that removes possessives (trailing 's) from words.classKStemFilterA high-performance kstem filter for english.classPorterStemFilterTransforms the token stream as per the Porter stemming algorithm. -
Uses of AttributeSource in org.apache.lucene.analysis.es
Subclasses of AttributeSource in org.apache.lucene.analysis.es Modifier and Type Class Description classSpanishLightStemFilterATokenFilterthat appliesSpanishLightStemmerto stem Spanish words.classSpanishMinimalStemFilterATokenFilterthat appliesSpanishMinimalStemmerto stem Spanish words. -
Uses of AttributeSource in org.apache.lucene.analysis.fa
Subclasses of AttributeSource in org.apache.lucene.analysis.fa Modifier and Type Class Description classPersianNormalizationFilterATokenFilterthat appliesPersianNormalizerto normalize the orthography. -
Uses of AttributeSource in org.apache.lucene.analysis.fi
Subclasses of AttributeSource in org.apache.lucene.analysis.fi Modifier and Type Class Description classFinnishLightStemFilterATokenFilterthat appliesFinnishLightStemmerto stem Finnish words. -
Uses of AttributeSource in org.apache.lucene.analysis.fr
Subclasses of AttributeSource in org.apache.lucene.analysis.fr Modifier and Type Class Description classFrenchLightStemFilterATokenFilterthat appliesFrenchLightStemmerto stem French words.classFrenchMinimalStemFilterATokenFilterthat appliesFrenchMinimalStemmerto stem French words. -
Uses of AttributeSource in org.apache.lucene.analysis.ga
Subclasses of AttributeSource in org.apache.lucene.analysis.ga Modifier and Type Class Description classIrishLowerCaseFilterNormalises token text to lower case, handling t-prothesis and n-eclipsis (i.e., that 'nAthair' should become 'n-athair') -
Uses of AttributeSource in org.apache.lucene.analysis.gl
Subclasses of AttributeSource in org.apache.lucene.analysis.gl Modifier and Type Class Description classGalicianMinimalStemFilterATokenFilterthat appliesGalicianMinimalStemmerto stem Galician words.classGalicianStemFilterATokenFilterthat appliesGalicianStemmerto stem Galician words. -
Uses of AttributeSource in org.apache.lucene.analysis.hi
Subclasses of AttributeSource in org.apache.lucene.analysis.hi Modifier and Type Class Description classHindiNormalizationFilterATokenFilterthat appliesHindiNormalizerto normalize the orthography.classHindiStemFilterATokenFilterthat appliesHindiStemmerto stem Hindi words. -
Uses of AttributeSource in org.apache.lucene.analysis.hu
Subclasses of AttributeSource in org.apache.lucene.analysis.hu Modifier and Type Class Description classHungarianLightStemFilterATokenFilterthat appliesHungarianLightStemmerto stem Hungarian words. -
Uses of AttributeSource in org.apache.lucene.analysis.hunspell
Subclasses of AttributeSource in org.apache.lucene.analysis.hunspell Modifier and Type Class Description classHunspellStemFilterTokenFilter that uses hunspell affix rules and words to stem tokens. -
Uses of AttributeSource in org.apache.lucene.analysis.icu
Subclasses of AttributeSource in org.apache.lucene.analysis.icu Modifier and Type Class Description classICUFoldingFilterA TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.classICUNormalizer2FilterNormalize token text with ICU'sNormalizer2classICUTransformFilterATokenFilterthat transforms text with ICU. -
Uses of AttributeSource in org.apache.lucene.analysis.icu.segmentation
Subclasses of AttributeSource in org.apache.lucene.analysis.icu.segmentation Modifier and Type Class Description classICUTokenizerBreaks text into words according to UAX #29: Unicode Text Segmentation (http://www.unicode.org/reports/tr29/) -
Uses of AttributeSource in org.apache.lucene.analysis.id
Subclasses of AttributeSource in org.apache.lucene.analysis.id Modifier and Type Class Description classIndonesianStemFilterATokenFilterthat appliesIndonesianStemmerto stem Indonesian words. -
Uses of AttributeSource in org.apache.lucene.analysis.in
Subclasses of AttributeSource in org.apache.lucene.analysis.in Modifier and Type Class Description classIndicNormalizationFilterATokenFilterthat appliesIndicNormalizerto normalize text in Indian Languages. -
Uses of AttributeSource in org.apache.lucene.analysis.it
Subclasses of AttributeSource in org.apache.lucene.analysis.it Modifier and Type Class Description classItalianLightStemFilterATokenFilterthat appliesItalianLightStemmerto stem Italian words. -
Uses of AttributeSource in org.apache.lucene.analysis.ja
Subclasses of AttributeSource in org.apache.lucene.analysis.ja Modifier and Type Class Description classJapaneseBaseFormFilterReplaces term text with theBaseFormAttribute.classJapaneseKatakanaStemFilterATokenFilterthat normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).classJapaneseNumberFilterATokenFilterthat normalizes Japanese numbers (kansūji) to regular Arabic decimal numbers in half-width characters.classJapanesePartOfSpeechStopFilterRemoves tokens that match a set of part-of-speech tags.classJapaneseReadingFormFilterATokenFilterthat replaces the term attribute with the reading of a token in either katakana or romaji form.classJapaneseTokenizerTokenizer for Japanese that uses morphological analysis. -
Uses of AttributeSource in org.apache.lucene.analysis.ko
Subclasses of AttributeSource in org.apache.lucene.analysis.ko Modifier and Type Class Description classKoreanNumberFilterATokenFilterthat normalizes Korean numbers to regular Arabic decimal numbers in half-width characters.classKoreanPartOfSpeechStopFilterRemoves tokens that match a set of part-of-speech tags.classKoreanReadingFormFilterReplaces term text with theReadingAttributewhich is the Hangul transcription of Hanja characters.classKoreanTokenizerTokenizer for Korean that uses morphological analysis. -
Uses of AttributeSource in org.apache.lucene.analysis.lv
Subclasses of AttributeSource in org.apache.lucene.analysis.lv Modifier and Type Class Description classLatvianStemFilterATokenFilterthat appliesLatvianStemmerto stem Latvian words. -
Uses of AttributeSource in org.apache.lucene.analysis.minhash
Subclasses of AttributeSource in org.apache.lucene.analysis.minhash Modifier and Type Class Description classMinHashFilterGenerate min hash tokens from an incoming stream of tokens. -
Uses of AttributeSource in org.apache.lucene.analysis.miscellaneous
Subclasses of AttributeSource in org.apache.lucene.analysis.miscellaneous Modifier and Type Class Description classASCIIFoldingFilterThis class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists.classCapitalizationFilterA filter to apply normal capitalization rules to Tokens.classCodepointCountFilterRemoves words that are too long or too short from the stream.classConcatenateGraphFilterConcatenates/Joins every incoming token with a separator into one output token for every path through the token stream (which is a graph).classConcatenatingTokenStreamA TokenStream that takes an array of input TokenStreams as sources, and concatenates them together.classConditionalTokenFilterAllows skipping TokenFilters based on the current set of attributes.private classConditionalTokenFilter.OneTimeWrapperclassDateRecognizerFilterFilters all tokens that cannot be parsed to a date, using the providedDateFormat.classDelimitedTermFrequencyTokenFilterCharacters before the delimiter are the "token", the textual integer after is the term frequency.classEmptyTokenStreamAn always exhausted token stream.classFingerprintFilterFilter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens.classFixBrokenOffsetsFilterDeprecated.Fix the token filters that create broken offsets in the first place.classHyphenatedWordsFilterWhen the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.classKeepWordFilterA TokenFilter that only keeps tokens with text contained in the required words.classKeywordMarkerFilterMarks terms as keywords via theKeywordAttribute.classKeywordRepeatFilterThis TokenFilter emits each incoming token twice once as keyword and once non-keyword, in other words once withKeywordAttribute.setKeyword(boolean)set totrueand once set tofalse.classLengthFilterRemoves words that are too long or too short from the stream.classLimitTokenCountFilterThis TokenFilter limits the number of tokens while indexing.classLimitTokenOffsetFilterLets all tokens pass through until it sees one with a start offset <= a configured limit, which won't pass and ends the stream.classLimitTokenPositionFilterThis TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit.classPatternKeywordMarkerFilterMarks terms as keywords via theKeywordAttribute.classProtectedTermFilterA ConditionalTokenFilter that only applies its wrapped filters to tokens that are not contained in a protected set.classRemoveDuplicatesTokenFilterA TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.classScandinavianFoldingFilterThis filter folds Scandinavian characters åÅäæÄÆ->a and öÖøØ->o.classScandinavianNormalizationFilterThis filter normalize use of the interchangeable Scandinavian characters æÆäÄöÖøØ and folded variants (aa, ao, ae, oe and oo) by transforming them to åÅæÆøØ.classSetKeywordMarkerFilterMarks terms as keywords via theKeywordAttribute.classStemmerOverrideFilterProvides the ability to override anyKeywordAttributeaware stemmer with custom dictionary-based stemming.classTrimFilterTrims leading and trailing whitespace from Tokens in the stream.classTruncateTokenFilterA token filter for truncating the terms into a specific length.classTypeAsSynonymFilterAdds theTypeAttribute.type()as a synonym, i.e.classWordDelimiterFilterDeprecated.UseWordDelimiterGraphFilterinstead: it produces a correct token graph so that e.g.classWordDelimiterGraphFilterSplits words into subwords and performs optional transformations on subword groups, producing a correct token graph so that e.g.Methods in org.apache.lucene.analysis.miscellaneous that return AttributeSource Modifier and Type Method Description private static AttributeSourceConcatenatingTokenStream. combineSources(TokenStream... sources)Constructors in org.apache.lucene.analysis.miscellaneous with parameters of type AttributeSource Constructor Description OneTimeWrapper(AttributeSource attributeSource) -
Uses of AttributeSource in org.apache.lucene.analysis.ngram
Subclasses of AttributeSource in org.apache.lucene.analysis.ngram Modifier and Type Class Description classEdgeNGramTokenFilterTokenizes the given token into n-grams of given size(s).classEdgeNGramTokenizerTokenizes the input from an edge into n-grams of given size(s).classNGramTokenFilterTokenizes the input into n-grams of the given size(s).classNGramTokenizerTokenizes the input into n-grams of the given size(s). -
Uses of AttributeSource in org.apache.lucene.analysis.no
Subclasses of AttributeSource in org.apache.lucene.analysis.no Modifier and Type Class Description classNorwegianLightStemFilterATokenFilterthat appliesNorwegianLightStemmerto stem Norwegian words.classNorwegianMinimalStemFilterATokenFilterthat appliesNorwegianMinimalStemmerto stem Norwegian words. -
Uses of AttributeSource in org.apache.lucene.analysis.path
Subclasses of AttributeSource in org.apache.lucene.analysis.path Modifier and Type Class Description classPathHierarchyTokenizerTokenizer for path-like hierarchies.classReversePathHierarchyTokenizerTokenizer for domain-like hierarchies. -
Uses of AttributeSource in org.apache.lucene.analysis.pattern
Subclasses of AttributeSource in org.apache.lucene.analysis.pattern Modifier and Type Class Description classPatternCaptureGroupTokenFilterCaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns.classPatternReplaceFilterA TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string.classPatternTokenizerThis tokenizer uses regex pattern matching to construct distinct tokens for the input stream.classSimplePatternSplitTokenizerclassSimplePatternTokenizer -
Uses of AttributeSource in org.apache.lucene.analysis.payloads
Subclasses of AttributeSource in org.apache.lucene.analysis.payloads Modifier and Type Class Description classDelimitedPayloadTokenFilterCharacters before the delimiter are the "token", those after are the payload.classNumericPayloadTokenFilterAssigns a payload to a token based on theTypeAttributeclassTokenOffsetPayloadTokenFilterAdds theOffsetAttribute.startOffset()andOffsetAttribute.endOffset()First 4 bytes are the startclassTypeAsPayloadTokenFilterMakes theTypeAttributea payload. -
Uses of AttributeSource in org.apache.lucene.analysis.phonetic
Subclasses of AttributeSource in org.apache.lucene.analysis.phonetic Modifier and Type Class Description classBeiderMorseFilterTokenFilter for Beider-Morse phonetic encoding.classDaitchMokotoffSoundexFilterCreate tokens for phonetic matches based on Daitch–Mokotoff Soundex.classDoubleMetaphoneFilterFilter for DoubleMetaphone (supporting secondary codes)classPhoneticFilterCreate tokens for phonetic matches. -
Uses of AttributeSource in org.apache.lucene.analysis.pt
Subclasses of AttributeSource in org.apache.lucene.analysis.pt Modifier and Type Class Description classPortugueseLightStemFilterATokenFilterthat appliesPortugueseLightStemmerto stem Portuguese words.classPortugueseMinimalStemFilterATokenFilterthat appliesPortugueseMinimalStemmerto stem Portuguese words.classPortugueseStemFilterATokenFilterthat appliesPortugueseStemmerto stem Portuguese words. -
Uses of AttributeSource in org.apache.lucene.analysis.reverse
Subclasses of AttributeSource in org.apache.lucene.analysis.reverse Modifier and Type Class Description classReverseStringFilterReverse token string, for example "country" => "yrtnuoc". -
Uses of AttributeSource in org.apache.lucene.analysis.ru
Subclasses of AttributeSource in org.apache.lucene.analysis.ru Modifier and Type Class Description classRussianLightStemFilterATokenFilterthat appliesRussianLightStemmerto stem Russian words. -
Uses of AttributeSource in org.apache.lucene.analysis.shingle
Subclasses of AttributeSource in org.apache.lucene.analysis.shingle Modifier and Type Class Description classFixedShingleFilterA FixedShingleFilter constructs shingles (token n-grams) from a token stream.classShingleFilterA ShingleFilter constructs shingles (token n-grams) from a token stream.Fields in org.apache.lucene.analysis.shingle declared as AttributeSource Modifier and Type Field Description (package private) AttributeSourceShingleFilter.InputWindowToken. attSourceprivate AttributeSourceShingleFilter. nextInputStreamTokenWhen the next input stream token has a position increment greater than one, it is stored in this field until sufficient filler tokens have been inserted to account for the position increment.Constructors in org.apache.lucene.analysis.shingle with parameters of type AttributeSource Constructor Description InputWindowToken(AttributeSource attSource) -
Uses of AttributeSource in org.apache.lucene.analysis.sinks
Subclasses of AttributeSource in org.apache.lucene.analysis.sinks Modifier and Type Class Description classTeeSinkTokenFilterThis TokenFilter provides the ability to set aside attribute states that have already been analyzed.static classTeeSinkTokenFilter.SinkTokenStreamTokenStream output from a tee.Constructors in org.apache.lucene.analysis.sinks with parameters of type AttributeSource Constructor Description SinkTokenStream(AttributeSource source, TeeSinkTokenFilter.States cachedStates) -
Uses of AttributeSource in org.apache.lucene.analysis.snowball
Subclasses of AttributeSource in org.apache.lucene.analysis.snowball Modifier and Type Class Description classSnowballFilterA filter that stems words using a Snowball-generated stemmer. -
Uses of AttributeSource in org.apache.lucene.analysis.sr
Subclasses of AttributeSource in org.apache.lucene.analysis.sr Modifier and Type Class Description classSerbianNormalizationFilterNormalizes Serbian Cyrillic and Latin characters to "bald" Latin.classSerbianNormalizationRegularFilterNormalizes Serbian Cyrillic to Latin. -
Uses of AttributeSource in org.apache.lucene.analysis.standard
Subclasses of AttributeSource in org.apache.lucene.analysis.standard Modifier and Type Class Description classClassicFilterNormalizes tokens extracted withClassicTokenizer.classClassicTokenizerA grammar-based tokenizer constructed with JFlexclassStandardTokenizerA grammar-based tokenizer constructed with JFlex.classUAX29URLEmailTokenizerThis class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29 URLs and email addresses are also tokenized according to the relevant RFCs. -
Uses of AttributeSource in org.apache.lucene.analysis.stempel
Subclasses of AttributeSource in org.apache.lucene.analysis.stempel Modifier and Type Class Description classStempelFilterTransforms the token stream as per the stemming algorithm. -
Uses of AttributeSource in org.apache.lucene.analysis.sv
Subclasses of AttributeSource in org.apache.lucene.analysis.sv Modifier and Type Class Description classSwedishLightStemFilterATokenFilterthat appliesSwedishLightStemmerto stem Swedish words. -
Uses of AttributeSource in org.apache.lucene.analysis.synonym
Subclasses of AttributeSource in org.apache.lucene.analysis.synonym Modifier and Type Class Description classSynonymFilterDeprecated.UseSynonymGraphFilterinstead, but be sure to also useFlattenGraphFilterat index time (not at search time) as well.classSynonymGraphFilterApplies single- or multi-token synonyms from aSynonymMapto an incomingTokenStream, producing a fully correct graph output. -
Uses of AttributeSource in org.apache.lucene.analysis.th
Subclasses of AttributeSource in org.apache.lucene.analysis.th Modifier and Type Class Description classThaiTokenizerTokenizer that useBreakIteratorto tokenize Thai text. -
Uses of AttributeSource in org.apache.lucene.analysis.tr
Subclasses of AttributeSource in org.apache.lucene.analysis.tr Modifier and Type Class Description classApostropheFilterStrips all characters after an apostrophe (including the apostrophe itself).classTurkishLowerCaseFilterNormalizes Turkish token text to lower case. -
Uses of AttributeSource in org.apache.lucene.analysis.util
Subclasses of AttributeSource in org.apache.lucene.analysis.util Modifier and Type Class Description classCharTokenizerAn abstract base class for simple, character-oriented tokenizers.classElisionFilterRemoves elisions from aTokenStream.classSegmentingTokenizerBaseBreaks text into sentences with aBreakIteratorand allows subclasses to decompose these sentences into words. -
Uses of AttributeSource in org.apache.lucene.analysis.wikipedia
Subclasses of AttributeSource in org.apache.lucene.analysis.wikipedia Modifier and Type Class Description classWikipediaTokenizerExtension of StandardTokenizer that is aware of Wikipedia syntax. -
Uses of AttributeSource in org.apache.lucene.codecs
Methods in org.apache.lucene.codecs that return AttributeSource Modifier and Type Method Description AttributeSourceDocValuesConsumer.MergedTermsEnum. attributes() -
Uses of AttributeSource in org.apache.lucene.codecs.uniformsplit.sharedterms
Methods in org.apache.lucene.codecs.uniformsplit.sharedterms that return AttributeSource Modifier and Type Method Description AttributeSourceSTMergingTermsEnum. attributes() -
Uses of AttributeSource in org.apache.lucene.document
Subclasses of AttributeSource in org.apache.lucene.document Modifier and Type Class Description private static classFeatureField.FeatureTokenStreamprivate static classField.BinaryTokenStreamprivate static classField.StringTokenStream -
Uses of AttributeSource in org.apache.lucene.index
Fields in org.apache.lucene.index declared as AttributeSource Modifier and Type Field Description (package private) AttributeSourceFieldInvertState. attributeSourceprivate AttributeSourceBaseTermsEnum. attsMethods in org.apache.lucene.index that return AttributeSource Modifier and Type Method Description AttributeSourceBaseTermsEnum. attributes()AttributeSourceFilteredTermsEnum. attributes()Returns the related attributes, the returnedAttributeSourceis shared with the delegateTermsEnum.AttributeSourceFilterLeafReader.FilterTermsEnum. attributes()abstract AttributeSourceTermsEnum. attributes()Returns the related attributes.AttributeSourceFieldInvertState. getAttributeSource()Returns theAttributeSourcefrom theTokenStreamthat provided the indexed tokens for this field.Methods in org.apache.lucene.index with parameters of type AttributeSource Modifier and Type Method Description (package private) voidFieldInvertState. setAttributeSource(AttributeSource attributeSource)Sets attributeSource to a new instance. -
Uses of AttributeSource in org.apache.lucene.monitor
Subclasses of AttributeSource in org.apache.lucene.monitor Modifier and Type Class Description (package private) classSuffixingNGramTokenFilter(package private) classTermsEnumTokenStreamA TokenStream created from aTermsEnum -
Uses of AttributeSource in org.apache.lucene.search
Fields in org.apache.lucene.search declared as AttributeSource Modifier and Type Field Description AttributeSourceTermCollectingRewrite.TermCollector. attributesattributes used for communication with the enumprivate AttributeSourceFuzzyTermsEnum. attsMethods in org.apache.lucene.search that return AttributeSource Modifier and Type Method Description AttributeSourceFuzzyTermsEnum. attributes()Methods in org.apache.lucene.search with parameters of type AttributeSource Modifier and Type Method Description protected TermsEnumAutomatonQuery. getTermsEnum(Terms terms, AttributeSource atts)protected TermsEnumFuzzyQuery. getTermsEnum(Terms terms, AttributeSource atts)protected abstract TermsEnumMultiTermQuery. getTermsEnum(Terms terms, AttributeSource atts)Construct the enumeration to be used, expanding the pattern term.protected TermsEnumMultiTermQuery.RewriteMethod. getTermsEnum(MultiTermQuery query, Terms terms, AttributeSource atts)Returns theMultiTermQuerysTermsEnumConstructors in org.apache.lucene.search with parameters of type AttributeSource Constructor Description FuzzyTermsEnum(Terms terms, AttributeSource atts, Term term, int maxEdits, int prefixLength, boolean transpositions)Constructor for enumeration of all terms from specifiedreaderwhich share a prefix of lengthprefixLengthwithtermand which have at mostmaxEditsedits.FuzzyTermsEnum(Terms terms, AttributeSource atts, Term term, java.util.function.Supplier<FuzzyAutomatonBuilder> automatonBuilder) -
Uses of AttributeSource in org.apache.lucene.search.highlight
Subclasses of AttributeSource in org.apache.lucene.search.highlight Modifier and Type Class Description (package private) classLimitTokenOffsetFilterThis is a simplified version of org.apache.lucene.analysis.miscellaneous.LimitTokenOffsetFilter to prevent a dependency on analyzers-common.jar.classOffsetLimitTokenFilterThis TokenFilter limits the number of tokens while indexing by adding up the current offset.classTokenStreamFromTermVectorTokenStream created from a term vector field. -
Uses of AttributeSource in org.apache.lucene.search.join
Methods in org.apache.lucene.search.join with parameters of type AttributeSource Modifier and Type Method Description protected TermsEnumTermsQuery. getTermsEnum(Terms terms, AttributeSource atts) -
Uses of AttributeSource in org.apache.lucene.search.suggest.analyzing
Subclasses of AttributeSource in org.apache.lucene.search.suggest.analyzing Modifier and Type Class Description classSuggestStopFilterLikeStopFilterexcept it will not remove the last token if that token was not followed by some token separator. -
Uses of AttributeSource in org.apache.lucene.search.suggest.document
Subclasses of AttributeSource in org.apache.lucene.search.suggest.document Modifier and Type Class Description classCompletionTokenStreamAConcatenateGraphFilterbut we can set the payload and provide access to config options.private static classContextSuggestField.PrefixTokenFilterTheContextSuggestField.PrefixTokenFilterwraps aTokenStreamand adds a set prefixes ahead. -
Uses of AttributeSource in org.apache.lucene.search.uhighlight
Subclasses of AttributeSource in org.apache.lucene.search.uhighlight Modifier and Type Class Description private static classAnalysisOffsetStrategy.MultiValueTokenStreamWraps anAnalyzerand string text that represents multiple values delimited by a specified character. -
Uses of AttributeSource in org.apache.lucene.util
Methods in org.apache.lucene.util that return AttributeSource Modifier and Type Method Description AttributeSourceAttributeSource. cloneAttributes()Performs a clone of allAttributeImplinstances returned in a newAttributeSourceinstance.Methods in org.apache.lucene.util with parameters of type AttributeSource Modifier and Type Method Description voidAttributeSource. copyTo(AttributeSource target)Copies the contents of thisAttributeSourceto the given targetAttributeSource.Constructors in org.apache.lucene.util with parameters of type AttributeSource Constructor Description AttributeSource(AttributeSource input)An AttributeSource that uses the same attributes as the supplied one. -
Uses of AttributeSource in org.apache.lucene.util.graph
Subclasses of AttributeSource in org.apache.lucene.util.graph Modifier and Type Class Description private classGraphTokenStreamFiniteStrings.FiniteStringsTokenStreamFields in org.apache.lucene.util.graph declared as AttributeSource Modifier and Type Field Description private AttributeSource[]GraphTokenStreamFiniteStrings. tokensMethods in org.apache.lucene.util.graph that return types with arguments of type AttributeSource Modifier and Type Method Description java.util.List<AttributeSource>GraphTokenStreamFiniteStrings. getTerms(int state)Returns the list of tokens that start at the provided state
-