Package org.apache.lucene.analysis.icu
Class ICUNormalizer2FilterFactory
- java.lang.Object
  - org.apache.lucene.analysis.util.AbstractAnalysisFactory
    - org.apache.lucene.analysis.util.TokenFilterFactory
      - org.apache.lucene.analysis.icu.ICUNormalizer2FilterFactory
public class ICUNormalizer2FilterFactory extends TokenFilterFactory
Factory for ICUNormalizer2Filter. Supports the following attributes:
- name: A Unicode Normalization Form, one of 'nfc', 'nfkc', 'nfkc_cf'. Default is nfkc_cf.
- mode: Either 'compose' or 'decompose'. Default is compose. Use 'decompose' with nfc or nfkc to get nfd or nfkd, respectively.
- filter: A UnicodeSet pattern. Codepoints outside the set are always left unchanged. Default is [] (the null set, no filtering).
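The attribute combinations above can be illustrated with a minimal sketch built on the JDK's java.text.Normalizer. This is a stand-in only: the factory itself works with ICU's Normalizer2, and the JDK has no nfkc_cf form, so that value is not covered. The class and helper names (NormalizerAttributesSketch, toForm, normalizeFiltered) are hypothetical, and the filter is modeled as a simple code point predicate rather than a UnicodeSet pattern.

```java
import java.text.Normalizer;
import java.util.function.IntPredicate;

// Illustration only: uses the JDK normalizer as a stand-in for ICU's Normalizer2.
final class NormalizerAttributesSketch {

    // Hypothetical helper: maps the 'name' and 'mode' attributes to a JDK form.
    // Only nfc and nfkc are covered because the JDK has no nfkc_cf equivalent.
    static Normalizer.Form toForm(String name, String mode) {
        boolean decompose = "decompose".equals(mode);
        switch (name) {
            case "nfc":
                return decompose ? Normalizer.Form.NFD : Normalizer.Form.NFC;
            case "nfkc":
                return decompose ? Normalizer.Form.NFKD : Normalizer.Form.NFKC;
            default:
                throw new IllegalArgumentException("not covered by this sketch: " + name);
        }
    }

    // Hypothetical helper: normalizes only runs of code points accepted by inSet
    // and copies every other code point through unchanged.
    static String normalizeFiltered(String text, Normalizer.Form form, IntPredicate inSet) {
        StringBuilder out = new StringBuilder();
        StringBuilder run = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (inSet.test(cp)) {
                run.appendCodePoint(cp);
            } else {
                flush(run, form, out);
                out.appendCodePoint(cp);
            }
            i += Character.charCount(cp);
        }
        flush(run, form, out);
        return out.toString();
    }

    // Normalizes the pending run (if any), appends it to out, and resets the run.
    private static void flush(StringBuilder run, Normalizer.Form form, StringBuilder out) {
        if (run.length() > 0) {
            out.append(Normalizer.normalize(run, form));
            run.setLength(0);
        }
    }

    public static void main(String[] args) {
        // nfc + decompose behaves like NFD: U+00E9 becomes 'e' followed by U+0301.
        String nfd = Normalizer.normalize("\u00E9", toForm("nfc", "decompose"));
        System.out.println(nfd.length()); // prints 2

        // Only U+FB01 is inside the filter, so U+FF21 is left unchanged.
        String filtered = normalizeFiltered("\uFB01\uFF21", toForm("nfkc", "compose"),
                cp -> cp == 0xFB01);
        System.out.println(filtered.equals("fi\uFF21")); // prints true
    }
}
```

With the real factory, the same choices would be passed as string entries of the args map given to the constructor; the sketch only shows what each combination means for the text.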
- Since:
- 3.1.0
- See Also:
ICUNormalizer2Filter, Normalizer2, FilteredNormalizer2
Field Summary
Fields
Modifier and Type | Field | Description
static java.lang.String | NAME | SPI name
private com.ibm.icu.text.Normalizer2 | normalizer |
Fields inherited from class org.apache.lucene.analysis.util.AbstractAnalysisFactory
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
Constructor Summary
Constructors
Constructor | Description
ICUNormalizer2FilterFactory(java.util.Map<java.lang.String,java.lang.String> args) | Creates a new ICUNormalizer2FilterFactory
-
Method Summary
Modifier and Type | Method | Description
TokenStream | create(TokenStream input) | Transform the specified input TokenStream.
TokenStream | normalize(TokenStream input) | Normalize the specified input TokenStream. While the default implementation returns input unchanged, filters that should be applied at normalization time can delegate to the create method.
Methods inherited from class org.apache.lucene.analysis.util.TokenFilterFactory
availableTokenFilters, findSPIName, forName, lookupClass, reloadTokenFilters
-
Methods inherited from class org.apache.lucene.analysis.util.AbstractAnalysisFactory
get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames
Field Detail
-
NAME
public static final java.lang.String NAME
SPI name
- See Also:
- Constant Field Values
-
normalizer
private final com.ibm.icu.text.Normalizer2 normalizer
Method Detail
-
create
public TokenStream create(TokenStream input)
Description copied from class: TokenFilterFactory
Transform the specified input TokenStream.
- Specified by:
create in class TokenFilterFactory
-
normalize
public TokenStream normalize(TokenStream input)
Description copied from class: TokenFilterFactory
Normalize the specified input TokenStream. While the default implementation returns input unchanged, filters that should be applied at normalization time can delegate to the create method.
- Overrides:
normalize in class TokenFilterFactory
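The delegation pattern described for normalize can be sketched with small stand-in types. Everything here is hypothetical: SketchFilterFactory is not the Lucene TokenFilterFactory, a token stream is modeled as a plain String, and this page does not show the body of the override in ICUNormalizer2FilterFactory, so delegation to create is an assumption drawn from the description above.

```java
import java.util.Locale;

// Illustration only: hypothetical stand-ins, not the Lucene classes.
final class NormalizeDelegationSketch {

    // Stand-in for TokenFilterFactory; a token stream is modeled as a plain String.
    abstract static class SketchFilterFactory {
        abstract String create(String input);

        // Default: return the input unchanged.
        String normalize(String input) {
            return input;
        }
    }

    // Rewrites token text, so it opts in to normalization by delegating to create.
    static final class LowercaseSketchFactory extends SketchFilterFactory {
        @Override
        String create(String input) {
            return input.toLowerCase(Locale.ROOT);
        }

        @Override
        String normalize(String input) {
            return create(input);
        }
    }

    // Keeps the inherited normalize, so normalization leaves the input alone.
    static final class TruncateSketchFactory extends SketchFilterFactory {
        @Override
        String create(String input) {
            return input.length() > 3 ? input.substring(0, 3) : input;
        }
    }

    public static void main(String[] args) {
        System.out.println(new LowercaseSketchFactory().normalize("ABCDE")); // prints abcde
        System.out.println(new TruncateSketchFactory().normalize("ABCDE"));  // prints ABCDE
    }
}
```

A character-level normalizer is a natural candidate for the first pattern, since the same text mapping is wanted both when tokens are created and when text is normalized on its own.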