Package org.apache.lucene.analysis.icu
Class ICUNormalizer2CharFilterFactory
- java.lang.Object
  - org.apache.lucene.analysis.util.AbstractAnalysisFactory
    - org.apache.lucene.analysis.util.CharFilterFactory
      - org.apache.lucene.analysis.icu.ICUNormalizer2CharFilterFactory
-
public class ICUNormalizer2CharFilterFactory extends CharFilterFactory
Factory for ICUNormalizer2CharFilter.
Supports the following attributes:
- name: A Unicode Normalization Form, one of 'nfc', 'nfkc', 'nfkc_cf'. Default is nfkc_cf.
- mode: Either 'compose' or 'decompose'. Default is compose. Use 'decompose' with nfc or nfkc to get nfd or nfkd, respectively.
- filter: A UnicodeSet pattern. Codepoints outside the set are always left unchanged. Default is [] (the null set, no filtering).
- Since: 4.10.0
- See Also: ICUNormalizer2CharFilter, Normalizer2, FilteredNormalizer2
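The interaction between the name and mode attributes can be illustrated without Lucene or ICU on the classpath. The sketch below is an assumption-laden stand-in: it uses the JDK's java.text.Normalizer rather than ICU's Normalizer2, and the class NormalizationModeSketch and its methods toForm and normalize are hypothetical names, not part of this factory. The JDK has no counterpart for nfkc_cf (NFKC plus case folding), so that form is rejected here.

```java
import java.text.Normalizer;
import java.util.Locale;

public class NormalizationModeSketch {

    // Hypothetical helper: maps a (name, mode) attribute pair to a JDK normalization form.
    // 'decompose' turns nfc into NFD and nfkc into NFKD, as the attribute list describes.
    public static Normalizer.Form toForm(String name, String mode) {
        boolean decompose = "decompose".equals(mode);
        switch (name.toLowerCase(Locale.ROOT)) {
            case "nfc":
                return decompose ? Normalizer.Form.NFD : Normalizer.Form.NFC;
            case "nfkc":
                return decompose ? Normalizer.Form.NFKD : Normalizer.Form.NFKC;
            default:
                // nfkc_cf needs ICU; the JDK Normalizer cannot express it.
                throw new IllegalArgumentException("no JDK equivalent for form: " + name);
        }
    }

    // Hypothetical helper: normalizes text with the form selected by name and mode.
    public static String normalize(String name, String mode, String text) {
        return Normalizer.normalize(text, toForm(name, mode));
    }

    public static void main(String[] args) {
        String precomposed = "\u00e9"; // LATIN SMALL LETTER E WITH ACUTE
        String ligature = "\ufb01";    // LATIN SMALL LIGATURE FI

        // nfc + decompose behaves like NFD: one code point becomes base letter + combining accent.
        System.out.println(normalize("nfc", "decompose", precomposed).length());
        // nfkc + compose applies compatibility mappings, so the ligature is expanded.
        System.out.println(normalize("nfkc", "compose", ligature));
    }
}
```

With the real factory, the same choices would be passed as string entries in the args map given to the constructor.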
-
Field Summary
Fields (Modifier and Type, Field, Description):
- static java.lang.String NAME: SPI name
- private com.ibm.icu.text.Normalizer2 normalizer
-
Fields inherited from class org.apache.lucene.analysis.util.AbstractAnalysisFactory
LUCENE_MATCH_VERSION_PARAM, luceneMatchVersion
-
Constructor Summary
Constructors (Constructor, Description):
- ICUNormalizer2CharFilterFactory(java.util.Map<java.lang.String,java.lang.String> args): Creates a new ICUNormalizer2CharFilterFactory
-
Method Summary
Methods (Modifier and Type, Method, Description):
- java.io.Reader create(java.io.Reader input): Wraps the given Reader with a CharFilter.
- java.io.Reader normalize(java.io.Reader input): Normalizes the specified input Reader. While the default implementation returns the input unchanged, char filters that should be applied at normalization time can delegate to the create method.
-
Methods inherited from class org.apache.lucene.analysis.util.CharFilterFactory
availableCharFilters, findSPIName, forName, lookupClass, reloadCharFilters
-
Methods inherited from class org.apache.lucene.analysis.util.AbstractAnalysisFactory
get, get, get, get, get, getBoolean, getChar, getClassArg, getFloat, getInt, getLines, getLuceneMatchVersion, getOriginalArgs, getPattern, getSet, getSnowballWordSet, getWordSet, isExplicitLuceneMatchVersion, require, require, require, requireBoolean, requireChar, requireFloat, requireInt, setExplicitLuceneMatchVersion, splitAt, splitFileNames
-
Field Detail
-
NAME
public static final java.lang.String NAME
SPI name
- See Also: Constant Field Values
-
normalizer
private final com.ibm.icu.text.Normalizer2 normalizer
-
Method Detail
-
create
public java.io.Reader create(java.io.Reader input)
Description copied from class: CharFilterFactory
Wraps the given Reader with a CharFilter.
- Specified by: create in class CharFilterFactory
-
normalize
public java.io.Reader normalize(java.io.Reader input)
Description copied from class: CharFilterFactory
Normalizes the specified input Reader. While the default implementation returns the input unchanged, char filters that should be applied at normalization time can delegate to the create method.
- Overrides: normalize in class CharFilterFactory
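The delegation described for normalize can be sketched in isolation. The example below does not use Lucene: SketchCharFilterFactory is a hypothetical stand-in that only mirrors the documented contract (create wraps a Reader, normalize returns its input unchanged by default), and UpperCasingFactory is an invented subclass whose filter should also run at normalization time. It is an illustration of the pattern, not the source of ICUNormalizer2CharFilterFactory.

```java
import java.io.FilterReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

public class NormalizeDelegationSketch {

    // Hypothetical stand-in for the documented contract of CharFilterFactory.
    public abstract static class SketchCharFilterFactory {
        public abstract Reader create(Reader input);

        // Default behavior: return the input unchanged.
        public Reader normalize(Reader input) {
            return input;
        }
    }

    // Hypothetical factory whose filter is also wanted at normalization time.
    public static class UpperCasingFactory extends SketchCharFilterFactory {
        @Override
        public Reader create(Reader input) {
            return new FilterReader(input) {
                @Override
                public int read() throws IOException {
                    int c = super.read();
                    return c < 0 ? c : Character.toUpperCase(c);
                }

                @Override
                public int read(char[] buf, int off, int len) throws IOException {
                    int n = super.read(buf, off, len);
                    for (int i = off; i < off + n; i++) {
                        buf[i] = Character.toUpperCase(buf[i]);
                    }
                    return n;
                }
            };
        }

        // Delegate to create so the same filter applies during normalization.
        @Override
        public Reader normalize(Reader input) {
            return create(input);
        }
    }

    // Reads a Reader to the end and returns its contents as a String.
    public static String drain(Reader reader) {
        try {
            StringBuilder sb = new StringBuilder();
            char[] buf = new char[64];
            for (int n; (n = reader.read(buf)) != -1; ) {
                sb.append(buf, 0, n);
            }
            return sb.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        SketchCharFilterFactory factory = new UpperCasingFactory();
        System.out.println(drain(factory.normalize(new StringReader("abc"))));
    }
}
```

A factory that kept the default normalize would pass the text through untouched, which is the difference the Overrides entry points at.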
-