public class LanguageDetectorBuilder extends Object
Builder for a LanguageDetector.
This class does no internal synchronization.
Modifier and Type | Method and Description |
---|---|
LanguageDetectorBuilder | affixFactor(double affixFactor): Sets prefixFactor() and suffixFactor() both to the given value. |
LanguageDetectorBuilder | alpha(double alpha) |
LanguageDetector | build() |
static LanguageDetectorBuilder | create(@NotNull NgramExtractor ngramExtractor) |
LanguageDetectorBuilder | languagePriorities(@Nullable Map<LdLocale,Double> langWeightingMap): TODO document exactly. |
LanguageDetectorBuilder | minimalConfidence(double minimalConfidence): LanguageDetector.detect(java.lang.CharSequence) returns a language if the best detected language has at least this probability. |
LanguageDetectorBuilder | prefixFactor(double prefixFactor): To weight n-grams that are on the left border of a word differently from n-grams in the middle of words, assign a value here. |
LanguageDetectorBuilder | probabilityThreshold(double probabilityThreshold): LanguageDetector.getProbabilities(java.lang.CharSequence) does not return languages with less probability than this. |
LanguageDetectorBuilder | seed(long seed) |
LanguageDetectorBuilder | seed(@NotNull com.google.common.base.Optional<Long> seed) |
LanguageDetectorBuilder | shortTextAlgorithm(int shortTextAlgorithm): Defaults to 0, which means don't use this feature. |
LanguageDetectorBuilder | suffixFactor(double suffixFactor): Defaults to 1.0, which means don't use this feature. |
LanguageDetectorBuilder | withProfile(LanguageProfile languageProfile) |
LanguageDetectorBuilder | withProfiles(Iterable<LanguageProfile> languageProfiles) |
public static LanguageDetectorBuilder create(@NotNull NgramExtractor ngramExtractor)
public LanguageDetectorBuilder alpha(double alpha)
public LanguageDetectorBuilder seed(long seed)
public LanguageDetectorBuilder seed(@NotNull com.google.common.base.Optional<Long> seed)
public LanguageDetectorBuilder shortTextAlgorithm(int shortTextAlgorithm)
public LanguageDetectorBuilder affixFactor(double affixFactor)
See Also: prefixFactor(double)
public LanguageDetectorBuilder prefixFactor(double prefixFactor)
Parameters: prefixFactor - 0.0 to 10.0, a suggested value is 1.5
public LanguageDetectorBuilder suffixFactor(double suffixFactor)
Parameters: suffixFactor - 0.0 to 10.0, a suggested value is 2.0
See Also: prefixFactor(double)
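
As a rough illustration of what the prefix and suffix factors describe, the sketch below scales a base n-gram weight when the n-gram touches a word border. It is not taken from the library: the class `AffixWeightSketch`, the method `weigh`, and the convention that a leading or trailing space marks a word border are all assumptions made for this example.

```java
/** Hypothetical illustration only; this is not the library's implementation. */
class AffixWeightSketch {

    /**
     * Scales baseWeight by prefixFactor if the n-gram starts at a word border,
     * or by suffixFactor if it ends at one. A factor of 1.0 leaves the weight
     * unchanged, which matches "don't use this feature".
     * Assumption: a leading or trailing space inside the n-gram marks the border.
     */
    static double weigh(String ngram, double baseWeight,
                        double prefixFactor, double suffixFactor) {
        if (ngram.length() < 2) {
            return baseWeight; // too short to carry border information
        }
        if (ngram.charAt(0) == ' ') {
            return baseWeight * prefixFactor; // left border of a word
        }
        if (ngram.charAt(ngram.length() - 1) == ' ') {
            return baseWeight * suffixFactor; // right border of a word
        }
        return baseWeight; // middle of a word
    }

    public static void main(String[] args) {
        // Uses the suggested values from the parameter descriptions: 1.5 and 2.0.
        System.out.println(weigh(" th", 0.5, 1.5, 2.0));
        System.out.println(weigh("he ", 0.5, 1.5, 2.0));
        System.out.println(weigh("the", 0.5, 1.5, 2.0));
    }
}
```

With both factors at 1.0 every call returns the base weight, so border and mid-word n-grams are treated alike.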
public LanguageDetectorBuilder probabilityThreshold(double probabilityThreshold)
LanguageDetector.getProbabilities(java.lang.CharSequence) does not return languages with less probability than this.
The default is currently 0.1 (the old hardcoded value), but don't rely on it; if you need to be sure, set one explicitly.
public LanguageDetectorBuilder minimalConfidence(double minimalConfidence)
LanguageDetector.detect(java.lang.CharSequence) returns a language if the best detected language has at least this probability.
The default is currently 0.9999, but don't rely on it; if you need to be sure, set one explicitly.
public LanguageDetectorBuilder languagePriorities(@Nullable Map<LdLocale,Double> langWeightingMap)
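
The two thresholds documented above, probabilityThreshold and minimalConfidence, can be pictured with a small standalone sketch. It only mirrors the documented behaviour (drop languages below the threshold; return a language only if the best one reaches the minimal confidence). The class `ThresholdSketch`, its method names, and the use of plain language-code strings and `java.util.Optional` are assumptions for illustration, not the library's API.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;

/** Hypothetical illustration only; this is not the library's implementation. */
class ThresholdSketch {

    /** Keeps languages whose probability is at least probabilityThreshold, best first. */
    static List<Map.Entry<String, Double>> getProbabilities(
            Map<String, Double> scores, double probabilityThreshold) {
        List<Map.Entry<String, Double>> kept = new ArrayList<>();
        for (Map.Entry<String, Double> entry : scores.entrySet()) {
            if (entry.getValue() >= probabilityThreshold) {
                kept.add(entry);
            }
        }
        // Sort by probability, highest first.
        kept.sort((a, b) -> Double.compare(b.getValue(), a.getValue()));
        return kept;
    }

    /** Returns the best language only if its probability reaches minimalConfidence. */
    static Optional<String> detect(Map<String, Double> scores,
                                   double probabilityThreshold,
                                   double minimalConfidence) {
        List<Map.Entry<String, Double>> kept = getProbabilities(scores, probabilityThreshold);
        if (kept.isEmpty() || kept.get(0).getValue() < minimalConfidence) {
            return Optional.empty();
        }
        return Optional.of(kept.get(0).getKey());
    }

    public static void main(String[] args) {
        Map<String, Double> scores = new LinkedHashMap<>();
        scores.put("de", 0.04);
        scores.put("en", 0.95);
        scores.put("fr", 0.01);
        System.out.println(getProbabilities(scores, 0.1));
        System.out.println(detect(scores, 0.1, 0.9999).isPresent());
        System.out.println(detect(scores, 0.1, 0.9).isPresent());
    }
}
```

In this sketch a score of 0.95 survives the 0.1 probability threshold but fails a minimal confidence of 0.9999, which is why setting both values explicitly matters when the defaults cannot be relied on.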
public LanguageDetectorBuilder withProfile(LanguageProfile languageProfile) throws IllegalStateException
Throws: IllegalStateException - if a profile for the same language was added already (must be a userland bug).
public LanguageDetectorBuilder withProfiles(Iterable<LanguageProfile> languageProfiles) throws IllegalStateException
Throws: IllegalStateException - if a profile for the same language was added already (must be a userland bug).
public LanguageDetector build() throws IllegalStateException
Throws: IllegalStateException - if no LanguageProfile was added
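
The exception contract documented above (a duplicate language is rejected, and build() requires at least one profile) can be sketched with a minimal stand-in builder. Everything here is hypothetical: `DetectorBuilderSketch` identifies profiles by a plain language string and build() just returns the collected languages, whereas the real builder works with LanguageProfile objects and returns a LanguageDetector.

```java
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.Set;

/** Hypothetical stand-in for the documented builder contract; not the library's code. */
class DetectorBuilderSketch {
    private final Set<String> languages = new LinkedHashSet<>();
    private double prefixFactor = 1.0;
    private double suffixFactor = 1.0;

    /** Sets both border factors at once, as affixFactor is documented to do. */
    DetectorBuilderSketch affixFactor(double affixFactor) {
        this.prefixFactor = affixFactor;
        this.suffixFactor = affixFactor;
        return this;
    }

    /** Rejects a second profile for the same language. */
    DetectorBuilderSketch withProfile(String language) {
        if (!languages.add(language)) {
            throw new IllegalStateException(
                    "A profile for language '" + language + "' was added already");
        }
        return this;
    }

    /** Requires at least one profile; returns a read-only copy of the languages. */
    Set<String> build() {
        if (languages.isEmpty()) {
            throw new IllegalStateException("No language profile was added");
        }
        return Collections.unmodifiableSet(new LinkedHashSet<>(languages));
    }

    double prefixFactor() { return prefixFactor; }

    double suffixFactor() { return suffixFactor; }

    public static void main(String[] args) {
        Set<String> built = new DetectorBuilderSketch()
                .affixFactor(1.5)
                .withProfile("en")
                .withProfile("de")
                .build();
        System.out.println(built);
    }
}
```

Because the builder does no internal synchronization, a sketch like this (and the real class) is best configured and built from a single thread.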
Copyright © 2019. All rights reserved.