|
Carrot2 Framework
API Specification |
||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectorg.carrot2.filter.lingo.common.CarrotLibTokenizerPreprocessingStrategy
public final class CarrotLibTokenizerPreprocessingStrategy
A preprocessing strategy utilizing an internal tokenizer, languages map and stemmers from the new Carrot2 core.
| Field Summary | |
|---|---|
protected Map |
caseCheck
|
protected Map |
inflectedFreqSets
|
protected Map |
inflectedSets
|
protected Map |
languages
|
protected static org.apache.log4j.Logger |
logger
Logger |
protected Set |
lowCaseWords
|
protected Map |
nonStopWordSets
|
protected Set |
queryWords
|
protected Map |
stemSets
Linguistic information |
protected Map |
stopWordSets
|
protected Set |
strongWords
|
| Constructor Summary | |
|---|---|
CarrotLibTokenizerPreprocessingStrategy()
|
|
| Method Summary | |
|---|---|
Snippet[] |
preprocess(AbstractClusteringContext clusteringContext)
|
protected Snippet |
preprocess(Snippet snippet,
LanguageTokenizer tokenizer)
Method clean. |
protected Snippet |
stemming(Snippet snippet)
|
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
protected static final org.apache.log4j.Logger logger
protected Map stemSets
protected Map inflectedSets
protected Map stopWordSets
protected Map nonStopWordSets
protected Map languages
protected Set strongWords
protected Set queryWords
protected Set lowCaseWords
protected Map caseCheck
protected Map inflectedFreqSets
| Constructor Detail |
|---|
public CarrotLibTokenizerPreprocessingStrategy()
| Method Detail |
|---|
public Snippet[] preprocess(AbstractClusteringContext clusteringContext)
preprocess in interface PreprocessingStrategy
protected Snippet preprocess(Snippet snippet,
LanguageTokenizer tokenizer)
protected Snippet stemming(Snippet snippet)
|
Please refer to project documentation at
http://project.carrot2.org |
||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||