|
||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
public interface FilteringAPI
Top-level API for filtering
Filtering is applied to the result of an extraction in order
to obtain a controled sub-set.
ExtractAPI
Method Summary | |
---|---|
java.lang.String |
apply(ExtractAPI exAPI,
int countingMinimum,
int countingMaximum,
java.util.ArrayList<FilteringPattern> patterns,
double share,
double proxi,
java.lang.String cannedText,
java.lang.String criterion,
java.util.Set<java.lang.String> alreadyCreated,
java.util.Set<java.lang.String> alreadyRejected,
java.util.Set<java.lang.String> thesaurus,
boolean pruneWhenSameWeakTypography,
java.util.Set<java.lang.String> exclusion)
Run the filtering after an extraction. |
java.util.TreeMap<java.lang.String,Candidate> |
getResult()
Returns the candidates, i.e. the result of filtering Each result is a pair composed of: - the term's lemmatised form, as a TreeMap key - the Candidate instance, as a TreeMap value |
void |
writeResult(java.lang.String fileName)
Write the result in an XML file |
Method Detail |
---|
java.lang.String apply(ExtractAPI exAPI, int countingMinimum, int countingMaximum, java.util.ArrayList<FilteringPattern> patterns, double share, double proxi, java.lang.String cannedText, java.lang.String criterion, java.util.Set<java.lang.String> alreadyCreated, java.util.Set<java.lang.String> alreadyRejected, java.util.Set<java.lang.String> thesaurus, boolean pruneWhenSameWeakTypography, java.util.Set<java.lang.String> exclusion)
exAPI
- the extraction result, see ExtractAPIcountingMinimum
- floor limit for counting criterioncountingMaximum
- upper limit for counting criterionpatterns
- list of patterns for structural criterionshare
- variable for distributional criterionproxi
- variable for distributional criterioncannedText
- character string for textual criterioncriterion
- criterion or criterion combination to be taken inalreadyCreated
- lemmatised forms to be ignored when candidatealreadyRejected
- lemmatised forms to be ignored when candidatethesaurus
- lemmatised forms to be ignored when candidatepruneWhenSameWeakTypography
- to factorize the terms that have the same weak typography lemmatised formexclusion
- to ignore a candidate when one of these strings is in the lemmatized form
java.util.TreeMap<java.lang.String,Candidate> getResult()
Each result is a pair composed of:
- the term's lemmatised form, as a TreeMap key
- the Candidate instance, as a TreeMap value
void writeResult(java.lang.String fileName)
|
||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |