-
Notifications
You must be signed in to change notification settings - Fork 21
/
Copy pathCHANGES.txt
27 lines (27 loc) · 1.63 KB
/
CHANGES.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
1.6
- upgraded liblinear 1.8
- uses Maven to build and manage the dependencies
1.5
- added parameter to ModelUtils to show only the top N most important attributes per category
- upgraded lisvm 3.0 + rely on external lib instead of modified code
- upgraded liblinear 1.7
- can specify weighting schemes per field
- added TestWeightingSchemes
- can remap the attribute numbers before generating the vector file
- bugfix : could not use SimpleDocuments with a file backed corpus (jiuren)
- bugfix : loading new Learner from existing dataset generates a new Lexicon (jiuren)
1.4
- added XMLCorpusReader and XMLCorpusClassifier which generate a raw file from a XML corpus or classify a XML corpus using an existing model
- added java implementation of liblinear
- added toString() method on Fields
- bugfix : tokenforms excessively normalised in class Lexicon - was losing non european characters
- TextClassifier can take unzip working directory to /tmp before using it for classifying
1.3
- added CorpusUtils class for generating vector file from raw file, generate subset of raw file, filter fields from raw file, display best attributes
- Bug fix : Lexicon differenciates between attribute number and highest attribute ID
- added object attributescorer and refactored logLikelihoodAttributeFilter to return a Scorer object. This Scorer is to filter a lexicon
- normalise attribute forms to prevent \n or \t in lexicon file
1.2
- made setlogLikelihoodratio() and keepTopNAttributesLLR() public
- uses Enums for Parameters.WeightingMethods instead of list of static ints
- Parameters.WeightingMethods toString / methodFromString throw RunTimeException if value is not known