Downloads

Doc Classifier - tool for automatic document classification

Inserted by:Doc. Ing. Pavel Král, Ph.D.
Date last modified:29.12.2013
Year of insertion2012
Size:10.5 MB
Number of downloads:5
Abbreviation:doc_classifier

Product description

Doc Classifier is a tool designed for automatic single or multi-label text document classification. Three classifiers: Naive Bayes (NB), Support Vectors Machine (SVM) and Maximum Entropy classifier are integrated. For feature selection, five methods are used: Document Frequency (DF), Information Gain (IG), Mutual Information (MI), Chi-square and GSS methods. The Doc Classifier tool was developed mainly for testing and evaluation of the document classification methods and for adjusting parameters influencing the accuracy of these methods.


Download

The use of this product is governed by the following license:GNU-GPL

GNU General Public License v.3



Product files

#TitleDescriptionSize
1.doc_classifier.tgz10768 kB