|
Title:
|
A NOVEL SEMANTIC APPROACH TO DOCUMENT COLLECTIONS |
|
Author(s):
|
Andrea Addis , Manuela Angioni , Giuliano Armano , Roberto Demontis , Franco Tuveri , Eloisa Vargiu |
|
ISBN:
|
978-972-8924-60-7 |
|
Editors:
|
António Palma dos Reis |
|
Year:
|
2008 |
|
Edition:
|
Single |
|
Keywords:
|
Text Categorization, Document Collections, Intelligent Software Systems, Machine Learning |
|
Type:
|
Full Paper |
|
First Page:
|
53 |
|
Last Page:
|
60 |
|
Language:
|
English |
|
Cover:
|
|
|
Full Contents:
|
click to dowload
|
|
Paper Abstract:
|
Available document collections are more and more required for supervised text categorization tasks. They typically are
collections of documents classified by domain engineers. In this paper, we propose a semantic text categorization
approach able to automatically create document collections in which documents are classified according to WordNet
Domains taxonomy. Experiments have been performed by training a classifier with an automatic document collection and
comparing results with those obtained by training the same classifier on a hand-made document collection. Experimental
results point out that, on average, the performances of the automatic approach are quite similar to those obtained on a
document collection classified by domain engineers. |
|
|
|
|
|
|