Case study in using linguistic phrases for text categorization on the WWW

Publication Type Journal Article
School or College College of Engineering
Department Computing, School of
Creator Riloff, Ellen M.
Other Author Furnkranz, Johannes; Mitchell, Tom
Title Case study in using linguistic phrases for text categorization on the WWW
Date 1998
Description Most learning algorithms that are applied to text categorization problems rely on a bag-of-words document representation, i.e., each word occurring in the document is considered as a separate feature. In this paper, we investigate the use of linguistic phrases as input features for text categorization problems. These features are based on information extraction patterns that are generated and used by the AUTOSLOG-TS system. We present experimental results on using such features as background knowledge for two machine learning algorithms on a classification task on the WWW. The results show that phrasal features can improve the precision of learned theories at the expense of coverage.
Type Text
Publisher Association for the Advancement of Artificial Intelligence (AAAI)
First Page 1
Last Page 8
Subject Learning algorithms; Text categorization; Linguistic phrases; Information extraction patterns; AutoSlog-TS
Subject LCSH Information retrieval
Language eng
Bibliographic Citation Furnkranz, J., Mitchell, T., & Riloff, E. M. (1998). Case study in using linguistic phrases for text categorization on the WWW. AAAI/ICML Workshop on Learning for Text Categorization, 1-8.
Rights Management (c)AAAI http://www.aaai.org/
Format Medium application/pdf
Format Extent 962,469 bytes
Identifier ir-main,12440
ARK ark:/87278/s6h13kb5
Setname ir_uspace
ID 704338
Reference URL https://collections.lib.utah.edu/ark:/87278/s6h13kb5