Investigation into text classification with kernel based schemes
Benveniste, Steven M.
Fargues, Monique P.
MetadataShow full item record
The development of the Internet has resulted in a rapid explosion of information available on the Web. In addition, the speed and anonymity of internet media "publishing" make this medium ideal for rapid dissemination of various contents. As a result, there is a strong need for automated text analysis and mining tools, which can identify the main topics of texts, chat room discussions, Web postings, etc. This thesis investigates whether the nonlinear kernel-based feature vector selection approach may be beneficial for categorizing unstructured text documents. Results using a nonlinear kernel-based classification are compared to results obtained using the Latent Semantic Analysis (LSA) Approach commonly used in text categorization applications. The nonlinear kernel-based scheme considered in this work applies the feature vector selection (FVS) approach followed by the Linear Discriminant Analysis (LDA) scheme. Titles, along with abstracts from IEEE journal articles published between 1990 and 1999 with specific key terms, were used to construct the data set for classification. Overall, taking into account both classification performance and timing issues, results showed the FVS-LDA with a polynomial kernel of degree 1, and an added constant of 1, to be the best classifier for the database considered.
Approved for public release; distribution is unlimited
Showing items related by title, author, creator and subject.
Domboulas, Dimitrios I. (Monterey, California. Naval Postgraduate School, 2004-12);In recent years there has been an increased interest in effective individual control and enhanced security measures, and face recognition schemes play an important role in this increasing market. In the past, most face ...
Fargues, Monique P. (Monterey, California. Naval Postgraduate School, 2001-06); NPS-EC-01-005Extracting relevant features that allow for class discrimination is the first critical step in classification applications. However, this step often leads to high-dimensional feature spaces, which requires large datasets ...
Alexandropoulos, Ioannis M. (Monterey, California. Naval Postgraduate School, 2006-09);A considerable amount of research has been recently conducted on face recognition tasks, due to increasing demands for security and authentication applications. Recent technological developments in uncooled IR imagery ...