title: |
Protein-Protein Interaction Document Mining |
|
publication: |
||
part of series: |
Advances in Intelligent Systems Research | |
ISBN: |
978-90-78677-01-7 | |
ISSN: |
1951-6851 | |
DOI: |
doi:10.2991/jcis.2006.250 (how to use a DOI) | |
author(s): |
Shing Doong, Shu-Fen Hong |
|
corresponding author: |
||
publication date: |
October 2006 |
|
keywords: |
Latent semantic index, document mining, support vector machine, protein-protein interaction |
|
abstract: |
Protein-protein interactions (PPI) are very important to the understanding of metabolic pathway. Many digital publications are available today; some of them discuss PPI and some of them do not. If machine learning techniques can be used to detect those PPI documents automatically, it would save researchers tremendous amount of time to construct a biological pathway. In this study, we analyze this document mining problem by using different kinds of feature representations and classification algorithms. Latent semantic indexing (LSI) and information gain (IG) were used to extract features from a document for classification, while support vector machine (SVM) and Naïve Bayesian (NB) were the selected algorithms. It is found that the combination of LSI and SVM provided the best solution. |
|
copyright: |
©
Atlantis Press. This article is distributed under the
terms of the Creative Commons Attribution License, which permits
non-commercial use, distribution and reproduction in any medium,
provided the original work is properly cited. |
|
full text: |