back to author index
   
title:
 
Protein-Protein Interaction Document Mining
publication:
 
JCIS-2006 Proceedings
part of series:
  Advances in Intelligent Systems Research
ISBN:
  978-90-78677-01-7
ISSN:
  1951-6851
DOI:
  doi:10.2991/jcis.2006.250 (how to use a DOI)
author(s):
 
Shing Doong, Shu-Fen Hong
corresponding author:
 
Shing Doong
publication date:
 
October 2006
keywords:
 
Latent semantic index, document mining, support vector machine, protein-protein interaction
abstract:
 
Protein-protein interactions (PPI) are very important to the understanding of metabolic pathway. Many digital publications are available today; some of them discuss PPI and some of them do not. If machine learning techniques can be used to detect those PPI documents automatically, it would save researchers tremendous amount of time to construct a biological pathway. In this study, we analyze this document mining problem by using different kinds of feature representations and classification algorithms. Latent semantic indexing (LSI) and information gain (IG) were used to extract features from a document for classification, while support vector machine (SVM) and Naïve Bayesian (NB) were the selected algorithms. It is found that the combination of LSI and SVM provided the best solution.
copyright:
 
© Atlantis Press. This article is distributed under the terms of the Creative Commons Attribution License, which permits non-commercial use, distribution and reproduction in any medium, provided the original work is properly cited.
full text: