Proceedings of the 2007 International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2007) - datamining | Atlantis Press

Series:Advances in Intelligent Systems Research

Proceedings of the 2007 International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2007)

Session: Data Mining

15 articles

Proceedings Article

Data Mining on Prices of Highway Project Main Material Based on WEB

Jie He, Yikai Chen, Wen Hang, Zhiguo Qi

The management of main material prices of provincial highway project quota has problems of lag and blindness. Framework of provincial highway project quota data MIS and main material price data warehouse were established based on WEB firstly. Then concrete processes of provincial highway project main...

Article details
Download article (PDF)

Proceedings Article

Optimization of Hidden Markov Model by a Genetic Algorithm for Web Information Extraction

Jiyi Xiao, Lamei Zou, Chuanqi Li

This paper demonstrates a new training method based on GA and Baum-Welch algorithms to obtain an HMM model with optimized number of states in the HMM models and its model parameters for web information extraction. This method is not only able to overcome the shortcomings of the slow convergence speed...

Article details
Download article (PDF)

Proceedings Article

ETL Process Design and Quality Control Research Based on Radar Spatial Data

Xuejun Chen

In order to study nowcasting operations using spatial data mining, a spatial data warehouse must be built via ETL process to store radar spatial data reflects strong convection weather. Also, the quality control of radar spatial data in the ETL process, which directed at weather station observed data...

Article details
Download article (PDF)

Proceedings Article

Mining Combined Detection of Tumor Marker Based on Cloud Model

Feng Guo, Shaozi Li

The cloud model is an effective tool in transforming between qualitative concepts and their quantitative expressions. In the field of tumor marker, detects combined markers can improve the performance of cancer detection. But the discovery of combined tumor markers depends on doctor’s experience, and...

Article details
Download article (PDF)

Proceedings Article

The bioinformatics analysis of hepatitis C virus E2 protein

Tailin Guo

By the methods of bioinformatics, the sequences of HCV E2 protein from all the genotypes are studied on the variation and the B-cell epitopes. We find that even in the HVR1 and HVR2 regions, there are also some conservative sites, such as T2, G6, G23, and Q26, all of which belong to polar R-base amino...

Article details
Download article (PDF)

Proceedings Article

Lag correlation analysis based on Boolean presentation over multiple data streams

Dejun Yue, Tiancheng Zhang, Ge Yu, Yu Gu

Correlation analysis is a basic problem in the field of data stream mining. Traditional method is not suitable for real time processing with huge amount of stream data. We propose a new method based on Boolean representation for lag correlation analysis among multiple data streams. The raw stream sequence...

Article details
Download article (PDF)

Proceedings Article

A Data Cleaning Method Based on Association Rules

Weijie Wei, Mingwei Zhang, Bin Zhang, Xiaochun Tang

The quality of the data affects the usability of the data mining’s results. Making a data preparation before the mining can improve the quality. If the data are collected from the multi-data source, data preparation becomes very difficult. In this paper, a data-cleaning method based on the association...

Article details
Download article (PDF)

Proceedings Article

Associative data mining for alarm groupings in chemical processes

Savo Kordic, Peng Lam, Jitian Xiao, Huaizhong Li

Complex industrial processes such as nuclear power plants, chemical plants and petroleum refineries are usually equipped with alarm systems capable of monitoring thousands of process variables and generating tens of thousands of alarms which are used as mechanisms for alerting operators to take actions...

Article details
Download article (PDF)

Proceedings Article

Improved Isomap Algorithm for Motion Analysis

Honggui Li, Xingguo Li

Euclidean distance, Hausdorff distance and SSP distance are discussed, and SSP distance is used for improved Isomap algorithm. Two methods are put forward for improving Isomap algorithm. One is aligning input data of original Isomap algorithm, the other is modifying Isomap algorithm itself. SSP distance...

Article details
Download article (PDF)

Proceedings Article

Knowledge Discovery of Interesting Classification Rules Based on Adaptive Genetic Algorithm

Yong Zhou

Data classification is a very important point in Data Mining,but the existing classification algorithms always only discover the classification rules with high accuracy,and the research about interesting classification rules is few. So this paper proposed an algorithm to find the interesting classification...

Article details
Download article (PDF)

Proceedings Article

EA DTW: Early Abandon to Accelerate Exactly Warping Matching of Time Series

Junkui Li, Yuanzhen Wang

Dynamic Time Warping(DTW) is one of the important distance measures in similarity search of time series, however, the exact calculation of DTW has become a bottleneck. We propose an approach, named Early Abandon DTW(EA_DTW) to accelerate the calculation of DTW. The method checks if value of cells in...

Article details
Download article (PDF)

Proceedings Article

Online Detect Polymorphic Exploit Based on Data Mining

Wei Wang, Huazhang Wang, Daisheng Luo, Yong Fang

In recent years, Internet worms increasingly threaten the Internet hosts and service and polymorphic worms can evade signature-based intrusion detection systems. We propose DMPolD (Data Ming Polymorphism Detection) to detect polymorphic exploit based on semantic signature and data-mining. We analyze...

Article details
Download article (PDF)

Proceedings Article

A Partition Rule for SAT Solvers: The Multiple Partition Rule (MPR)

Juan Segura-Salazar, Juan Frausto-Solís

We propose a new partition rule for DPLL-based SAT Solvers. Most of the complete SAT solver usually are based on Davis, Logemann and Loveland (DPLL) rules. One most DPLL rule actually used in the modern algorithms is the Classical Partition Rule (CPR), that divides the problem into sub-problems (resolvents)...

Article details
Download article (PDF)

Proceedings Article

An Approach to Classification Based on Fuzzy Association Rules

Guoqing Chen, Zuoliang Chen

Classification based on association rules is considered effective and advantageous in many cases. However, the "sharp boundary" problem in association rules mining with numerical data may lead to semantics retortion of discovered rules, which may further disturb the understandability, even the accuracy...

Article details
Download article (PDF)

Proceedings Article

Extracting Linguistic Rules from Database using Linguistic Aggregation Operator

Zheng Pei, Yingchao Shao

In real-world database, most attribute values of objects are numerical, numeral is too detail to obtaining good information or decision. Hence, linguistic rules of a set of data would be very desirable and human consistent. Based on a new aggregation operator for aggregating linguistic terms, extracting...

Article details
Download article (PDF)