Analysis of Simple Data Imputation in Disease Dataset

Fetty Tri Anggraeny; Intan Yuniar Purbasari; M. Syahrul Munir; Faisal Muttaqin; Eka Prakarsa Mandyarta; Fawwaz Ali Akbar

doi:10.2991/icst-18.2018.98

<Previous Article In Volume

Next Article In Volume>

Analysis of Simple Data Imputation in Disease Dataset

Authors

Fetty Tri Anggraeny, Intan Yuniar Purbasari, M. Syahrul Munir, Faisal Muttaqin, Eka Prakarsa Mandyarta, Fawwaz Ali Akbar

Corresponding Author

Fetty Tri Anggraeny

Available Online December 2018.

DOI: 10.2991/icst-18.2018.98 How to use a DOI?
Keywords: analysis; simple Imputation; disease dataset; fuzzy c-means
Abstract: In the statistical data collection it is very possible that there are variables that do not respond or in other words empty, called missing value, that can cause problems in data analysis. In this research we will analyze some simple imputation technique to solve the missing value problem, are zero imputation, mean imputation median imputation, and random imputation. This study used a Pima Indians, hepatitis and breast cancer Wisconsin dataset from UCI Machine Learning. We also compare with incomplete data removal technique. The application of various simple imputations in the disease dataset can increase the accuracy value when compared to deficient data deletion techniques. And the zero imputation technique shows the best performance compared to other imputation techniques and deficient data removal techniques.
Copyright: © 2018, the Authors. Published by Atlantis Press.
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the International Conference on Science and Technology (ICST 2018)
Series: Atlantis Highlights in Engineering
Publication Date: December 2018
ISBN: 978-94-6252-650-1
ISSN: 2589-4943
DOI: 10.2991/icst-18.2018.98 How to use a DOI?
Open Access: This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

ris enw bib

TY  - CONF
AU  - Fetty Tri Anggraeny
AU  - Intan Yuniar Purbasari
AU  - M. Syahrul Munir
AU  - Faisal Muttaqin
AU  - Eka Prakarsa Mandyarta
AU  - Fawwaz Ali Akbar
PY  - 2018/12
DA  - 2018/12
TI  - Analysis of Simple Data Imputation in Disease Dataset
BT  - Proceedings of the International Conference on Science and Technology (ICST 2018)
PB  - Atlantis Press
SP  - 471
EP  - 475
SN  - 2589-4943
UR  - https://doi.org/10.2991/icst-18.2018.98
DO  - 10.2991/icst-18.2018.98
ID  - Anggraeny2018/12
ER  -

download .riscopy to clipboard