Proceedings of the 2015 Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology

Detecting similarity of R functions via a fusion of multiple heuristic methods

Authors
Maciej Bartoszuk, Marek Gagolewski
Corresponding Author
Maciej Bartoszuk
Available Online June 2015.
DOI
10.2991/ifsa-eusflat-15.2015.61How to use a DOI?
Keywords
R, plagiarism and code cloning detection, fuzzy proximity relations, aggregation, program dependence graph, t-norms.
Abstract

In this paper we describe recent advances in our R code similarity detection algorithm. We propose a modification of the Program Dependence Graph (PDG) procedure used in the GPLAG system that better fits the nature of functional programming languages like R. The major strength of our approach lies in a proper aggregation of outputs of multiple plagiarism detection methods, as it is well known that no single technique gives perfect results. It turns out that the incorporation of the PDG algorithm significantly improves the recall ratio, i.e. it is better in indicating true positive cases of plagiarism or code cloning patterns. The implemented system is available as web application at http://SimilaR.Rexamine.com/.

Copyright
© 2015, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2015 Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology
Series
Advances in Intelligent Systems Research
Publication Date
June 2015
ISBN
978-94-62520-77-6
ISSN
1951-6851
DOI
10.2991/ifsa-eusflat-15.2015.61How to use a DOI?
Copyright
© 2015, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Maciej Bartoszuk
AU  - Marek Gagolewski
PY  - 2015/06
DA  - 2015/06
TI  - Detecting similarity of R functions via a fusion of multiple heuristic methods
BT  - Proceedings of the 2015 Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology
PB  - Atlantis Press
SP  - 419
EP  - 426
SN  - 1951-6851
UR  - https://doi.org/10.2991/ifsa-eusflat-15.2015.61
DO  - 10.2991/ifsa-eusflat-15.2015.61
ID  - Bartoszuk2015/06
ER  -