Detecting similarity of R functions via a fusion of multiple heuristic methods
Maciej Bartoszuk, Marek Gagolewski
Available Online June 2015.
- 10.2991/ifsa-eusflat-15.2015.61How to use a DOI?
- R, plagiarism and code cloning detection, fuzzy proximity relations, aggregation, program dependence graph, t-norms.
In this paper we describe recent advances in our R code similarity detection algorithm. We propose a modification of the Program Dependence Graph (PDG) procedure used in the GPLAG system that better fits the nature of functional programming languages like R. The major strength of our approach lies in a proper aggregation of outputs of multiple plagiarism detection methods, as it is well known that no single technique gives perfect results. It turns out that the incorporation of the PDG algorithm significantly improves the recall ratio, i.e. it is better in indicating true positive cases of plagiarism or code cloning patterns. The implemented system is available as web application at http://SimilaR.Rexamine.com/.
- © 2015, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Maciej Bartoszuk AU - Marek Gagolewski PY - 2015/06 DA - 2015/06 TI - Detecting similarity of R functions via a fusion of multiple heuristic methods BT - Proceedings of the 2015 Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology PB - Atlantis Press SP - 419 EP - 426 SN - 1951-6851 UR - https://doi.org/10.2991/ifsa-eusflat-15.2015.61 DO - 10.2991/ifsa-eusflat-15.2015.61 ID - Bartoszuk2015/06 ER -