Mostrar el registro sencillo del ítem

Data from "Heterogeneous tree structure classification to label Java programmers according to their expertise level"

dc.contributor.authorOrtín Soler, Francisco 
dc.contributor.authorRodríguez Prieto, Óscar 
dc.contributor.authorPascual, Nicolás
dc.contributor.authorGarcía Rodríguez, Miguel 
dc.date.accessioned2024-01-16T07:46:28Z
dc.date.available2024-01-16T07:46:28Z
dc.date.issued2019-06-13
dc.identifier.urihttps://hdl.handle.net/10651/70832
dc.descriptionData from the article "F. Ortin, O. Rodriguez-Prieto, N. Pascual, M. Garcia. Heterogeneous tree structure classification to label Java programmers according to their expertise level. Future Generation Computer Systems (105), pp. 380-394, 2020. https://doi.org/10.1016/j.future.2019.12.016"spa
dc.description.abstractOpen-source code repositories are a valuable asset to creating different kinds of tools and services, utilizing machine learning and probabilistic reasoning. Syntactic models process Abstract Syntax Trees (AST) of source code to build systems capable of predicting different software properties. The main difficulty of building such models comes from the heterogeneous and compound structures of ASTs, and that traditional machine learning algorithms require instances to be represented as n-dimensional vectors rather than trees. In this article, we propose a new approach to classify ASTs using traditional supervised-learning algorithms, where a feature learning process selects the most representative syntax patterns for the child subtrees of different syntax constructs. Those syntax patterns are used to enrich the context information of each AST, allowing the classification of compound heterogeneous tree structures. The proposed approach is applied to the problem of labeling the expertise level of Java programmers. The system is able to label expert and novice programs with an average accuracy of 99.6%. Moreover, other code fragments such as types, fields, methods, statements and expressions could also be classified, with average accuracies of 99.5%, 91.4%, 95.2%, 88.3% and 78.1%, respectively.spa
dc.description.sponsorshipThis work has been partially funded by the Spanish Department of Science, Innovation and Universities: project RTI2018-099235-B-I00. The authors have also received funds from the University of Oviedo through its support of official research groups (GR-2011-0040).spa
dc.language.isoengspa
dc.relation.isreferencedbyF. Ortin, O. Rodriguez-Prieto, N. Pascual, M. Garcia. Heterogeneous tree structure classification to label Java programmers according to their expertise level. Future Generation Computer Systems (105), pp. 380-394, 2020. https://doi.org/10.1016/j.future.2019.12.016spa
dc.rightsOpen Data Commons Attribution License (ODC-By)spa
dc.rights.urihttps://opendatacommons.org/licenses/by/
dc.subjectBig codespa
dc.subjectMachine learningspa
dc.subjectSyntax patternsspa
dc.subjectAbstract syntax treesspa
dc.subjectProgrammer expertisespa
dc.subjectDecision treesspa
dc.subjectBig dataspa
dc.titleData from "Heterogeneous tree structure classification to label Java programmers according to their expertise level"spa
dc.typedatasetspa
dc.identifier.doi10.17811/ruo_datasets.70832
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-099235-B-I00/ES/MODELADO DE USUARIO PARA PERSONALIZACION DE INTERFAZ GUIADO POR ANALISIS AUTOMATICO DE PATRONES DE COMPORTAMIENTO/ spa
dc.relation.projectIDinfo:eu-repo/grantAgreement/University of Oviedo/Plan Propio 2019 - Grants for the maintenance of research activities/GR-2011-0040/ES/Computational Reflection Research Group/spa
dc.rights.accessRightsopen accessspa
dc.relation.ispartofURIhttp://hdl.handle.net/10651/54618
dc.publication.year2019


Ficheros en el ítem

untranslated
untranslated

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem