Mostrar el registro sencillo del ítem

A family of admissible heuristics for A* to perform inference in probabilistic classifier chains

dc.contributor.authorMena Waldo, Deiner 
dc.contributor.authorMontañés Roces, Elena 
dc.contributor.authorQuevedo Pérez, José Ramón 
dc.contributor.authorCoz Velasco, Juan José del 
dc.date.accessioned2017-02-22T10:34:45Z
dc.date.available2017-02-22T10:34:45Z
dc.date.issued2017
dc.identifier.citationMachine Learning, 106(1), p. 143-169 (2017); doi:10.1007/s10994-016-5593-5
dc.identifier.issn0885-6125
dc.identifier.urihttp://hdl.handle.net/10651/40518
dc.description.abstractProbabilistic classifier chains have recently gained interest in multi-label classification, due to their ability to optimally estimate the joint probability of a set of labels. The main hindrance is the excessive computational cost of performing inference in the prediction stage. This pitfall has opened the door to propose efficient inference alternatives that avoid exploring all the possible solutions. The ϵϵ -approximate algorithm, beam search and Monte Carlo sampling are appropriate techniques, but only ϵϵ -approximate algorithm with ϵ=0ϵ=0 theoretically guarantees reaching an optimal solution in terms of subset 0/1 loss. This paper offers another alternative based on heuristic search that keeps such optimality. It consists of applying the A* algorithm providing an admissible heuristic able to explore fewer nodes than the ϵϵ -approximate algorithm with ϵ=0ϵ=0 . A preliminary study has already coped with this goal, but at the expense of the high computational time of evaluating the heuristic and only for linear models. In this paper, we propose a family of heuristics defined by a parameter that controls the trade-off between the number of nodes explored and the cost of computing the heuristic. Besides, a certain value of the parameter provides a method that is also suitable for the non-linear case. The experiments reported over several benchmark datasets show that the number of nodes explored remains quite steady for different values of the parameter, although the time considerably increases for high values. Hence, low values of the parameter give heuristics that theoretically guarantee exploring fewer nodes than the ϵϵ -approximate algorithm with ϵ=0ϵ=0 and show competitive computational time. Finally, the results exhibit the good behavior of the A* algorithm using these heuristics in complex situations such as the presence of noise
dc.description.sponsorshipThis research has been supported by the Spanish Ministerio de Economía y Competitividad (grants TIN2011-23558, TIN2015-65069)
dc.format.extentp. 143-169
dc.language.isoeng
dc.publisherSpringer
dc.relation.ispartofMachine Learning, 106(1)
dc.rights© 2017 Springer
dc.subjectMulti-label classification
dc.subjectProbabilistic classifier chains
dc.subjectInference A*
dc.subjectAdmissible heuristics
dc.titleA family of admissible heuristics for A* to perform inference in probabilistic classifier chainseng
dc.typejournal article
dc.identifier.doi10.1007/s10994-016-5593-5
dc.relation.projectIDMINECO/TIN2011-23558
dc.relation.projectIDMINECO/TIN2015-65069
dc.relation.publisherversionhttp://dx.doi.org/10.1007/s10994-016-5593-5
dc.rights.accessRightsopen access
dc.type.hasVersionAM


Ficheros en el ítem

untranslated

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem