A family of admissible heuristics for A* to perform inference in probabilistic classifier chains

Mena Waldo, Deiner; Montañés Roces, Elena; Quevedo Pérez, José Ramón; Coz Velasco, Juan José del

doi:10.1007/s10994-016-5593-5

Repositorio

Cómo publicar

Recursos

FAQs

Mostrar el registro sencillo del ítem

A family of admissible heuristics for A* to perform inference in probabilistic classifier chains

dc.contributor.author	Mena Waldo, Deiner
dc.contributor.author	Montañés Roces, Elena
dc.contributor.author	Quevedo Pérez, José Ramón
dc.contributor.author	Coz Velasco, Juan José del
dc.date.accessioned	2017-02-22T10:34:45Z
dc.date.available	2017-02-22T10:34:45Z
dc.date.issued	2017
dc.identifier.citation	Machine Learning, 106(1), p. 143-169 (2017); doi:10.1007/s10994-016-5593-5
dc.identifier.issn	0885-6125
dc.identifier.uri	http://hdl.handle.net/10651/40518
dc.description.abstract	Probabilistic classifier chains have recently gained interest in multi-label classification, due to their ability to optimally estimate the joint probability of a set of labels. The main hindrance is the excessive computational cost of performing inference in the prediction stage. This pitfall has opened the door to propose efficient inference alternatives that avoid exploring all the possible solutions. The ϵϵ -approximate algorithm, beam search and Monte Carlo sampling are appropriate techniques, but only ϵϵ -approximate algorithm with ϵ=0ϵ=0 theoretically guarantees reaching an optimal solution in terms of subset 0/1 loss. This paper offers another alternative based on heuristic search that keeps such optimality. It consists of applying the A* algorithm providing an admissible heuristic able to explore fewer nodes than the ϵϵ -approximate algorithm with ϵ=0ϵ=0 . A preliminary study has already coped with this goal, but at the expense of the high computational time of evaluating the heuristic and only for linear models. In this paper, we propose a family of heuristics defined by a parameter that controls the trade-off between the number of nodes explored and the cost of computing the heuristic. Besides, a certain value of the parameter provides a method that is also suitable for the non-linear case. The experiments reported over several benchmark datasets show that the number of nodes explored remains quite steady for different values of the parameter, although the time considerably increases for high values. Hence, low values of the parameter give heuristics that theoretically guarantee exploring fewer nodes than the ϵϵ -approximate algorithm with ϵ=0ϵ=0 and show competitive computational time. Finally, the results exhibit the good behavior of the A* algorithm using these heuristics in complex situations such as the presence of noise
dc.description.sponsorship	This research has been supported by the Spanish Ministerio de Economía y Competitividad (grants TIN2011-23558, TIN2015-65069)
dc.format.extent	p. 143-169
dc.language.iso	eng
dc.publisher	Springer
dc.relation.ispartof	Machine Learning, 106(1)
dc.rights	© 2017 Springer
dc.subject	Multi-label classification
dc.subject	Probabilistic classifier chains
dc.subject	Inference A*
dc.subject	Admissible heuristics
dc.title	A family of admissible heuristics for A* to perform inference in probabilistic classifier chains	eng
dc.type	journal article
dc.identifier.doi	10.1007/s10994-016-5593-5
dc.relation.projectID	MINECO/TIN2011-23558
dc.relation.projectID	MINECO/TIN2015-65069
dc.relation.publisherversion	http://dx.doi.org/10.1007/s10994-016-5593-5
dc.rights.accessRights	open access
dc.type.hasVersion	AM

Ficheros en el ítem

Nombre:: ipcc-versionGreenAccess.pdf
Tamaño:: 1.174Mb
Formato:: PDF
Descripción:: Postprint

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos [37321]
Informática [869]
Investigaciones y Documentos OpenAIRE [8276]
Publicaciones resultado de proyectos financiados con fondos públicos

Mostrar el registro sencillo del ítem