dc.contributor.author | Morán Barbón, Jesús | |
dc.contributor.author | Bertolino, Antonia | |
dc.contributor.author | Riva Álvarez, Claudio A. de la | |
dc.contributor.author | Tuya González, Pablo Javier | |
dc.date.accessioned | 2024-04-25T07:30:59Z | |
dc.date.available | 2024-04-25T07:30:59Z | |
dc.date.issued | 2024 | |
dc.identifier.citation | IEEE Transactions on Software Engineering, 50(4), p. 956-978 (2024); doi:10.1109/TSE.2024.3369766 | |
dc.identifier.issn | 0098-5589 | |
dc.identifier.uri | https://hdl.handle.net/10651/72406 | |
dc.description.abstract | Among the current technologies to analyse large data, the MapReduce processing model stands out in Big Data. MapReduce is implemented in frameworks such as Hadoop, Spark or Flink that are able to manage the program executions according to the resources available at runtime. The developer should design the program in order to support all possible non-deterministic executions. However, the program may fail due to a design fault. Debugging these kinds of faults is difficult because the data are executed non-deterministically in parallel and the fault is not caused directly by the code, but by its design. This paper presents a framework called MRDebug which includes two debugging techniques focused on the MapReduce design faults. A spectrum-based fault localization technique locates the root cause of these faults analysing several executions of the test case, and a Delta Debugging technique isolates the data relevant to trigger the failure. An empirical evaluation with 13 programs shows that MRDebug is effective in debugging the faults, especially when the localization is done with the reduced data. In summary, MRDebug automatically provides valuable information to understand MapReduce design faults as it helps locate their root cause and obtains a minimal data that triggers the failure. | spa |
dc.description.sponsorship | This work was supported in part by the project
PID2019-105455GB-C32 under Grant MCIN/AEI/10.13039/501100011033
(Spain), in part by the project PID2022-137646OB-C32 under Grant MCIN/
AEI/10.13039/501100011033/FEDER, UE, and in part by the project
RDS_2022-2024_2.1_Progetto_CYBER under Grant MASE/PTR_22_24_INT_
2_1 (Italy). | spa |
dc.format.extent | p. 956-978 | spa |
dc.language.iso | eng | spa |
dc.publisher | IEEE | spa |
dc.relation.ispartof | IEEE Transactions on Software Engineering, 50 (4) | spa |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
dc.rights | © 2024 The Authors | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Debugging aids | spa |
dc.subject | Testing and debugging | spa |
dc.title | Automatic Debugging of Design Faults in MapReduce Applications | spa |
dc.type | journal article | spa |
dc.identifier.doi | 10.1109/TSE.2024.3369766 | |
dc.relation.projectID | PID2019-105455GB-C3 | spa |
dc.relation.projectID | MCIN/AEI/10.13039/501100011033 | spa |
dc.relation.projectID | PID2022-137646OB-C32 | spa |
dc.relation.projectID | MCIN/ AEI/10.13039/501100011033/FEDER | spa |
dc.relation.projectID | RDS_2022-2024_2.1_Progetto_CYBER | spa |
dc.relation.projectID | MASE/PTR_22_24_INT_ | spa |
dc.relation.publisherversion | http://dx.doi.org/10.1109/TSE.2024.3369766 | spa |
dc.rights.accessRights | open access | spa |
dc.relation.ispartofURI | https://hdl.handle.net/10651/72731 | |
dc.type.hasVersion | VoR | spa |