dc.contributor.author | Rodríguez-Mazahua, Nidia | |
dc.contributor.author | Rodríguez-Mazahua, Lisbeth | |
dc.contributor.author | López Chau, Asdrúbal | |
dc.contributor.author | Alor-Hernández, Giner | |
dc.date.accessioned | 2022-06-29T03:11:20Z | |
dc.date.available | 2022-06-29T03:11:20Z | |
dc.date.issued | 2020-12-07 | |
dc.identifier.issn | 2389-8186 | |
dc.identifier.uri | http://repositorios.orizaba.tecnm.mx:8080/xmlui/handle/123456789/588 | |
dc.description | One of the main problems faced by Data Warehouse designers is fragmentation. Several studies have proposed data mining-based horizontal fragmentation methods. However, not exists a horizontal fragmentation technique that uses a decision tree. This
paper presents the analysis of different decision tree algorithms to select the best one to implement the fragmentation method. Such analysis was performed under version 3.9.4 of Weka, considering four evaluation metrics (Precision, ROC Area, Recall and F-measure) for different selected data sets using the Star Schema Benchmark. The results showed that the two best algorithms were J48 and Random Forest in most cases; nevertheless, J48 was selected because it is more efficient in building the model. | es |
dc.description.abstract | One of the main problems faced by Data Warehouse designers is fragmentation. Several studies have proposed data mining-based horizontal fragmentation methods. However, not exists a horizontal fragmentation technique that uses a decision tree. This
paper presents the analysis of different decision tree algorithms to select the best one to implement the fragmentation method. Such analysis was performed under version 3.9.4 of Weka, considering four evaluation metrics (Precision, ROC Area, Recall and F-measure) for different selected data sets using the Star Schema Benchmark. The results showed that the two best algorithms were J48 and Random Forest in most cases; nevertheless, J48 was selected because it is more efficient in building the model. | es |
dc.description.sponsorship | Fondo Sectorial de Investigación para la Educación (SEP-CONACYT),
Tecnológico Nacional de México (TecNM) | es |
dc.language.iso | en_US | es |
dc.publisher | CEIPA | es |
dc.relation.ispartofseries | Revista Perspectiva Empresarial; | |
dc.subject | Data analysis | es |
dc.subject | Computer systems | es |
dc.subject | Databases | es |
dc.subject | Artificial Intelligence | es |
dc.subject | Decision making | es |
dc.title | Comparative Analysis of Decision Tree Algorithms for Data Warehouse Fragmentation | es |
dc.type | Article | es |