dc.contributor.author | Rodríguez-Mazahua, Nidia | |
dc.contributor.author | Rodríguez-Mazahua, Lisbeth | |
dc.contributor.author | López-Chau, Asdrúbal | |
dc.contributor.author | Alor-Hernández, Giner | |
dc.date.accessioned | 2022-06-29T03:32:44Z | |
dc.date.available | 2022-06-29T03:32:44Z | |
dc.date.issued | 2020-08-01 | |
dc.identifier.isbn | 978-607-506-395-9 | |
dc.identifier.uri | http://repositorios.orizaba.tecnm.mx:8080/xmlui/handle/123456789/590 | |
dc.description | One of the main problems faced by Data Warehouse (DW) designers is fragmentation. Several studies have proposed data mining-based horizontal fragmentation methods, which focus on optimizing the query response time and execution cost to make the DW more efficient. However, to the best of our knowledge there not exist a horizontal fragmentation technique that uses a decision tree to carry out fragmentation. Given the importance of decision trees in classification, since they allow obtaining pure partitions (subsets of tuples) in a data set using measures such as Information Gain, Gain Ratio and the Gini Index, the aim of this work is to use decision trees in the DW fragmentation. For this, the requirements necessary to carry out horizontal fragmentation using decision trees will be determined, and the fragmentation method will be designed, which will consist of determining the most frequent OLAP (On-line Analytical Processing) queries, analyzing the predicates used by the queries, and based on this build the decision tree, from which the horizontal fragments will be generated. The method will be implemented and validated using a case study in tourism. | es |
dc.description.abstract | One of the main problems faced by Data Warehouse (DW) designers is fragmentation. Several studies have proposed data mining-based horizontal fragmentation methods, which focus on optimizing the query response time and execution cost to make the DW more efficient. However, to the best of our knowledge there not exist a horizontal fragmentation technique that uses a decision tree to carry out fragmentation. Given the importance of decision trees in classification, since they allow obtaining pure partitions (subsets of tuples) in a data set using measures such as Information Gain, Gain Ratio and the Gini Index, the aim of this work is to use decision trees in the DW fragmentation. For this, the requirements necessary to carry out horizontal fragmentation using decision trees will be determined, and the fragmentation method will be designed, which will consist of determining the most frequent OLAP (On-line Analytical Processing) queries, analyzing the predicates used by the queries, and based on this build the decision tree, from which the horizontal fragments will be generated. The method will be implemented and validated using a case study in tourism. | es |
dc.description.sponsorship | Fondo Sectorial de Investigación para la Educación (SEP-CONACYT),
Tecnológico Nacional de México (TecNM) | es |
dc.language.iso | es | es |
dc.publisher | Universidad Autónoma de Coahuila, Centro de Investigación en Matemáticas Aplicadas | es |
dc.relation.ispartofseries | Aplicaciones de la Computación; | |
dc.subject | Fragmentation | es |
dc.subject | Data Warehouse | es |
dc.subject | Decision trees | es |
dc.title | Horizontal Fragmentation of Data Warehouses Using Decision Trees | es |
dc.type | Book chapter | es |