Temporal Data Management and Incremental Data Recomputation with Wide-column Stores and MapReduce
- In recent years, ”Big Data” has become an important topic in academia and industry. To handle the challenges and problems caused by Big Data, new types of data storage systems called ”NoSQL stores” (means ”Not-only- SQL”) have emerged. ”Wide-column stores” are one kind of NoSQL stores. Compared to relational database systems, wide-column stores introduce a new data model, new IRUD (Insert, Retrieve, Update and Delete) semantics with support for schema-flexibility, single-row transactions and data expiration constraints. Moreover, each column stores multiple data versions with associated time- stamps. Well-known examples are Google’s ”Big-table” and its open sourced counterpart ”HBase”. Recently, such systems are increasingly used in business intelligence and data warehouse environments to provide decision support, controlling and revision capabilities. Besides managing the current values, data warehouses also require management and processing of historical, time-related data. Data warehouses frequently employ techniques for processing changes in various data sources and incrementally applying such changes to the warehouse to keep it up-to- date. Although both incremental data warehousing maintenance and temporal data management have been the subject of intensive research in the relational database and finally commercial database products have picked up the ability for temporal data processing and management, such capabilities have not been explored systematically for today’s wide-column stores. This thesis helps to address the shortcomings mentioned above. It care- fully analyzes the properties of wide-column stores and the applicability of mechanisms for temporal data management and incremental data ware- house maintenance known from relational databases, extends well-known approaches and develops new capabilities for providing equivalent support in wide-column stores.
Author: | Yong Hu |
---|---|
URN: | urn:nbn:de:hbz:386-kluedo-49654 |
Advisor: | Stefan Deßloch, Norbert Ritter |
Document Type: | Doctoral Thesis |
Language of publication: | English |
Date of Publication (online): | 2017/10/26 |
Year of first Publication: | 2017 |
Publishing Institution: | Technische Universität Kaiserslautern |
Granting Institution: | Technische Universität Kaiserslautern |
Acceptance Date of the Thesis: | 2017/07/31 |
Date of the Publication (Server): | 2017/10/26 |
GND Keyword: | Wide-column stores; Temporal data processing; MapReduce; Incremental recomputation |
Page Number: | IV, 190 |
Faculties / Organisational entities: | Kaiserslautern - Fachbereich Informatik |
CCS-Classification (computer science): | E. Data |
H. Information Systems | |
J. Computer Applications | |
DDC-Cassification: | 0 Allgemeines, Informatik, Informationswissenschaft / 004 Informatik |
Licence (German): |