Temporal Data Management and Incremental Data Recomputation with Wide-column Stores and MapReduce

  • In recent years, ”Big Data” has become an important topic in academia and industry. To handle the challenges and problems caused by Big Data, new types of data storage systems called ”NoSQL stores” (means ”Not-only- SQL”) have emerged. ”Wide-column stores” are one kind of NoSQL stores. Compared to relational database systems, wide-column stores introduce a new data model, new IRUD (Insert, Retrieve, Update and Delete) semantics with support for schema-flexibility, single-row transactions and data expiration constraints. Moreover, each column stores multiple data versions with associated time- stamps. Well-known examples are Google’s ”Big-table” and its open sourced counterpart ”HBase”. Recently, such systems are increasingly used in business intelligence and data warehouse environments to provide decision support, controlling and revision capabilities. Besides managing the current values, data warehouses also require management and processing of historical, time-related data. Data warehouses frequently employ techniques for processing changes in various data sources and incrementally applying such changes to the warehouse to keep it up-to- date. Although both incremental data warehousing maintenance and temporal data management have been the subject of intensive research in the relational database and finally commercial database products have picked up the ability for temporal data processing and management, such capabilities have not been explored systematically for today’s wide-column stores. This thesis helps to address the shortcomings mentioned above. It care- fully analyzes the properties of wide-column stores and the applicability of mechanisms for temporal data management and incremental data ware- house maintenance known from relational databases, extends well-known approaches and develops new capabilities for providing equivalent support in wide-column stores.
Metadaten
Verfasser*innenangaben:Yong Hu
URN:urn:nbn:de:hbz:386-kluedo-49654
Betreuer*in:Stefan Deßloch, Norbert Ritter
Dokumentart:Dissertation
Sprache der Veröffentlichung:Englisch
Datum der Veröffentlichung (online):26.10.2017
Jahr der Erstveröffentlichung:2017
Veröffentlichende Institution:Technische Universität Kaiserslautern
Titel verleihende Institution:Technische Universität Kaiserslautern
Datum der Annahme der Abschlussarbeit:31.07.2017
Datum der Publikation (Server):26.10.2017
GND-Schlagwort:Wide-column stores; Temporal data processing; MapReduce; Incremental recomputation
Seitenzahl:IV, 190
Fachbereiche / Organisatorische Einheiten:Kaiserslautern - Fachbereich Informatik
CCS-Klassifikation (Informatik):E. Data
H. Information Systems
J. Computer Applications
DDC-Sachgruppen:0 Allgemeines, Informatik, Informationswissenschaft / 004 Informatik
Lizenz (Deutsch):Creative Commons 4.0 - Namensnennung, nicht kommerziell, keine Bearbeitung (CC BY-NC-ND 4.0)