Master of Science in Computer Science
First Committee Member
Second Committee Member
Third Committee Member
Fourth Committee Member
Fifth Committee Member
Number of Pages
Abstract In this thesis, we present a summary of our activities associated with the storage and query processing of Google 1T 5-gram data set. We rst give a brief introduction to some of the implementation techniques for the relational algebra followed by a Map Reduce implementation of the same operators. We then implement a database schema in Hive for the Google 1T 5-gram data set.
The thesis will further look into the query processing with Hive and Pig in the Hadoop setting.
More specially, we report statistics for our queries in this environment.
Bozorgi, Mandana, "Data Analysis With Map Reduce Programming Paradigm" (2015). UNLV Theses, Dissertations, Professional Papers, and Capstones. 2467.