Systems, apparatuses, methods, and computer readable media for processing and analyzing big data using columnar index data format

Fecha de publicación: 18/08/2016
Fuente: Wipo "BigData"
Provided are systems, apparatuses, methods and non-transitory computer readable media for efficiently processing and analyzing big data using a columnar index data format. A method of processing big data at a processing system configured as a computer may include generating a dictionary by sorting data based on a column unit of the big data; classifying the sorted data into one or more data blocks for each dictionary based on a data size; generating an index that includes first data values of the respective data blocks in order of the data blocks, for each dictionary; and generating a column ID for each column based on row order of the big data.