System for rapid interactive exploration of big data

Publication date: 15/07/2021
Source: WIPO "BigData"
An apparatus for estimating analytics and interactively exploring big data, stored and/or streaming, using approximate query processing is presented. The apparatus comprises a model constructor and a sampler. The model constructor identifies important predictor variables in the big data using feature selection, predictor variables, and outcome variables, and partitions the important predictor variables into one or more stratifications based on either identified interactions or identified relationships. The sampler generates a subset of data by querying the big data using a query constructed based on at least one stratification. The subset of data can be fed into an analytics generator. The analytics generator generates analytics data for the outcome variables based on the subset of data and an analytics algorithm, and generates a visualization, e.g. an interactive visualization, comprising the outcome variables, the important predictor variables, the stratification, the subset of data, and the analytics data.
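
The sampling and estimation steps described above can be illustrated with a minimal sketch. It is not the claimed apparatus: it assumes feature selection has already picked a single important predictor, and it uses plain proportional stratified sampling with a stratum-weighted mean as a stand-in for the analytics algorithm. The function names (`stratify`, `stratified_sample`, `estimate_mean`), the column names `region` and `spend`, and the synthetic data are all hypothetical.

```python
import random
from collections import defaultdict


def stratify(rows, key):
    """Group rows by the value of one predictor (the stratification key)."""
    strata = defaultdict(list)
    for row in rows:
        strata[row[key]].append(row)
    return dict(strata)


def stratified_sample(strata, fraction, rng):
    """Draw the same fraction from every stratum, keeping at least one row each."""
    sample = {}
    for label, members in strata.items():
        k = max(1, round(len(members) * fraction))
        sample[label] = rng.sample(members, k)
    return sample


def estimate_mean(strata, sample, outcome):
    """Weight each stratum's sample mean by that stratum's share of the full data."""
    total = sum(len(members) for members in strata.values())
    return sum(
        (len(strata[label]) / total) * (sum(r[outcome] for r in rows) / len(rows))
        for label, rows in sample.items()
    )


rng = random.Random(0)

# Synthetic data (assumption): 'region' plays the role of the important
# predictor, 'spend' the outcome variable.
rows = [
    {"region": region, "spend": base + rng.gauss(0, 1)}
    for region, base, n in [("north", 10, 6000), ("south", 50, 3000), ("west", 90, 1000)]
    for _ in range(n)
]

strata = stratify(rows, "region")
sample = stratified_sample(strata, 0.01, rng)  # query only 1% of each stratum
approx = estimate_mean(strata, sample, "spend")
exact = sum(r["spend"] for r in rows) / len(rows)

print(f"approximate mean: {approx:.2f}, exact mean: {exact:.2f}")
```

Because every stratum is represented in proportion to its size, the estimate computed from 100 rows stays close to the value computed from all 10,000, which is the trade-off approximate query processing relies on for interactive response times.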