Generating a recommendation associated with an extraction rule for big-data analysis

Publication date: 21/01/2021
Source: WIPO "BigData"
Disclosed is a system for generating a recommendation associated with an extraction rule for big-data analysis. The system may receive a set of data blocks and an extraction rule. The extraction rule comprises an identifier and a first range associated with the identifier in the set of data blocks. Based on provenance knowledge, the system identifies a set of words associated with the identifier in one or more data blocks from the set of data blocks. The provenance knowledge is indicative of a relationship between each word and the identifier. The system then identifies a set of locations associated with the set of words in the one or more data blocks, and generates a second range associated with the identifier based on the set of locations. Finally, the system recommends the second range for modifying the extraction rule for big-data analysis, based on the resulting change in accuracy.
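
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not define how ranges, provenance knowledge, or accuracy are represented, so everything here is an assumption made for illustration. Data blocks are taken to be plain strings, a range is a pair of character offsets, the provenance knowledge is a dictionary from identifier to related words, and accuracy is the fraction of blocks in which the range fully contains a related word. The names `ExtractionRule`, `find_locations`, `propose_range`, `accuracy`, and `recommend` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExtractionRule:
    identifier: str
    start: int  # inclusive character offset (assumed representation of a range)
    end: int    # exclusive character offset


def find_locations(blocks, words):
    """Return (start, end) offsets of every occurrence of every word in every block."""
    locations = []
    for block in blocks:
        for word in words:
            pos = block.find(word)
            while pos != -1:
                locations.append((pos, pos + len(word)))
                pos = block.find(word, pos + 1)
    return locations


def propose_range(locations):
    """Smallest range covering all locations, or None if there are none."""
    if not locations:
        return None
    return min(s for s, _ in locations), max(e for _, e in locations)


def accuracy(blocks, words, start, end):
    """Assumed metric: share of blocks whose slice [start, end) contains a related word."""
    if not blocks:
        return 0.0
    hits = sum(any(w in block[start:end] for w in words) for block in blocks)
    return hits / len(blocks)


def recommend(blocks, rule, provenance):
    """Return a modified rule if the second range improves accuracy, else None."""
    words = provenance.get(rule.identifier, [])
    proposed = propose_range(find_locations(blocks, words))
    if proposed is None:
        return None
    old = accuracy(blocks, words, rule.start, rule.end)
    new = accuracy(blocks, words, *proposed)
    # Recommend only when the accuracy change is positive.
    if new > old:
        return ExtractionRule(rule.identifier, *proposed)
    return None


# Hypothetical sample data, invented for this sketch.
blocks = ["id=1 total: 42 USD", "id=22 amount: 7 USD"]
provenance = {"invoice_total": ["total", "amount"]}
rule = ExtractionRule("invoice_total", 0, 4)  # first range misses the related words

print(recommend(blocks, rule, provenance))
# ExtractionRule(identifier='invoice_total', start=5, end=12)
```

In this sketch the first range (offsets 0 to 4) covers none of the related words, so its assumed accuracy is 0.0, while the second range (offsets 5 to 12) covers a related word in both blocks. A real system would presumably measure accuracy against labelled extraction results rather than word containment.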