Fecha de publicación:
21/09/2016
Fuente: Wipo "BigData"
The invention relates to the field of natural language processing, in particular to an information extraction method based on a bi-directional recurrent neural network. The information extraction method applies the technology of the bi-directional recurrent neural network, the basic elements, which include characters, words, punctuations and the like, of a text are subjected to the automatic learning of characteristics, series modeling is carried out through the RNN (Recurrent Neural Network), and the defect that the characteristics need to be manually set in a traditional way is overcome. In addition, the bi-directional communication RNN is used to overcome the problem of information asymmetry in a prediction process of a unidirectional RNN, so that the classification judgment result of a natural language series to be identified depends on both preamble information and postamble information, and therefore, information extraction and judgment accuracy is higher. The method is especially suitable for entity name extraction in big data analysis, and has an important application value in the big data analysis.