文章摘要
Ouyang Xiaoye(欧阳小叶)* **,Chen Shudong* **,Wang Rong***.[J].高技术通讯(英文),2021,27(4):373~380
Positive unlabeled named entity recognition with multi-granularity linguistic information
  
DOI:10.3772/j.issn.1006-6748.2021.04.005
中文关键词: 
英文关键词: named entity recognition (NER), deep learning, neural network, positive-unlabeled learning, label-few domain, multi-granularity (PU)
基金项目:
Author NameAffiliation
Ouyang Xiaoye(欧阳小叶)* ** (*Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, P.R.China) (**University of Chinese Academy of Sciences, Beijing 100049, P.R.China) (***Key Laboratory of Space Object Measurement Department, Beijing Institute of Tracking and Telecommunications Technology, Beijing 100094, P.R.China) 
Chen Shudong* ** (*Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, P.R.China) (**University of Chinese Academy of Sciences, Beijing 100049, P.R.China) (***Key Laboratory of Space Object Measurement Department, Beijing Institute of Tracking and Telecommunications Technology, Beijing 100094, P.R.China) 
Wang Rong*** (*Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, P.R.China) (**University of Chinese Academy of Sciences, Beijing 100049, P.R.China) (***Key Laboratory of Space Object Measurement Department, Beijing Institute of Tracking and Telecommunications Technology, Beijing 100094, P.R.China) 
Hits: 986
Download times: 883
中文摘要:
      
英文摘要:
      The research on named entity recognition for label-few domain is becoming increasingly important. In this paper, a novel algorithm, positive unlabeled named entity recognition (PUNER) with multi-granularity language information, is proposed, which combines positive unlabeled (PU) learning and deep learning to obtain the multi-granularity language information from a few labeled instances and many unlabeled instances to recognize named entities.First, PUNER selects reliable negative instances from unlabeled datasets, uses positive instances and a corresponding number of negative instances to train the PU learning classifier, and iterates continuously to label all unlabeled instances. Second, a neural network-based architecture to implement the PU learning classifier is used, and comprehensive text semantics through multi-granular language information are obtained, which helps the classifier correctly recognize named entities. Performance tests of the PUNER are carried out on three multilingual NER datasets, which are CoNLL2003, CoNLL 2002 and SIGHAN Bakeoff 2006. Experimental results demonstrate the effectiveness of the proposed PUNER.
View Full Text   View/Add Comment  Download reader
Close

分享按钮