Ontology-Based Information Extraction of Crop Diseases on Chinese Web Pages
|
Title | Ontology-Based Information Extraction of Crop Diseases on Chinese Web Pages |
Authors | |
Abstract | This paper proposes a method for extracting information of crop diseases on Chinese web pages. First, we define some special labels of the DOM tree[1] to partition the web page into some content blocks. Then the noise content in the web pages is eliminated according to the location and the word number of a content block. We employ an ontology-based way to implement information extraction from the content blocks. A top-down method is adopted to construct the ontology of crop diseases. In the extraction process, the concepts, relations and instances of ontology is used to extract the entities. The event is extracted by an optimal classification of paragraph groups in a content block. Experiments demonstrate the performance of the proposed method is satisfactory. |
Publisher | ACADEMY PUBLISHER |
Date | 2013-01-01 |
Source | Journal of Computers Vol 8, No 1 (2013): Special Issue: Parallel Architecture, Algorithms and Programming |
Rights | Copyright © ACADEMY PUBLISHER - All Rights Reserved.To request permission, please check out URL: http://www.academypublisher.com/copyrightpermission.html. |