Simple Classification into Large Topic Ontology of Web Documents
Journal Title CIT. Journal of Computing and Information Technology
Journal Abbreviation CIT
Publisher Group University of Zagreb
Website http://cit.srce.unizg.hr/index.php/CIT
Title Simple Classification into Large Topic Ontology of Web Documents
Authors Grobelnik, Marko; Mladenić, Dunja
Abstract The paper presents an approach to classifying Web documents into a large topic ontology. The main emphasis is on a simple approach that can handle a large ontology, supplied with enriched data that adds information about the Web page's context obtained from the link structure of the Web. The context is generated from the incoming and outgoing links of the Web document to be classified (the target document): a document is represented not only by its own text, but also by the text of the documents that point to it and the text of the documents it points to. The idea is that the enriched data compensates for the simplicity of the approach while keeping it efficient and capable of handling a large topic ontology.
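The link-context representation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`tokenize`, `enriched_bag_of_words`), the toy documents, the simple lowercase tokenizer, and the equal weighting of the target text and its neighbours' text are all assumptions made for illustration only.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def enriched_bag_of_words(target_id, texts, out_links):
    """Build a bag of words for the target document plus its link context.

    texts:     hypothetical mapping of document id -> document text
    out_links: hypothetical mapping of document id -> ids of documents it links to
    """
    # Documents pointing to the target (incoming links).
    in_neighbours = [src for src, dsts in out_links.items() if target_id in dsts]
    # Documents the target points to (outgoing links).
    out_neighbours = list(out_links.get(target_id, []))

    # Start from the target's own text, then add the neighbours' text.
    # Equal weighting of all three sources is an assumption of this sketch.
    bag = Counter(tokenize(texts[target_id]))
    for doc_id in in_neighbours + out_neighbours:
        bag.update(tokenize(texts.get(doc_id, "")))
    return bag


# Toy example: "b" links to "a", and "a" links to "c".
texts = {
    "a": "football scores",
    "b": "league table football",
    "c": "match tickets",
}
out_links = {"b": ["a"], "a": ["c"]}

bag = enriched_bag_of_words("a", texts, out_links)
```

In this toy example the representation of document "a" also contains the words of "b" (an incoming link) and "c" (an outgoing link). Any classifier that accepts bag-of-words input could then be applied to the enriched representation.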
Publisher University of Zagreb, University Computing Centre - SRCE
Date 2005
Source Journal of Computing and Information Technology Vol 13, No 4 (2005)
Rights CIT. Journal of Computing and Information Technology is an open access journal. Authors who publish with this journal agree to the following terms: Authors retain copyright and grant the journal right of first publication, with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (see The Effect of Open Access).

 
