Logo Goletty

Using Inverted Files to Compress Text
Journal Title CIT. Journal of Computing and Information Technology
Journal Abbreviation CIT
Publisher Group University of Zagreb
Website http://cit.srce.unizg.hr/index.php/CIT
PDF (117 kb)
   
Title Using Inverted Files to Compress Text
Authors Ristov, Strahil
Abstract This is the first report on a new approach to text compression. It consists of representing the text file with compressed inverted file index in conjunction with very compact lexicon, where lexicon includes every word in the text. The index is compressed using standard index compression techniques, and lexicon is compressed by original dictionary compression method that gives better compression results than existing procedures. Compression procedure is complex, but decompression time is linear with the file size, although it requires two passes and hence can not be performed online. First experiments show that this method, when refined, can be competitive for larger texts that only need to be decompressed in the real time.
Publisher University of Zagreb, University Computing Centre - SRCE
Date 1970-01-01
Source Journal of Computing and Information Technology Vol 10, No 3 (2002): Special Issue on ITI 2002 - Information Technology Interfaces
Rights CIT. Journal of Computing and Information Technology is an open access journal.Authors who publish with this journal agree to the following terms:Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work´s authorship and initial publication in this journal.Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal´s published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

 

See other article in the same Issue


Goletty © 2024