Logo Goletty

A Web Crawler System Design Based on Distributed Technology
Journal Title Journal of Networks
Journal Abbreviation jnw
Publisher Group Academy Publisher
Website http://ojs.academypublisher.com
PDF (419 kb)
   
Title A Web Crawler System Design Based on Distributed Technology
Authors Deng, Zhijuan; Zhong, Shaojun
Abstract A practical distributed web crawler architecture is designed. The distributed cooperative grasping algorithm is put forward to solve the problem of distributed Web Crawler grasping. Log structure and Hash structure are combined and a large-scale web store structure is devised, which can meet not only the need of a large amount of random accesses, but also the need of newly added pages. Experiment results have shown that the distributed Web Crawlers performance, scalability, and load balance are better.
Publisher ACADEMY PUBLISHER
Date 2011-12-01
Source Journal of Networks Vol 6, No 12 (2011)
Rights Copyright © ACADEMY PUBLISHER - All Rights Reserved.To request permission, please check out URL: http://www.academypublisher.com/copyrightpermission.html. 

 

See other article in the same Issue


Goletty © 2024