Information Retrieval System: a Domain Specific Parallel Crawler

Nidhi Tyagi

The World Wide Web is an interlinked collection of billions of documents formatted using HTML. Because the web is large, growing, and dynamic, traversing the web documents and handling all the URLs they contain has become a challenge, which makes it imperative to parallelize the crawling process. The crawling process is parallelized as an ecology of crawler workers that download information from the web in parallel. This paper proposes a novel parallel crawler architecture based on domain-specific crawling. The approach makes the crawling task more effective and scalable, and it shares the load among the crawlers, each of which downloads, in parallel with the others, the web pages belonging to URLs of a specific domain.
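
The description above does not say how the book assigns URLs to crawlers, so the following is only a minimal sketch of the general idea of domain-specific parallel crawling, not the author's architecture. It assumes that a "domain" means the top-level domain of a URL's host (for example `edu` or `com`), and the names `domain_of`, `partition_by_domain`, `crawl_domain`, `parallel_crawl`, and the caller-supplied `fetch` function are all hypothetical.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from urllib.parse import urlsplit


def domain_of(url):
    """Return the last label of the URL's host, e.g. 'edu' (assumed notion of 'domain')."""
    host = urlsplit(url).hostname or ""
    return host.rsplit(".", 1)[-1].lower() if "." in host else "other"


def partition_by_domain(urls):
    """Group URLs into one queue per domain so each crawler gets its own share."""
    queues = defaultdict(list)
    for url in urls:
        queues[domain_of(url)].append(url)
    return dict(queues)


def crawl_domain(domain, urls, fetch):
    """One crawler worker: download every URL in its domain queue."""
    return domain, {url: fetch(url) for url in urls}


def parallel_crawl(urls, fetch, max_workers=4):
    """Run one crawl task per domain on a thread pool; return pages keyed by domain."""
    queues = partition_by_domain(urls)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(crawl_domain, d, q, fetch) for d, q in queues.items()]
        return dict(f.result() for f in futures)
```

Here `fetch` is any function that takes a URL and returns the page content; passing a stub such as `lambda url: "page:" + url` lets the partitioning and load-sharing logic be tried without network access.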

Media	Books Paperback Book (book with soft covers and a glued spine)
Release date	Wednesday, August 24, 2011
ISBN13	9783639377798
Publisher	VDM Verlag Dr. Müller
Pages	92
Dimensions	150 × 6 × 226 mm · 145 g
Language	English
