<sub id="jznnx"></sub>

<address id="jznnx"></address>

    <sub id="jznnx"></sub>

        <address id="jznnx"></address>

          ************

          淺論外文翻譯___基于網絡爬蟲的有效URL緩存

          導讀:外文翻譯---基于網絡爬蟲的有效URL緩存
          n 5, and our remendations for practical algorithms and
          data structures for URL caching are presented in Section 6. Section 7 contains our conclusions and directions for further research.
          2. CRAWLING
          Web crawlers are almost as old as the web itself, and numerous crawling systems have been described in the literature. In this section, we present a brief survey of these crawlers (in historical order) and then discuss why most of these crawlers could benefit from URL caching.
          The crawler used by the Inter Archive [10] employs multiple crawling processes, each of which performs an exhaustive crawl of 64 hosts at a time. The crawling processes save non-local URLs to disk; at the end of a crawl, a batch job adds these URLs to the per-host seed sets of the next crawl.
          The original Google crawler, described in [7], implements the different crawler ponents as different processes. A single URL server process maintains the set of URLs to download; crawling processes fetch pages; indexing processes extract words and links; and URL resolver processes convert relative into absolute URLs, which are then fed to the URL Server. The various processes municate via the file system. For the experiments
          上一篇論文:探討論文寫作 下一篇論文:淺議機電工程學院畢業設計規范(改)
          相關論文
          業務范圍
          免費本科范文
          免費碩士范文
          免費職稱范文
          論文****
          職稱論文****表
          五分pk10 www.58777n.com | 4025w.com | www.9680app.com | www.50024t.com | 0907hb.com | www.7036ee.com | www.097wy.com | 822063.com | www.hg7664.com | www.770816.com | 55797u.com | www.5966rrr.com | www.0586777.com | 7003k.com | www.4260022.com | www.591733.com | 44077u.com | www.15365z.com | hd2629.com | www.9068uu.com | www.1466b.com | cc63777.com | www.v1186.com | www.314611.com | www.8520z.com | www.4323c.com | 5802ll.com | www.hg2216.com | www.560103.com | 838388p.com | www.6880ww.com | yth003.net | www.2221188.com | www.369073.com | www.033033f.com | www.5446l.com | 1489n.com | www.603234.com | www.109307.com | www.86339s.com | yinhe100.cc | www.yh8396.com | hg15515.com | 17159455.com | www.flb677.com | 77339193.com | www.50999f.com | 30007a.com | www.22gg940.com | www.774808.com | 20188n.com | www.66332r.com | wns28b.com | www.8866kcd.com | 2019lcc | www.92518.com | www.271902.com | www.vns6266.com | www.h679.com | 3245m.com | www.904029.com | 67890nnn.com | www.828177.com | a81570.com | www.7893.ag | 9949.com | www.68003.com | www.021037.com | www.j223344.com |