
          Foreign-Language Translation: Efficient URL Caching for Web Crawlers

          Introduction: Foreign-language translation of "Efficient URL Caching for Web Crawlers"
          pages in many ways, among them direct URL submission, paid inclusion, and URL extraction from non-web sources, but the bulk of the corpus is obtained by recursively exploring the web, a process known as crawling or spidering. The basic algorithm is:

          (a) Fetch a page.
          (b) Parse it to extract all linked URLs.
          (c) For all the URLs not seen before, repeat (a)–(c).

          Crawling typically starts from a set of seed URLs, made up of URLs obtained by other means as described above and/or URLs collected during previous crawls. Sometimes crawls are started from a single well-connected page, or from a directory such as yahoo.com, but in this case a relatively large portion of the web (estimated at over 20%) is never reached. See [9] for a discussion of the graph structure of the web that leads to this phenomenon.
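Steps (a)–(c) and the role of the seed set can be sketched in a few lines of Python. This is a minimal illustration, not the crawler the paper describes: the names `crawl`, `LinkExtractor`, `fetch`, and `max_pages` are hypothetical, the "seen before" test is a plain in-memory set, and the frontier is processed in first-in, first-out order, which is an arbitrary choice for this sketch. The `fetch` argument is assumed to be any callable that maps a URL to its HTML text, or to `None` when the page cannot be retrieved.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href targets of <a> tags found in one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    # Resolve relative links against the page's own URL.
                    self.links.append(urljoin(self.base_url, value))


def crawl(seeds, fetch, max_pages=100):
    """Return the URLs successfully fetched, in order, starting from `seeds`.

    `fetch` is a hypothetical callable: URL -> HTML text, or None on failure.
    """
    frontier = deque(dict.fromkeys(seeds))  # discovered but not yet fetched
    seen = set(frontier)                    # every URL ever queued
    fetched = []
    while frontier and len(fetched) < max_pages:
        url = frontier.popleft()
        html = fetch(url)                   # (a) fetch a page
        if html is None:
            continue
        fetched.append(url)
        parser = LinkExtractor(url)
        parser.feed(html)                   # (b) extract all linked URLs
        for link in parser.links:
            if link not in seen:            # (c) follow only unseen URLs
                seen.add(link)
                frontier.append(link)
    return fetched


# A tiny in-memory "web" stands in for real HTTP requests.
FAKE_WEB = {
    "http://example.com/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.com/a": '<a href="/b">B</a> <a href="/">home</a>',
    "http://example.com/b": '<a href="/c">C</a>',
}

if __name__ == "__main__":
    for page in crawl(["http://example.com/"], FAKE_WEB.get):
        print(page)
```

In this sketch the `seen` set grows with every URL discovered, so the membership test in step (c) is the part that becomes expensive as a crawl scales up.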
          If we view web pages as nodes in a graph, and hyperlinks as directed edges among these nodes, then crawling becomes a process known in mathematical circles as graph traversal. Various strategies for graph traversal differ in their choice of which node among the nodes not yet explored to explore next. Two standard strategies for graph traversal are Depth First Search (DFS) and Breadth First