
en.wikipedia.org/wiki/Web_crawler
Preview meta tags from the en.wikipedia.org website.
Linked Hostnames
99- 313 links toen.wikipedia.org
- 23 links toweb.archive.org
- 22 links todoi.org
- 11 links toapi.semanticscholar.org
- 6 links tociteseerx.ist.psu.edu
- 4 links tofoundation.wikimedia.org
- 3 links towww.researchgate.net
- 3 links towww.wikidata.org
Thumbnail

General Meta Tags
10- titleWeb crawler - Wikipedia
- charsetUTF-8
- ResourceLoaderDynamicStyles
- generatorMediaWiki 1.45.0-wmf.15
- referrerorigin
Open Graph Meta Tags
11- og:imagehttps://upload.wikimedia.org/wikipedia/commons/thumb/d/df/WebCrawlerArchitecture.svg/1200px-WebCrawlerArchitecture.svg.png
- og:image:width1200
- og:image:height917
- og:imagehttps://upload.wikimedia.org/wikipedia/commons/thumb/d/df/WebCrawlerArchitecture.svg/800px-WebCrawlerArchitecture.svg.png
- og:image:width800
Link Tags
58- EditURI//en.wikipedia.org/w/api.php?action=rsd
- alternate//en.m.wikipedia.org/wiki/Web_crawler
- alternate/w/index.php?title=Web_crawler&action=edit
- alternate/w/index.php?title=Special:RecentChanges&feed=atom
- apple-touch-icon/static/apple-touch/wikipedia.png
Links
485- http://archive.ncsa.uiuc.edu/SDG/IT94/Proceedings/Agents/spetka/spetka.html
- http://chato.cl/papers/baeza05_crawling_country_better_breadth_first_web_page_ordering.pdf
- http://chato.cl/research/crawling_thesis
- http://cis.poly.edu/tr/tr-cis-2001-03.pdf
- http://clgiles.ist.psu.edu/papers/VLDB-2000-focused-crawling.pdf