Web Crawler
(Redirected from Web Crawling System)
Jump to navigation
Jump to search
A Web Crawler is an Internet Bot that systematically browses the World Wide Web.
- AKA: Web Crawling System.
- Context:
- It can (often) support a Web Indexing Task.
- It can implement a Web Crawling Algorithm.
- It can range from being a Focused Web Crawler to being a General Web Crawler.
- See: FOAF (Software), Web Search Engine, Web Content, Index (Search Engine), User (Computing), Hyperlink, HTML, Web Scraping, Data-Driven Programming, Offline Reader.
References
2014
- (Wikipedia, 2014) ⇒ http://en.wikipedia.org/wiki/web_crawler Retrieved:2014-5-31.
- A Web crawler is an Internet bot that systematically browses the World Wide Web, typically for the purpose of Web indexing. A Web crawler may also be called a Web spider, an ant, an automatic indexer, or (in the FOAF software context) a Web scutter. [1]
Web search engines and some other sites use Web crawling or spidering software to update their web content or indexes of others sites' web content. Web crawlers can copy all the pages they visit for later processing by a search engine that indexes the downloaded pages so that users can search them much more quickly.
Crawlers can validate hyperlinks and HTML code. They can also be used for web scraping (see also data-driven programming).
- A Web crawler is an Internet bot that systematically browses the World Wide Web, typically for the purpose of Web indexing. A Web crawler may also be called a Web spider, an ant, an automatic indexer, or (in the FOAF software context) a Web scutter. [1]