Chapter 6. Intelligent web crawling
This chapter covers |
|
No one knows the exact number of web pages on the Internet. But we do know that the World Wide Web is
- Huge, with billions of web pages
- Dynamic, with pages being constantly added, removed, or updated
- Growing rapidly
Given the huge amount of information available on the Internet, how does one find information of interest?
In this chapter, we continue our theme of gathering information from outside one’s application. You’ll be introduced to the field of intelligent web crawling to retrieve relevant information. Search engines crawl the web periodically ...
Get Collective Intelligence in Action now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.