Web Crawling
The WWW is a directed graph (pages are nodes, hyperlinks are edges)
Use your favorite graph traversal algorithm (e.g. breadth-first or depth-first search)
Netizenship issues (crawl politely: honor robots.txt, limit request rate)
Starting points:
individual page
set of pages
domain name search (useful because the web isn't necessarily connected, so following links from one page can miss whole sites)
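
The traversal idea above can be sketched as a breadth-first search over a link graph. This is a minimal illustration, not a real crawler: the `LINKS` dictionary, the page names, and the `crawl` function are all hypothetical stand-ins for fetching pages and extracting their links, and no network access or politeness logic is included.

```python
from collections import deque

# Hypothetical toy link graph: page -> pages it links to.
# The a.example and b.example sites do not link to each other,
# so the graph is not connected.
LINKS = {
    "a.example/index": ["a.example/about", "a.example/news"],
    "a.example/about": ["a.example/index"],
    "a.example/news": ["a.example/about"],
    "b.example/home": ["b.example/docs"],
    "b.example/docs": [],
}


def crawl(seeds, links):
    """Breadth-first traversal from one or more seed pages.

    Returns the pages in the order they were visited.
    """
    visited = set()
    order = []
    queue = deque()
    # Enqueue every starting point, skipping duplicates.
    for seed in seeds:
        if seed not in visited:
            visited.add(seed)
            queue.append(seed)
    while queue:
        page = queue.popleft()
        order.append(page)
        # A real crawler would fetch the page and parse its links here.
        for target in links.get(page, []):
            if target not in visited:
                visited.add(target)
                queue.append(target)
    return order


if __name__ == "__main__":
    print(crawl(["a.example/index"], LINKS))
    print(crawl(["a.example/index", "b.example/home"], LINKS))
```

Starting from the single page `a.example/index` never reaches the `b.example` pages, while starting from a set of pages that includes `b.example/home` does, which is why the choice of starting points matters when the graph is not connected.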