Protocols Used in Child URLs

As discussed above, we extracted child URLs from all HTML documents in our data set. We examined the distribution of protocols in this set of child URLs. By far, the most dominant protocol observed was HTTP (there were an average of 17 HTTP URLs per document).


Protocol Usage