Protocols Used in Child URLs
As discussed above, we extracted child URLs from all HTML documents
in our data set.
We examined the distribution of protocols in
this set of child URLs.
HTTP was by far the dominant protocol observed,
with an average of 17 HTTP URLs per document.
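
The measurement described above can be illustrated with a minimal sketch. It is not the tooling used in this study: the names ChildURLCollector, protocol_counts, and mean_per_document are hypothetical, and the sketch assumes that child URLs are the values of href and src attributes and that URLs without a scheme are counted as relative rather than resolved against a base URL.

```python
from collections import Counter
from html.parser import HTMLParser
from urllib.parse import urlsplit

# Assumption of this sketch: child URLs appear in href and src attributes.
URL_ATTRS = {"href", "src"}


class ChildURLCollector(HTMLParser):
    """Collects the raw URL strings found in one HTML document."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases attribute names before calling this method.
        for name, value in attrs:
            if name in URL_ATTRS and value:
                self.urls.append(value)


def protocol_counts(html_text):
    """Return a Counter mapping protocol (URL scheme) to number of child URLs."""
    collector = ChildURLCollector()
    collector.feed(html_text)
    collector.close()
    counts = Counter()
    for url in collector.urls:
        scheme = urlsplit(url).scheme.lower()
        # URLs with no scheme are grouped under a placeholder label.
        counts[scheme or "(relative)"] += 1
    return counts


def mean_per_document(documents, protocol):
    """Average number of child URLs using `protocol` across all documents."""
    if not documents:
        return 0.0
    total = sum(protocol_counts(doc)[protocol] for doc in documents)
    return total / len(documents)
```

Calling mean_per_document(documents, "http") on a list of HTML strings yields the kind of per-document average reported above; protocol_counts gives the full distribution for a single document.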
Protocol Usage