PageRank (cont)
Approach: the random web surfer
- if we randomly follow links in the web …
- … more likely to re-discover pages with many inbound links
curr=random page, prev=null
for a long time do
| if curr not in array ranked[] then
| rank[curr]=0
| end if
| rank[curr]=rank[curr]+1
| if random(0,100)<85 then // with 85% chance ...
| prev=curr
| curr=choose hyperlink from curr // ... crawl on
| else
| curr=random page // avoid getting stuck
| prev=null
| end if
end for
|
Could be accomplished while we crawl web to build search index
|