@andybaio a *long* time ago I set up a mirror of DMOZ and started (slowly) crawling the content of the websites within it, with the intent to build a full-text search facility, but I ran out of time and resources. So I would find such a thing interesting.