Two common techniques for archiving websites are using a web crawler or soliciting user submissions: On 12 February 2001, Google acquired the usenet discussion group archives from Deja.com and turned it into their Google Groups service.
The Internet Archive is building a compendium of websites and digital media.
[3] Nextpoint offers an automated cloud-based, SaaS for marketing, compliance, and litigation related needs including electronic discovery.
They employ their PANDAS (PANDORA Digital Archiving System) when building their catalog.
textfiles.com is a large library of old text files maintained by Jason Scott Sadofsky.