2004 U. S. Federal Agency Web Harvest « http://www.webharvest.gov/ | | A harvest of federal agency public web sites as they existed prior to january 20 , 2005. |
Archive-It. org « http://www.archive-it.org/ | | A subscription service from the internet archive , which allows institutions to build , manage and search their own web archive. includes the sites of universities , libraries , and special interest collections of websites. |
DevArchives « http://www.devarchives.com | | Contains archives of faqs , mailing lists , and newsgroups all related to developer/programming/it. free. |
First Monday: Internet Time and the Reliability of Search Engines « http://www.firstmonday.org/issues/issue9_10/wouters/ | | Journal article by paul wouters , iina hellsten , and loet leydesdorff. examines the consequences and implications of internet search engines continuously reconstructing the past by updating their indices. |
Ghost Sites « http://www.disobey.com/ghostsites/index.shtml | | Long running online " museum" provides screenshots of defunct sites. |
Google Groups « http://groups.google.com/ | | Searchable archive of more than 700 million usenet postings from a period of more than 20 years. |
MINERVA: Mapping the INternet Electronic Resources Virtual Archive « http://lcweb2.loc.gov/cocoon/minerva/html/minerva-home.html | | Library of congress web archiving project aims to collect and preserve web sites useful in serving the current or future informational needs of congress and researchers. |
NewsletterArchive. org « http://newsletterarchive.org | | Aims to archive and make available to the public all email newsletters and electronic mailing lists. it will rely on user contributions for its content. |
NoveltyNet « http://www.noveltynet.org/ | | A site where people can submit orphaned content to be archived and kept available. |
Pandora Archive « http://pandora.nla.gov.au | | Australia's web archive , established initially by the national library of australia , and now built in collaboration with nine other australian libraries and cultural collecting organisations. |
Searchenginewatch. com - It's Tough to Get a Good Date with a Search Engine « http://searchenginewatch.com/showPage.html?page=2160061 | | Article by gary price and genie tyburski. explores the question of " what is a date on the web?" and notes that a searcher may be misled by the results of searches restricted by date. |
Textfiles « http://www.textfiles.com/ | | Contains information gathered from bbs's in the early days of the internet. |
The European Archive « http://www.europarchive.org/ | | Digital library of cultural artifacts in digital form. the collection includes public information films , recordings and web harvest of political related and government websites. |
The Internet Archive « http://www.archive.org | | Nonprofit organisation established to preserve web sites by taking regular " snapshots" . the wayback machine provides links to older versions of a webpage. there are special collections , for example on web pioneers. |
The Register: The Web as Historical Record « http://www.theregister.co.uk/2004/05/04/web_historical_record/ | | Essay by peter abrahams pointing out " one of the weaknesses of most search engines and the web itself: you cannot sort by date. " |