Both sides previous revision Previous revision Next revision | Previous revision |
dead_sites_archive [2023/04/21 12:25] – [Sites List] sam | dead_sites_archive [2023/04/21 12:53] (current) – sam |
---|
====== Dead Sites Archive ====== | ====== Dead Sites Archive ====== |
| |
There are a number of smaller websites run by enthusiasts that are at risk of being lost as the internet ages. Inspired by Archive Team, I decided to archive some of the more important/valuable ones. I will not share them unless the site has been taken offline, and will respect any verified requests to remove content. These sites are presented here for personal and reference use only. | There are a number of smaller websites run by enthusiasts that are at risk of being lost as the internet ages. The non-profit [[https://archive.org/|Web Archive]] is the best resource to quickly find web content that's no longer available online. However, one cannot guarantee that all content has been crawled or that a particular website has been crawled at all. |
| |
| Inspired by [[https://wiki.archiveteam.org/|Archive Team]], I decided to archive some of the more important/valuable ones myself using [[https://github.com/ArchiveTeam/grab-site|grab-site]]. I will not share them unless the site has been taken offline, and will respect any verified requests to remove content for copyright reasons. These sites are presented here for personal and reference use only. |
| |
===== How to Use ===== | ===== How to Use ===== |
| |
The sites have been archived using grab-site and stored in WARC format. Download the site you are looking for from the list below. You have two options to view these files: | You can first try using the Web Archive link provided to browse for the content you're looking for. It doesn't require downloading any large files. |
| |
* [[https://github.com/webrecorder/replayweb.page/releases|Download the latest replayweb executable]] for your environment and view in there | If a WARC link is available, you can browse the archive that I have made. Download the site you are looking for from the list below. You have two options to view these files: |
* Open in [[https://replayweb.page/]] (I find this far slower to load than using the local executable) | |
| * [[https://github.com/webrecorder/replayweb.page/releases|Download the latest replayweb executable]] for your operating system and open the file in there (recommended) |
| * Open in [[https://replayweb.page/]] (slower) |
| |
===== Sites List ===== | ===== Sites List ===== |
| |
| ---- struct table ---- |
| schema: site |
| filter: tags = dead |
| cols: name, url, country, description, webarchive, warc, warc-size |
| csv: 0 |
| ---- |
| |
| |