Archiveteam Project Collects Lost Web 2.0 Content
Many users keep their emails with webmail services, wedding pictures in photo communities and reading habits with social bookmarking services. What happens, though, when data is lost or websites fold? Archiveteam wants to help in those circumstances.
The Archiveteam wiki provides various assistance so that your personal photo album and other files don't end up in the ether. Assistance includes instructions and documentation about file formats and storage media, much of which are in early phases of development. In a more progressed state is the team's Deathwatch page with a continually updated list of websites that have gone kaput or are about to go that way. Among them, Yahoo's Geocities site and the already closed Furl and Tripod.
Under the rubric Software, the project collects tools, tips and tricks. Included is the GNU wget command that, with some appropriate parameters, secures a complete Wordpress blog on a local hard drive. Some site-specific pages relate to Google, Livejournal and Twitter.
One of the Archiveteam founders is Jason Scott, whose textfiles.com site has been archiving text data off the network from the 1980s and 90s. The young Archiveteam is looking for fellow archivers to write articles and manuals, set up mirror servers and bittorents and form a download task force.
Debian developer Joey Hess has already had thoughts (in a blog) about a GUI program for rescuing Web 2.0 data. Ideally the user would simply enter a list of URLs or a bookmark file and the program would take care of the rest: plugins appropriate to the service or website would handle the work, including a generic one for sites with RSS feeds. Hess is collecting "thoughts, comments, prior art [and] cute program idea names." Some have come his way already.
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.
News
-
Kali Linux Waxes Nostalgic with BackTrack Mode
For those who've used Kali Linux since its inception, the changes with the new release are sure to put a smile on your face.
-
Gnome 50 Smooths Out NVIDIA GPU Issues
Gamers rejoice, your favorite pastime just got better with Gnome 50 and NVIDIA GPUs.
-
System76 Retools Thelio Desktop
The new Thelio Mira has landed with improved performance, repairability, and front-facing ports alongside a high-quality tempered glass facade.
-
Some Linux Distros Skirt Age Verification Laws
After California introduced an age verification law recently, open source operating system developers have had to get creative with how they deal with it.
-
UN Creates Open Source Portal
In a quest to strengthen open source collaboration, the United Nations Office of Information and Communications Technology has created a new portal.
-
Latest Linux Kernel RC Contains Changes Galore
Linux kernel 7.0-rc3 includes more changes than have been made in a single release in recent history.
-
Nitrux 6.0 Now Ready to Rock Your World
The latest iteration of the Debian-based distribution includes all kinds of newness.
-
Linux Foundation Reports that Open Source Delivers Better ROI
In a report that may surprise no one in the Linux community, the Linux Foundation found that businesses are finding a 5X return on investment with open source software.
-
Keep Android Open
Google has announced that, soon, anyone looking to develop Android apps will have to first register centrally with Google.
-
Kernel 7.0 Now in Testing
Linus Torvalds has announced the first Release Candidate (RC) for the 7.x kernel is available for those who want to test it.
