Archiveteam Project Collects Lost Web 2.0 Content
Many users keep their emails with webmail services, wedding pictures in photo communities and reading habits with social bookmarking services. What happens, though, when data is lost or websites fold? Archiveteam wants to help in those circumstances.
The Archiveteam wiki provides various assistance so that your personal photo album and other files don't end up in the ether. Assistance includes instructions and documentation about file formats and storage media, much of which are in early phases of development. In a more progressed state is the team's Deathwatch page with a continually updated list of websites that have gone kaput or are about to go that way. Among them, Yahoo's Geocities site and the already closed Furl and Tripod.
Under the rubric Software, the project collects tools, tips and tricks. Included is the GNU wget command that, with some appropriate parameters, secures a complete Wordpress blog on a local hard drive. Some site-specific pages relate to Google, Livejournal and Twitter.
One of the Archiveteam founders is Jason Scott, whose textfiles.com site has been archiving text data off the network from the 1980s and 90s. The young Archiveteam is looking for fellow archivers to write articles and manuals, set up mirror servers and bittorents and form a download task force.
Debian developer Joey Hess has already had thoughts (in a blog) about a GUI program for rescuing Web 2.0 data. Ideally the user would simply enter a list of URLs or a bookmark file and the program would take care of the rest: plugins appropriate to the service or website would handle the work, including a generic one for sites with RSS feeds. Hess is collecting "thoughts, comments, prior art [and] cute program idea names." Some have come his way already.
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.
News
-
Fedora 41 Beta Available with Some Interesting Additions
If you're a Fedora fan, you'll be excited to hear the beta version of the latest release is now available for testing and includes plenty of updates.
-
AlmaLinux Unveils New Hardware Certification Process
The AlmaLinux Hardware Certification Program run by the Certification Special Interest Group (SIG) aims to ensure seamless compatibility between AlmaLinux and a wide range of hardware configurations.
-
Wind River Introduces eLxr Pro Linux Solution
eLxr Pro offers an end-to-end Linux solution backed by expert commercial support.
-
Juno Tab 3 Launches with Ubuntu 24.04
Anyone looking for a full-blown Linux tablet need look no further. Juno has released the Tab 3.
-
New KDE Slimbook Plasma Available for Preorder
Powered by an AMD Ryzen CPU, the latest KDE Slimbook laptop is powerful enough for local AI tasks.
-
Rhino Linux Announces Latest "Quick Update"
If you prefer your Linux distribution to be of the rolling type, Rhino Linux delivers a beautiful and reliable experience.
-
Plasma Desktop Will Soon Ask for Donations
The next iteration of Plasma has reached the soft feature freeze for the 6.2 version and includes a feature that could be divisive.
-
Linux Market Share Hits New High
For the first time, the Linux market share has reached a new high for desktops, and the trend looks like it will continue.
-
LibreOffice 24.8 Delivers New Features
LibreOffice is often considered the de facto standard office suite for the Linux operating system.
-
Deepin 23 Offers Wayland Support and New AI Tool
Deepin has been considered one of the most beautiful desktop operating systems for a long time and the arrival of version 23 has bolstered that reputation.