Synchronizing data with the Git-annex Assistant
Git-annex Assistant is a handy web interface that lets you use the power of Git to synchronize data across several computers.
Git-annex comes directly from the heart of the Linux ecosystem. It lets users manage files in a Git repository and sync them across multiple devices, such as an encrypted archive in the cloud or a backup on an external hard drive or SSH server. Once you have mastered Git-annex, you can accomplish these tasks with ease.
The abundance of options may deter some people from using the tool. This fear is countered by the Git-annex Assistant front end, which hides the complexity behind a modern web interface.
Although packages are available for the most common distributions like Debian, Fedora, and Ubuntu, these usually lag far behind the current state of development. Debian "Wheezy" and Ubuntu "Precise" have version 3.2 in their repositories, which does not yet support the Assistant. Ubuntu "Trusty" at least comes with version 5.2, but because of the fast pace of development, it makes more sense to install the precompiled binary archives .
For this manual installation, you just need to extract the archive and add the
git-annex.linux folder to the path. Listing 1 shows the steps required to modify the path variable, but this is only a temporary change. For a permanent installation, add the
git-annex.linux folder to your
$PATH variable. In Ubuntu, you can do this in the
Adding git-annex.linux to PATH
$ wget "http://downloads.kitenet.net/git-annex/linux/current/git-annex-standalone-amd64.tar.gz" $ tar xzf git-annex-standalone-amd64.tar.gz $ PATH="$PATH:$HOME/git-annex.linux" $ git-annex version git-annex version: 5.20140405-g8729abc build flags: Assistant Webapp Webapp-secure Pairing Testsuite S3 WebDAV Inotify DBus DesktopNotify XMPP DNS Feeds Quvi TDFA CryptoHash key/value backends: SHA256E SHA1E SHA512E SHA224E SHA384E SKEIN256E SKEIN512E SHA256 SHA1 SHA512 SHA224 SHA384 SKEIN256 SKEIN512 WORM URL remote types: git gcrypt S3 bup directory rsync web webdav tahoe glacier hook external
The web interface (Figure 1) is part of Git-annex and was created as a result of a crowdfunding campaign. It complements the extensive set of commands with a focus on simple input screens for creating repositories and configuring repositories in the cloud, including their encryption. Moreover, the front end supports the ability to configure synchronization between data repositories via dialogs and to retrieve information about the status of current operations.
To start the application, type the following
at the command line. The software then automatically opens the browser and calls a URL with the format:
<Token> comprises a long string of letters and numbers. For security reasons, both the port and tokens change with every call. The lower part of Figure 1 lists an overview of currently synchronized repositories. The sample includes local data in the
If you look in this folder, you will only find the hidden directory,
.git. However, if you create a file in the
~/annex folder, the Assistant running in the background automatically creates a Git commit and takes the file into its care. Repositories you create in the web interface are set to "direct mode," so this action is transparent to the user . However, the command line allows you peek behind the curtains to see what exactly Git-annex is doing (Listing 2).
Checking the Log
$ cd ~/annex $ git log commit eef1018fe44e208d80436699ec8d4addecfb0e1c Author: Georg Schönberger <email@example.com> Date: Thu Apr 17 16:11:04 2014 +0200 $ git-annex whereis debian-packaging-tutorial.pdf whereis debian-packaging-tutorial.pdf (1 copy) 4dc52c03-7d5a-4e3e-86ff-22e2b161af3e -- here (gschoenb@gschoenb-X220:~/annex) ok
In a network in which you can access the clients directly via SSH, "local pairing" offers a way of automatically synchronizing files. The members of a workgroup can exchange data on the shared network without a central server. As a starting point, you just need the individual computers with the repositories and an SSH server running on each system.
One client grants another client access to the data: The Add another repository button takes you via Local computer to a Secret Phrase prompt – this is the password for the pairing. Armed with this password, the clients can mutually authorize each other for the data exchange. In pairing, the data is multicast via UDP port 55556; the subsequent synchronization in turn uses SSH .
Authenticating via public keys makes the setup far easier because it removes the need to enter passwords repeatedly. Moreover, it increases security considerably. The result of these pairings are Git remotes, each pointing to the other client (Listing 3). Although the configuration of the remotes could also be handled manually, the web interface simplifies the task considerably.
$ git remote -v annexclt.local_annex ssh://firstname.lastname@example.orgDclt_annex/~/annex/ (fetch) annexclt.local_annex ssh://email@example.comDclt_annex/~/annex/ (push)
The client folders stay in sync after pairing. To add another computer, just repeat the above steps, and migrate the data in the
~/annex folders to the other machines. A data exchange takes place immediately when changes in the directory occur, and this includes removing a file. This method does not protect you against loss due to accidental deletion.
Until now, the sync only took place on the local network. However, a person in the field or on the road does not always have access the workgroup data. A centralized server that all clients can reach provides a remedy.
Buy this article as PDF
Linux Foundation's big event celebrates the 25th anniversary of Linux
Linux has evolved from a “won’t be a professional” project to one of the most professional software projects in the history of computers.
Competitors get in the game with RHEL without Red Hat
Security researchers have already notified Microsoft; some fixes are available
The company is collaborating with Google and Intel to use Kubernetes as an engine for Fuel
Customers can take a free test drive of SLES for HPC on the Azure Cloud
San Francisco-based chip company announces their first fully open source chip platform.
The whole distro gets rebuilt on glibc 2.3
Ubuntu Vendor tries to solve app packaging and distribution problem across distributions.