Needle in a Haystack
What Next?
In this tutorial you have learned why and how to use a tool that automatically scans as many ODF or text files you want, to find any given string. Cool, but why stop here?
The first thing you can do is improve odfgrep
as you please. To work on non-writeable media, for example, you can modify it to create a temporary, complete copy of all the folders to examine in another folder. Alternatively, you can replace the test in Listing 1 (line 11) with another on the basis of the file
command: It would be more complicated, but it would recognize ODF files no matter what their extension.
Another fun and productive line of work is using odfgrep
as a model to build similar tools. A good candidate would be an odfdiff
script that prints out the differences between two ODF documents.
The most important take-home lesson, however, is this: ODF is a format for sophisticated text documents, presentations, and spreadsheets that is very easy to work with and process in very efficient ways. For more proof of this, visit my little "ODF scripting" collection [5], and if you know about other scripts like those, or write new ones, please let me know!
Infos
- "Tutorials – Recoll" by Marco Fioretti, Linux Pro Magazine, issue 212, July 2018, pg. 84: http://www.linuxpromagazine.com/Issues/2018/212/Tutorials-Recoll
- odt2txt: https://github.com/dstosberg/odt2txt
- Code for this article: ftp://linux-magazine.com/pub/listings/linux-magazine.com/213/
- SUID: http://www.linuxnix.com/suid-set-suid-linuxunix/
- ODF scripting: http://freesoftware.zona-m.net/tag/odf-scripting
« Previous 1 2 3
Buy this article as PDF
(incl. VAT)
Buy Linux Magazine
Direct Download
Read full article as PDF:
Price $2.95
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
News
-
An All-Snap Version of Ubuntu is In The Works
Along with the standard deb version of the open-source operating system, Canonical will release an-all snap version.
-
Mageia 9 Beta 2 Ready for Testing
The latest beta of the popular Mageia distribution now includes the latest kernel and plenty of updated applications.
-
KDE Plasma 6 Looks to Bring Basic HDR Support
The KWin piece of KDE Plasma now has HDR support and color management geared for the 6.0 release.
-
Bodhi Linux 7.0 Beta Ready for Testing
The latest iteration of the Bohdi Linux distribution is now available for those who want to experience what's in store and for testing purposes.
-
Changes Coming to Ubuntu PPA Usage
The way you manage Personal Package Archives will be changing with the release of Ubuntu 23.10.
-
AlmaLinux 9.2 Now Available for Download
AlmaLinux has been released and provides a free alternative to upstream Red Hat Enterprise Linux.
-
An Immutable Version of Fedora Is Under Consideration
For anyone who's a fan of using immutable versions of Linux, the Fedora team is currently considering adding a new spin called Fedora Onyx.
-
New Release of Br OS Includes ChatGPT Integration
Br OS 23.04 is now available and is geared specifically toward web content creation.
-
Command-Line Only Peropesis 2.1 Available Now
The latest iteration of Peropesis has been released with plenty of updates and introduces new software development tools.
-
TUXEDO Computers Announces InfinityBook Pro 14
With the new generation of their popular InfinityBook Pro 14, TUXEDO upgrades its ultra-mobile, powerful business laptop with some impressive specs.