Needle in a Haystack

Tutorials – odfgrep

Article from Issue 213/2018

What grep cannot accomplish with LibreOffice and OpenOffice documents, a small odfgrep script can.

If you have a lot of text files, slide shows, and spreadsheets on your computer, sooner or later you will need to know quickly which files contain certain words or sentences. You might also want to use that information to perform other actions automatically, like sending email notifications or adding new records to a database. Sometimes, you can do this with the Recoll desktop search engine described in the previous issue of Linux Pro Magazine [1]. Should you, however, want something lighter or more flexible than Recoll, try odfgrep: It might not only work better, but also teach you other, very efficient ways to manage all your office documents.

What and Why

A really basic knowledge of the command line and Bash syntax is helpful, but not mandatory: The code is short and explained as accurately as possible, to help you learn some basics of shell programming, if needed.

In fact, the hardest part of this whole tutorial may not be the code itself, but figuring out why you might want to learn and use it. In a nutshell, learning how to search or otherwise process ODF files from the command line, with odfgrep or similar tools, can make you a much more productive desktop user, able to delegate many otherwise time-consuming tasks to your computer. That's it, really.
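
To make the idea concrete, here is a minimal sketch of why plain grep falls short and how a few lines of Bash can fill the gap. It is not the odfgrep script itself. It relies on one fact about the format: an ODF file is a ZIP archive whose document text is stored in a member called content.xml, so grep sees only compressed data unless the archive is unpacked first. The function names odf_contains and odf_search are hypothetical, chosen for illustration, and the sketch assumes that the unzip, sed, and grep commands are installed.

```shell
#!/usr/bin/env bash
# Illustrative sketch only: odf_contains and odf_search are hypothetical
# helper names, not part of the odfgrep script.

# Succeed (exit status 0) if the text of one ODF file matches a pattern.
odf_contains() {
    local pattern="$1" file="$2"
    # Print content.xml from the archive to standard output, replace every
    # XML tag with a space, then search case-insensitively and quietly.
    unzip -p "$file" content.xml 2>/dev/null |
        sed -e 's/<[^>]*>/ /g' |
        grep -q -i -e "$pattern"
}

# Print the name of every file, among those given, whose text matches.
odf_search() {
    local pattern="$1"
    shift
    local file
    for file in "$@"; do
        if odf_contains "$pattern" "$file"; then
            printf '%s\n' "$file"
        fi
    done
}
```

After sourcing these functions, a call like odf_search invoice *.odt *.ods would list the matching files, one per line, ready to be piped into another command. Because tags are replaced with spaces, a phrase that is interrupted by formatting markup in the document may not match, and special characters stored as XML entities (such as &amp;) are searched in their encoded form.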

[...]


