Searching Inside Documents with DocFetcher

Productivity Sauce
There are plenty of tools that you can use to find a specific file or document by its name on your local hard disk or remote share. But what if you need to find a document containing a word or text fragment? Enter DocFetcher, a graphical desktop search application that can search inside documents. It supports a wide range of popular document formats, including Microsoft Office, HTML, PDF, RTF, plain text, and OpenOffice.org. If you are running Ubuntu, you can install DocFetcher using its .deb package. For other Linux distributions, you can download an archived version of the application, unpack it, and launch DocFetcher using the DocFetcher.sh script.
Before you can put DocFetcher to use, you have to index the directory containing documents. To do this, right-click on the Search Scope pane and choose Create Index. Select the desired directory, and wait till DocFetcher finishes the indexing operation. You can then search through the indexed documents by specifying the search term you want in the Search field and pressing Enter. DocFetcher returns a list of matching documents sorted by relevance. Select the document you want, and you can see its content with highlighted occurrences of the search string in the Preview pane. Use the Next and Previous buttons to quickly jump to the previous or next occurrence of the search string. Besides simple searches, you can use a variety of supported parameters. For example, to include or exclude a search term you can use the plus (+) and minus (-) operators. You can also use wild cards and limit searches to specific file extensions.
comments powered by DisqusSubscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News
-
TuxCare Announces Support for AlmaLinux 9.2
Thanks to TuxCare, AlmaLinux 9.2 (and soon version 9.6) now enjoys years of ongoing patching and compliance.
-
Go-Based Botnet Attacking IoT Devices
Using an SSH credential brute-force attack, the Go-based PumaBot is exploiting IoT devices everywhere.
-
Plasma 6.5 Promises Better Memory Optimization
With the stable Plasma 6.4 on the horizon, KDE has a few new tricks up its sleeve for Plasma 6.5.
-
KaOS 2025.05 Officially Qt5 Free
If you're a fan of independent Linux distributions, the team behind KaOS is proud to announce the latest iteration that includes kernel 6.14 and KDE's Plasma 6.3.5.
-
Linux Kernel 6.15 Now Available
The latest Linux kernel is now available with several new features/improvements and the usual bug fixes.
-
Microsoft Makes Surprising WSL Announcement
In a move that might surprise some users, Microsoft has made Windows Subsystem for Linux open source.
-
Red Hat Releases RHEL 10 Early
Red Hat quietly rolled out the official release of RHEL 10.0 a bit early.
-
openSUSE Joins End of 10
openSUSE has decided to not only join the End of 10 movement but it also will no longer support the Deepin Desktop Environment.
-
New Version of Flatpak Released
Flatpak 1.16.1 is now available as the latest, stable version with various improvements.
-
IBM Announces Powerhouse Linux Server
IBM has unleashed a seriously powerful Linux server with the LinuxONE Emperor 5.