Full-text search with Solr, Xapian, and Sphinx

Creating a list of 10 websites that discuss the latest Ubuntu release is simple: just use Google or another one of the popular web search engines. But if you host an information-packed website yourself and want to offer your own search function for it, you need a full-text search tool. Full-text search engines have other benefits for the user and developer. If you are building a custom application or DVD, for instance, you might want to include a full-text search tool to put important information at the user's fingertips. Full-text search delves the depths of random or systematically arranged data for one or more search terms. You will want the search results sorted by relevance, and you will want the results in a split second.

Luckily, admins and developers need not reinvent the wheel: Solr, Xapian, and Sphinx are open source projects that index and analyze data. But how do you define data? You can roughly distinguish two states in which the search engines find information: structured and unstructured.

Structured data has a fixed, predefined structure that allows it to be easily recognized, categorized, and processed with the help of applications. The most common form of structured data is a relational database, with data organized in rows and columns that, in turn, are connected in the form of tables. In contrast to this, unstructured data lacks a data model. Such data sets are often so ambiguous that a program cannot simply process them because the data, facts, and figures are totally mixed. Unstructured data is the domain of search engines that can at least arrange the chaotic data semantically.

[...]

Use Express-Checkout link below to read the full article (PDF).

Buy this article as PDF

Download Article PDF now with Express Checkout

Price $2.95
(incl. VAT)

Buy Linux Magazine

SINGLE ISSUES

Print Issues

Digital Issues

SUBSCRIPTIONS

Print Subscriptions

Digital Subscriptions

Support Our Work

Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News

Is AI Coming to Your Ubuntu Desktop?

Artificial Inte... , Operating Systems , Ubuntu

According to the VP of Engineering at Canonical, AI could soon be added to the Ubuntu desktop distribution.
Framework Laptop 13 Pro Competes with the Best

Hardware , laptop , Linux

Framework has released what might be considered the MacBook of Linux devices.
The Latest CachyOS Features Supercharged Kernel

Arch Linux , CachyOS , Operating Systems

The latest release of CachyOS brings with it an enhanced version of the latest Linux kernel.
Kernel 7.0 Is a Bit More Rusty

Kernel , Performance , Rust

Linux kernel 7.0 has been released for general availability, with Rust finally getting its due.
France Says "Au Revoir" to Microsoft

Digital Soverei... , Linux , open source

In a move that should surprise no one, France announced plans to reduce its reliance on US technology, and Microsoft Windows is the first to get the boot.
CIQ Releases Compatibility Catalog for Rocky Linux

Enterprise Linux , Linux , Rocky Linux

The company behind Rocky Linux is making an open catalog available to developers, hobbyists, and other contributors, so they can verify and publish compatibility with the CIQ lineup.
KDE Gets Some Resuscitation

KDE , Linux , Plasma

KDE is bringing back two themes that vanished a few years ago, putting a bit more air under its wings.
Ubuntu 26.04 Beta Arrives with Some Surprises

Games , graphics , Ubuntu

Ubuntu 26.04 is almost here, but the beta version has been released, and it might surprise some people.
Ubuntu MATE Dev Leaving After 12 years

projects , Ubuntu , Ubuntu MATE

Martin Wimpress, the maintainer of Ubuntu MATE, is now searching for his successor. Are you the next in line?
Kali Linux Waxes Nostalgic with BackTrack Mode

Kali Linux , Operating Systems , penetration tes...

For those who've used Kali Linux since its inception, the changes with the new release are sure to put a smile on your face.

Full-text search with Solr, Xapian, and Sphinx

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters

Support Our Work

News

Is AI Coming to Your Ubuntu Desktop?

Framework Laptop 13 Pro Competes with the Best

The Latest CachyOS Features Supercharged Kernel

Kernel 7.0 Is a Bit More Rusty

France Says "Au Revoir" to Microsoft

CIQ Releases Compatibility Catalog for Rocky Linux

KDE Gets Some Resuscitation

Ubuntu 26.04 Beta Arrives with Some Surprises

Ubuntu MATE Dev Leaving After 12 years

Kali Linux Waxes Nostalgic with BackTrack Mode

Full-text search with Solr, Xapian, and Sphinx

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters Find Linux and Open Source Jobs Subscribe to our ADMIN Newsletters

Support Our Work

News

Tag Cloud

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters