The fundamentals of an HPC cluster

The King in Alice's Adventures in Wonderland said it best, "Begin at the beginning …." The general goal of HPC is either to run applications faster or to run problems that can't or won't run on a single server. To do this, you need to run parallel applications across separate nodes. Although you could use a single node and then create two VMs, it's important to understand how applications run across physically different servers and how you administer a system of disparate physical hardware.

With this goal in mind, you can make some reasonable assumptions about the HPC system. If you are interested in parallel computing using multiple nodes, you need at least two separate systems (nodes), each with its own operating system (OS). To keep things running smoothly, the OS on both nodes should be identical. (Strictly speaking, it doesn't have to be this way, but otherwise, it is very difficult to run and maintain.) If you install a package on node 1, then it needs to be installed on node 2 as well. This lessens a source of possible problems when you have to debug the system.

The second thing your cluster needs is a network to connect the nodes so they can communicate to share data, the state of the solution to the problem, and possibly even the instructions that need to be executed. The network can theoretically be anything that allows communication between nodes, but the easiest solution is Ethernet. In this article, I am initially going to consider a single network, but later I will consider more than one.

[...]

Use Express-Checkout link below to read the full article (PDF).

Buy this article as PDF

Express-Checkout as PDF

Price $2.95
(incl. VAT)

Buy Linux Magazine

SINGLE ISSUES

Print Issues

Digital Issues

SUBSCRIPTIONS

Print Subs

Digisubs

TABLET & SMARTPHONE APPS

US / Canada

UK / Australia

Support Our Work

Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News

Another Linux Malware Discovered

Linux , malware , Virtualization

Russian hackers use Hyper-V to hide malware within Linux virtual machines.
TUXEDO Computers Announces a New InfinityBook

Hardware , laptop , Linux

TUXEDO Computers is at it again with a new InfinityBook that will meet your professional and gaming needs.
SUSE Dives into the Agentic AI Pool

Artificial Inte... , Enterprise Linux , monitoring

SUSE becomes the first open source company to adopt agentic AI with SUSE Enterprise Linux 16.
Linux Now Runs Most Windows Games

Games , Linux , Steam

The latest data shows that nearly 90 percent of Windows games can be played on Linux.
Fedora 43 Has Finally Landed

Fedora , Gnome , Operating Systems

The Fedora Linux developers have announced their latest release, Fedora 43.
KDE Unleashes Plasma 6.5

Flatpak , KDE , Plasma

The Plasma 6.5 desktop environment is now available with new features, improvements, and the usual bug fixes.
Xubuntu Site Possibly Hacked

Linux , Security , Xubuntu

It appears that the Xubuntu site was hacked and briefly served up a malicious ZIP file from its download page.
LMDE 7 Now Available

Cinnamon , DEBIAN , Linux mint

Linux Mint Debian Edition, version 7, has been officially released and is based on upstream Debian.
Linux Kernel 6.16 Reaches EOL

Kernel , Linux

Linux kernel 6.16 has reached its end of life, which means you'll need to upgrade to the next stable release, Linux kernel 6.17.
Amazon Ditches Android for a Linux-Based OS

Linux , Operating Systems , Tools

Amazon has migrated from Android to the Linux-based Vega OS for its Fire TV.

The fundamentals of an HPC cluster

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters

Support Our Work

News

Another Linux Malware Discovered

TUXEDO Computers Announces a New InfinityBook

SUSE Dives into the Agentic AI Pool

Linux Now Runs Most Windows Games

Fedora 43 Has Finally Landed

KDE Unleashes Plasma 6.5

Xubuntu Site Possibly Hacked

LMDE 7 Now Available

Linux Kernel 6.16 Reaches EOL

Amazon Ditches Android for a Linux-Based OS

The fundamentals of an HPC cluster

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters Find Linux and Open Source Jobs Subscribe to our ADMIN Newsletters

Support Our Work

News

Tag Cloud

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters