Easy steps for optimizing shell scripts
Computation
You have several different options for calculating the sum of numbers that a text file contains. In the example in Listing 14, every line containing the fiftieth
string is interesting. The script evaluates a file that contains one million lines. Every fiftieth
line contains the string.
Listing 14
Looking for a String
#!/bin/bash typeset -i sum=0 while read line; do set $line # Totaling field 6 sum=$((sum+$6)) # Parse output from grep done < <(grep " fiftieth " largefile) echo "Sum total: $sum"
Here, too, you can use Awk as a tool for quick summation (Listing 15). But Awk does not work directly in machine language. For this reason, it makes sense to pass the search for the character string to the grep
command and then add the lines found using Awk (Listing 16).
Listing 15
Faster with Awk
# time awk '/ fiftieth / {sum += $6} END {print "Sum total:", sum}' largefile
Listing 16
Fastest: Awk with grep
# time awk '{sum += $6} END {print "Sum total:", sum}' < <(grep " fiftieth " largefile)
In this example, too, optimization achieved significant speed gains. The variant from Listing 15 reduces the runtime to about one third; the variant from Listing 16 runs twenty times faster than the first alternative (see Table 3).
Table 3
Awk Timer
Category | |||
---|---|---|---|
real |
0m4.471s |
0m1.408s |
0m0.231s |
user |
0m2.374s |
0m1.348s |
0m0.050s |
sys |
0m1.956s |
0m0.013s |
0m0.010s |
Conclusions
The examples in this article show that you can drastically increase the speed of your scripts by skillfully using multifunctional tools such as Awk, Python, or Perl and by avoiding complex constructs with Tr, Sed, or Grep: You will thus consistently avoid many context changes.
However, not every anonymous pipe is detrimental to throughput, as the last example shows. Instead, it is more important to use the strengths of the various tools and keep the number of subprocesses as low as possible.
It is also important to remember that you can improve the readability and thus the maintainability of the scripts by doing without complex chains of commands.
« Previous 1 2
Buy this article as PDF
(incl. VAT)
Buy Linux Magazine
Direct Download
Read full article as PDF:
Price $2.95
News
-
Titan Linux is a New KDE Linux Based on Debian Stable
Titan Linux is a new Debian-based Linux distribution that features the KDE Plasma desktop with a focus on usability and performance.
-
Danielle Foré Has an Update for elementary OS 7
Now that Ubuntu 22.04 has been released, the team behind elementary OS is preparing for the upcoming 7.0 release.
-
Linux New Media Launches Open Source JobHub
New job website focuses on connecting technical and non-technical professionals with organizations in open source.
-
Ubuntu Cinnamon 22.04 Now Available
Ubuntu Cinnamon 22.04 has been released with all the additions from upstream as well as other features and improvements.
-
Pop!_OS 22.04 Has Officially Been Released
From the makers of some of the finest Linux-powered desktop and laptop computers on the market comes the latest version of their Ubuntu-based distribution, Pop!_OS 22.04.
-
Star Labs Unveils a New Small Format Linux PC
The Byte Mk I is an AMD-powered mini Linux PC with Coreboot support and plenty of power.
-
MX Linux Verison 21.1 “Wildflower” Now Available
The latest release of the systemd-less MX Linux is now ready for public consumption.
-
Microsoft Expands Their Windows Subsystem for Linux Offerings With AlmaLinux
Anyone who works with Windows Subsystem for Linux (WSL) will now find a new addition to the available distributions, one that’s become the front-runner replacement for CentOS.
-
Debian 11.3 Released wIth Numerous Bug and Security Fixes
The latest point release for Debian Bullseye is now available with some very important updates.
-
The First Alpha of Asahi Linux is Available
Asahi Linux is the first distribution to fully support Apple Silicon and is now available for testing.