Tools for generating regular expressions
Conclusions
Deriving regular expressions from existing text fragments and patterns helps to analyze and recognize similarities in more complex patterns. The tools I tested work well, but not always without error. Some tools generated regular expressions that were more generic than the text fragments warranted, so searches returned more matches than desired. In particular, these overly broad expressions can match strings that do not actually fit the intended pattern, resulting in false positives and a degree of fuzziness.
Regular expressions are complex, and the same text fragment can be mapped to a pattern in many different ways. Given the complexity of the task, the performance of these tools deserves credit. For these generators to become more useful in the future, however, greater precision would be desirable.
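The effect of an overly generic expression can be checked with a few lines of Python. The following is only a minimal sketch: the sample strings, both patterns, and the helper name false_positives are made up for illustration and are not the output of any of the tools tested. It compares a broad pattern with a tighter one against strings that should not match.

```python
import re

# Text fragments a generator might be given (hypothetical sample data).
samples = ["2021-04-01", "2022-12-31", "2023-01-15"]

# Strings that should NOT match a pattern derived from the samples.
negatives = ["hello-to-you", "12345678", "2021/04/01"]

# An overly generic pattern (illustrative, not real tool output).
generic = re.compile(r"[\w-]+")
# A tighter, hand-written pattern for the same samples.
specific = re.compile(r"\d{4}-\d{2}-\d{2}")


def false_positives(pattern, strings):
    """Return the strings that the pattern matches in full."""
    return [s for s in strings if pattern.fullmatch(s)]


# Both patterns accept every sample ...
assert all(generic.fullmatch(s) for s in samples)
assert all(specific.fullmatch(s) for s in samples)

# ... but only the generic one also accepts unwanted strings.
print(false_positives(generic, negatives))
print(false_positives(specific, negatives))
```

Running a generated expression against a small set of known non-matches in this way is one simple means of judging whether it is precise enough before using it in a search.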
Acknowledgement
The author would like to thank Axel Beckert and Arne Wichmann for their help and critical comments during the preparation of this article.
Infos
- Regular expression: https://en.wikipedia.org/wiki/Regular_expression
- grep everything: http://noone.org/blog/English/Computer/Shell/grep%20everything.futile
- xmlgrep: https://linux.die.net/man/1/xmlgrep
- "Search more efficiently with Ugrep" by Karsten Günther, Linux Magazine, issue 245, April 2021, https://www.linux-magazine.com/Issues/2021/245/Tracked-Down/(language)/eng-US
- Regular-Expressions.info: http://www.regular-expressions.info/
- Regex DB: https://rgxdb.com/
- Friedl, Jeffrey. Mastering Regular Expressions, O'Reilly Media, Inc., 2006: http://regex.info/book.html
- RegEx101: https://regex101.com/
- txt2re: http://www.txt2re.com/index_php3.html
- grex: https://github.com/pemistahl/grex
- regexgen: https://github.com/devongovett/regexgen
- Regex Generator by Olaf Neumann: https://regex-generator.olafneumann.org/
- rgxg: https://rgxg.github.io/
- txt2regex: https://aurelio.net/projects/txt2regex/
- Fink: https://www.finkproject.org/
- RegexGenerator: https://github.com/dbuhlbrown/Regex-Generator
- regex-generator-lib (Python): https://pypi.org/project/regex-generator-lib/
- tdda: http://www.tdda.info/
- RegExTractor: https://github.com/iuliux/RegExTractor