Tool Predicts Which Websites Will be Compromised
Carnegie Mellon researchers say 3 million pages could fall down the phishing hole in the next year.
Researchers at Carnegie Mellon University have developed a means for predicting if a currently uncompromised website will become malicious before it happens. According to their results, nearly 3 million web pages are vulnerable to possible exploitation within the next year. Kyle Soska and Nicolas Christin used the Internet Archive, which periodically stores snapshots of large parts of the Internet, to comb through recent history and look for common traits of websites that become compromised by Internet attackers. According to a paper presented at the recent USENIX Security Symposium, the authors of the study “… manage[d] to achieve good detection accuracy over a one-year horizon; that is, we generally manage to correctly predict that currently benign websites will become compromised within a year.”
The authors employed an intelligent algorithm, using samples of malicious sites from blacklists such as PhishTank to train their system to recognize a compromised site. They then used the Internet Archive’s Wayback machine, which searches the state of the Internet at previous points in recent history, to look for common characteristics of these sites before they were compromised. The assessment ignored user-supplied content and focused on factors such as unpatched web services and site structure, as well as anomalies in web traffic. The system learned to identify vulnerable sites on the verge of becoming compromised three to 12 months in advance.
In theory, this method could help organizations find flaws in their sites that could eventually lead to compromise. Search engines could also use a version of this technique to warn users about possible vulnerable pages that appear on the search list, which would provide a big incentive for webmasters to put their sites in order.
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News
-
First Release Candidate for Linux Kernel 6.14 Now Available
Linus Torvalds has officially released the first release candidate for kernel 6.14 and it includes over 500,000 lines of modified code, making for a small release.
-
System76 Refreshes Meerkat Mini PC
If you're looking for a small form factor PC powered by Linux, System76 has exactly what you need in the Meerkat mini PC.
-
Gnome 48 Alpha Ready for Testing
The latest Gnome desktop alpha is now available with plenty of new features and improvements.
-
Wine 10 Includes Plenty to Excite Users
With its latest release, Wine has the usual crop of bug fixes and improvements, along with some exciting new features.
-
Linux Kernel 6.13 Offers Improvements for AMD/Apple Users
The latest Linux kernel is now available, and it includes plenty of improvements, especially for those who use AMD or Apple-based systems.
-
Gnome 48 Debuts New Audio Player
To date, the audio player found within the Gnome desktop has been meh at best, but with the upcoming release that all changes.
-
Plasma 6.3 Ready for Public Beta Testing
Plasma 6.3 will ship with KDE Gear 24.12.1 and KDE Frameworks 6.10, along with some new and exciting features.
-
Budgie 10.10 Scheduled for Q1 2025 with a Surprising Desktop Update
If Budgie is your desktop environment of choice, 2025 is going to be a great year for you.
-
Firefox 134 Offers Improvements for Linux Version
Fans of Linux and Firefox rejoice, as there's a new version available that includes some handy updates.
-
Serpent OS Arrives with a New Alpha Release
After months of silence, Ikey Doherty has released a new alpha for his Serpent OS.