Detecting spam users automatically with a neural network

Spam Stopper

© Lead Image © Kirsty Pargeter, 123RF.com

Article from Issue 195/2017

Author(s): Chris Hinze

Build a neural network that uncovers spam websites.

Website builders – online hosting services that provide tools for non-technical users to build their own websites – are frequently exploited by spammers looking for a convenient launching pad. Checking thousands, or sometimes millions, of web pages manually to look for evidence of a spammer is both tedious and inefficient.

In this article, I show how to build a suitable spam-searching neural network with help from Google's TensorFlow machine learning library [2] [3] and TFLearn [4], a library with a high-level API for TensorFlow. Even if you don't spend your days searching for spammers, the techniques described in this article will give you some insights on how to harness the power of neural networks for other complex problems.

Training Day

The neural network needs both positive and negative samples in order to learn. This solution starts with a manually compiled list of sample users divided into spammers and legitimate users, taking care to distribute both types in equal numbers. Alongside this classification (spammer or not spammer), the data set contained the user's name or the website that belongs to the user, the IP address with which the site is registered, and the language version associated with the site.

[...]

Use Express-Checkout link below to read the full article (PDF).

Buy this article as PDF

Download Article PDF now with Express Checkout

Price $2.95
(incl. VAT)

Buy Linux Magazine

SINGLE ISSUES

Print Issues

Digital Issues

SUBSCRIPTIONS

Print Subscriptions

Digital Subscriptions

Support Our Work

Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

News

New Linux Flaw Lets Attackers Escape VMs

RHEL , Security , vulnerability

A 16-year-old vulnerability allows an attacker to escape a virtual machine, gain access to the host, and execute malicious code.
Hannah Montana Linux Is Back!

DEBIAN , Kubuntu , Plasma

Developer Noah Cagle decided the world needed the once obscure but beloved Linux distribution and gave it a decidedly pink refresh.
System76 Refreshes the Lemur Laptop

Hardware , laptop

If you're looking for a laptop with tons of power and battery, look no further than the latest iteration of the System76 Lemur Pro.
More than 43 Million Lines of Code in Linux Kernel 7.2

Kernel , Linux

Using the cloc utility, Michael Larabel of Phoronix discovered that Linux kernel 7.2 has over 43 million lines of code.
Kubuntu Focus Goes Ultra

Hardware , Kubuntu , laptop

The Kubuntu Focus team has upped the performance ante of its M2 and Zr laptops with the latest, greatest CPUs from Intel.
Linux Gamers May Soon See Less Mouse Lag in KDE Plasma

Games , KDE , Plasma

Gamers using KDE’s Plasma desktop have been suffering from a slight input delay in mouse movement that could lead to getting fragged.
Three Lines of Code Improve Linux Storage Performance

Kernel , Performance , Storage

A developer changed three lines of code, giving Linux storage performance a 5% bump.
AUR Hit Again with Malicious Packages

Arch Linux , Security

Once again the Arch User Repository is plagued by a high volume of malicious packages.
Alpine Linux 3.24 Features Fresh Desktops and a Newer Kernel

Alpine Linux , Gnome , Plasma , Security

If you're a fan of Alpine Linux, it's time to upgrade because the latest version has been released with KDE Plasma 6.6, Gnome 50, and Linux kernel 6.18 LTS.
EU Open Source Strategy Plays Key Role in Tech Sovereignty Package

EU , government , open source

Comprehensive measures adopted by the European Commission aim to reduce dependency on non-EU countries.

Detecting spam users automatically with a neural network

Spam Stopper

Training Day

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters

Support Our Work

News

New Linux Flaw Lets Attackers Escape VMs

Hannah Montana Linux Is Back!

System76 Refreshes the Lemur Laptop

More than 43 Million Lines of Code in Linux Kernel 7.2

Kubuntu Focus Goes Ultra

Linux Gamers May Soon See Less Mouse Lag in KDE Plasma

Three Lines of Code Improve Linux Storage Performance

AUR Hit Again with Malicious Packages

Alpine Linux 3.24 Features Fresh Desktops and a Newer Kernel

EU Open Source Strategy Plays Key Role in Tech Sovereignty Package

Detecting spam users automatically with a neural network

Spam Stopper

Training Day

Buy this article as PDF

Buy Linux Magazine

Related content

Subscribe to our Linux Newsletters Find Linux and Open Source Jobs Subscribe to our ADMIN Newsletters

Support Our Work

News

Tag Cloud

Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters