Perl script monitors visitor statistics for YouTube movies
Machine Learning Lite
Line 16 of Listing 5 iterates over all the movies to be monitored and runs a SQL query for each of them to retrieve the historic viewing numbers sorted in ascending order of time of measurement. It dumps these y values into the @yvalues
array and lets the matching x values start at 1 and grow by 1 for each measurement day. In $row[0]
, the first element of each row of the SQL result, you can see the view count for the movie currently processed at the point in time the measurement was taken.
Lines 34 and 35 then remove the latest (today's) measured value so that the script only applies the regression rule against the measuring points that are more than one day in the past. The coefficients()
method in line 42 finally returns the computed $intercept
value (y offset at x=0) and the gradient of the straight line in the $slope
variable. Line 44 uses this to interpolate the value for the last measurement in $y_predicted
; in other words, it determines what you would expect if the average number of viewers continued to increase as it always had.
If the difference to the actual measured value is three times the magnitude of a linear increase, the movie is obviously going viral, and the user is notified by a text message. You need to experiment with this factor; some people might want to learn about even minor successes, so you would use a smaller factor than 3
in line 48 instead.
For more convenience, instead of the print
instruction, you might want to send an HTML email by using, for example, the CPAN Mail::DWIM module, which needs just one line to do that. The recipient would then be able to run their new hit movie directly by clicking on the displayed link.
Infos
- Listings for this article: ftp://ftp.linux-magazin.com/pub/listings/magazine/171
- Lantz, Brett. Machine Learning with R. Packt Publishing, 2013
- Linear regression: http://en.wikipedia.org/wiki/Linear_Regression
« Previous 1 2
Buy this article as PDF
(incl. VAT)
Buy Linux Magazine
Direct Download
Read full article as PDF:
Price $2.95
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Find SysAdmin Jobs
News
-
Escuelas Linux 8.0 is Now Available
Just in time for its 25th anniversary, the developers of Escuelas Linux have released the latest version.
-
LibreOffice 7.5 has Arrived Loaded with New Features and Improvements
The favorite office suite of the Linux community has a new release that includes some visual refreshing and new features across all modules.
-
The Next Major Release of Elementary OS Has Arrived
It's been over a year since the developers of elementary OS released version 6.1 (Jólnir) but they've finally made their latest release (Horus) available with a renewed focus on the user.
-
KDE Plasma 5.27 Beta Is Ready for Testing
The latest beta iteration of the KDE Plasma desktop is now available and includes some important additions and fixes.
-
Netrunner OS 23 Is Now Available
The latest version of this Linux distribution is now based on Debian Bullseye and is ready for installation and finally hits the KDE 5.20 branch of the desktop.
-
New Linux Distribution Built for Gamers
With a Gnome desktop that offers different layouts and a custom kernel, PikaOS is a great option for gamers of all types.
-
System76 Beefs Up Popular Pangolin Laptop
The darling of open-source-powered laptops and desktops will soon drop a new AMD Ryzen 7-powered version of their popular Pangolin laptop.
-
Nobara Project Is a Modified Version of Fedora with User-Friendly Fixes
If you're looking for a version of Fedora that includes third-party and proprietary packages, look no further than the Nobara Project.
-
Gnome 44 Now Has a Release Date
Gnome 44 will be officially released on March 22, 2023.
-
Nitrux 2.6 Available with Kernel 6.1 and a Major Change
The developers of Nitrux have officially released version 2.6 of their Linux distribution with plenty of new features to excite users.