Calculating Probability

Moose Gimmick

Listing 2 uses the CPAN Moose module, which saves me from having to write the Distrib constructor by hand. Instead, the code uses has (line 3) to declare and initialize the class attributes. With a single attribute, this is little more than a neat gimmick; with more attributes, however, the code would be much leaner than a hand-written class.
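As a rough illustration of what such a has declaration can look like, here is a minimal sketch. It assumes Moose is installed from CPAN; the attribute options shown (is, isa, default) are standard Moose options, but which of them Listing 2 actually uses is my assumption.

```perl
#!/usr/bin/perl
use strict;
use warnings;

{
    package Distrib;
    use Moose;    # CPAN module; supplies new() and the has keyword

    # One attribute: a hash reference mapping hypothesis => probability.
    # The default is a code reference so that each object gets its own hash.
    # (These options are an assumption, not copied from Listing 2.)
    has 'values' => (
        is      => 'rw',
        isa     => 'HashRef',
        default => sub { {} },
    );
}

# Moose has generated both the constructor and the values() accessor.
my $dist = Distrib->new;
$dist->values->{RR} = 1 / 3;
print join( ", ", keys %{ $dist->values } ), "\n";    # prints "RR"
```

No new() method appears anywhere in the package; that is the part Moose takes off the programmer's hands.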

The set() method expects the name of a hypothesis (e.g., BB) and its a priori probability and stores the pair in the object's internal values hash. To multiply the value of a hypothesis by a constant, the caller passes the hypothesis name and the factor to mult(), which looks up the previously stored value in values and multiplies it by the factor passed in as $prob.

The normalize() method iterates over all values previously inserted into the hash, adds them up in $sum, and then divides each value by the sum. After a multiplication, the probability values therefore add up to 1 again, and each value can be interpreted as a probability between 0 and 1. At the bottom of the module, prob() retrieves the value for the requested hypothesis from the hash via the values() method. Moose generates this accessor automatically, and it gives prob() access to the value stored under the key of the desired hypothesis.
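To make the interplay of the four methods concrete, here is a minimal sketch of a class that behaves as described. It is not the original Listing 2: it assumes Moose is installed, and the attribute options, the guard against a zero sum, and the three-card usage at the bottom are my own assumptions.

```perl
#!/usr/bin/perl
use strict;
use warnings;

{
    package Distrib;
    use Moose;    # assumes the CPAN Moose module is installed

    has 'values' => ( is => 'rw', isa => 'HashRef', default => sub { {} } );

    # Store the a priori probability of a hypothesis.
    sub set {
        my ( $self, $hypo, $prob ) = @_;
        $self->values->{$hypo} = $prob;
    }

    # Multiply the stored value of a hypothesis by $prob.
    sub mult {
        my ( $self, $hypo, $prob ) = @_;
        $self->values->{$hypo} *= $prob;
    }

    # Rescale all values so that they add up to 1 again.
    sub normalize {
        my ($self) = @_;
        my $v   = $self->values;
        my $sum = 0;
        $sum += $v->{$_} for keys %$v;
        return if $sum == 0;    # guard added for this sketch
        $v->{$_} /= $sum for keys %$v;
    }

    # Look up the current probability of a hypothesis.
    sub prob {
        my ( $self, $hypo ) = @_;
        return $self->values->{$hypo};
    }
}

my $dist = Distrib->new;
$dist->set( $_, 1 / 3 ) for qw(RR RB BB);    # assumed: three equally likely cards

# A red side is showing: weight each hypothesis by P(red | card).
$dist->mult( "RR", 1 );
$dist->mult( "RB", 0.5 );
$dist->mult( "BB", 0 );
$dist->normalize;

printf "%s: %.3f\n", $_, $dist->prob($_) for qw(RR RB BB);
```

After the three mult() calls, the values add up to only 1/2; normalize() divides each of them by that sum, which yields 0.667, 0.333, and 0.000.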

More Abstraction

If you perform several such tests for various problems, you will identify a pattern: After establishing the hypotheses, the probabilities of all defined hypotheses are always multiplied by the likelihood of newly incoming data. It therefore makes sense (as shown in Think Bayes [3]) to define a class named HypoTest that is derived from Distrib, much as in Listing 3. The class uses an update() method to update the values for all hypotheses, based on the probabilities of incoming data.

Listing 3: HypoTest.pm

HypoTest also relies on classes derived from it (e.g., CardHypoTest in Listing 4) to override the abstract likelihood() method and return the value for P(D|H), that is, the probability of the newly available data D under the assumption that hypothesis H is true.

Listing 4: hypotest

The HypoTest framework in Listing 3 calls the likelihood() method repeatedly to obtain the data probabilities under the assumption of individual hypotheses before storing them in the Distrib distribution. The framework further provides a print() method, which is used to output the values of all the updated probabilities for each hypothesis.
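The division of labor between the framework and a derived class might look roughly like the following sketch. It assumes Moose is installed and is not the original Listings 3 and 4: the compact Distrib stand-in, the method bodies, and the uniform starting probabilities are my assumptions, kept in one file so that the example runs by itself.

```perl
#!/usr/bin/perl
use strict;
use warnings;

{
    package Distrib;    # compact stand-in for the class described earlier
    use Moose;
    has 'values' => ( is => 'rw', isa => 'HashRef', default => sub { {} } );

    sub set  { my ( $self, $h, $p ) = @_; $self->values->{$h} = $p }
    sub mult { my ( $self, $h, $p ) = @_; $self->values->{$h} *= $p }
    sub prob { my ( $self, $h ) = @_; return $self->values->{$h} }

    sub normalize {
        my ($self) = @_;
        my $v   = $self->values;
        my $sum = 0;
        $sum += $v->{$_} for keys %$v;
        return if $sum == 0;
        $v->{$_} /= $sum for keys %$v;
    }
}

{
    package HypoTest;
    use Moose;
    extends 'Distrib';

    # Derived classes must override this and return P(D|H).
    sub likelihood { die "likelihood() must be overridden in a subclass\n" }

    # Multiply every hypothesis by the likelihood of the new data,
    # then rescale so that the values add up to 1 again.
    sub update {
        my ( $self, $data ) = @_;
        for my $hypo ( keys %{ $self->values } ) {
            $self->mult( $hypo, $self->likelihood( $data, $hypo ) );
        }
        $self->normalize;
    }

    # Output one "hypothesis: probability" line per hypothesis.
    sub print {
        my ($self) = @_;
        for my $hypo ( sort keys %{ $self->values } ) {
            print "$hypo: ", $self->prob($hypo), "\n";
        }
    }
}

{
    package CardHypoTest;    # illustrative subclass for the card problem
    use Moose;
    extends 'HypoTest';

    # Share of the card's two sides that show the observed color.
    sub likelihood {
        my ( $self, $data, $hypo ) = @_;
        my $count = () = $hypo =~ /\Q$data\E/g;
        return $count / 2;
    }
}

my $test = CardHypoTest->new;
$test->set( $_, 1 / 3 ) for qw(RR RB BB);    # assumed uniform priors
$test->update("R");                           # a red side was observed
$test->print;
```

The point of the design is that update() and print() never need to change; a new problem only requires a new subclass with its own likelihood().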

Testing Hypotheses

In Listing 4, likelihood() accepts the letter R from the main program as $data, which records a drawn card with a red front as an additional condition. Then, based on the hypothesis also passed in (RR, RB, BB), it computes how likely it is that the test candidate will look at a red surface: 1 (i.e., 100 percent) for RR, 0.5 for RB, and 0 for BB.

For this calculation, the function uses a regular expression to count the number of Rs in the hypothesis and divides the result by 2. Perl's division operator returns a floating-point result unless the integer pragma is in effect, so the remainder is not discarded and RB yields 0.5.
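One way to count matches with a regular expression is the list-assignment idiom in the following sketch. It is written as a plain function rather than a method, and whether Listing 4 uses this particular idiom is an assumption on my part.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Returns P(data | hypothesis) for a two-sided card: the share of
# sides whose color letter matches the observed one.
sub likelihood {
    my ( $data, $hypo ) = @_;

    # Assigning the list of matches to () in scalar context
    # yields the number of matches.
    my $count = () = $hypo =~ /\Q$data\E/g;

    # Perl's / is floating-point unless "use integer" is in effect.
    return $count / 2;
}

printf "%s: %s\n", $_, likelihood( "R", $_ ) for qw(RR RB BB);
# prints:
# RR: 1
# RB: 0.5
# BB: 0
```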

Finally, hypotest outputs the probabilities for all hypotheses in the distribution using the print() method from the HypoTest module and correctly reports that the red-red card will appear in 2/3 of all cases:

$ ./hypotest
RB: 0.333333333333333
RR: 0.666666666666667
BB: 0

In other words, the test candidate, who has just drawn a card with a red front and now turns it over, will see a red back with a probability of 2/3.
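The same result can be checked by hand with Bayes' theorem. The worked calculation below assumes that each of the three cards is equally likely to be drawn (a prior of 1/3 each) and uses the likelihoods 1, 0.5, and 0 given above.

```latex
\begin{align*}
P(RR \mid R)
  &= \frac{P(RR)\,P(R \mid RR)}
          {P(RR)\,P(R \mid RR) + P(RB)\,P(R \mid RB) + P(BB)\,P(R \mid BB)} \\
  &= \frac{\tfrac{1}{3} \cdot 1}
          {\tfrac{1}{3} \cdot 1 + \tfrac{1}{3} \cdot \tfrac{1}{2} + \tfrac{1}{3} \cdot 0}
   = \frac{1/3}{1/2} = \frac{2}{3}
\end{align*}
```

The denominator is the sum that normalize() computes, and the numerator is the value that mult() leaves behind for RR.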
