A Bayesian average is a method of estimating the mean of a population using outside information, especially a pre-existing belief, which is factored into the calculation

Assisted Mindfulness

Last update: Oct 19, 2022

Related tags

Miscellaneous bayesian-average

Overview

Bayesian Average

A Bayesian average is a method of estimating the mean of a population using outside information, especially a pre-existing belief, which is factored into the calculation. To understand if the Bayesian average is right for you, let's look at two ways to rank star ratings:

Use an arithmetic average that adds together all ratings and divides them by the total quantity of ratings. If there are 100 1-star ratings and 10 5-star ratings, the calculation is ((100x1) + (10x5))/ (100+10) = 1.36.
Use a Bayesian average that adjusts a product’s average rating by how much it varies from the catalog average. This favors products with a higher quantity of ratings. As already suggested, ignoring the quantity of ratings doesn’t help distinguish between items with 10 ratings and 1000 ratings. You need to at least calculate an average that includes the quantity of ratings.

The following image shows three items ranked by different averages. The left side uses the arithmetic average for ranking. The right side uses the Bayesian average.

Both sides display the arithmetic average in parenthesis just right of the stars. They also display the average used for ranking as avg_star_rating and bayes_avg respectively, under each item.

By putting Item A at the top, the left side’s ranking is both misleading and unsatisfying. The ranking on the right, based on the Bayesian average, reflects a better balance of rating and quantity of ratings. This example shows how the Bayesian average lowered item A’s average to 4.3 because it measured A’s 10 ratings against B and C’s much larger numbers of ratings. As described later, the Bayesian average left Items B and C unchanged because the Bayesian average affects items with low rating counts much more then those that have more ratings.

In sum, by relativizing ratings in this way, the Bayesian average creates a more reliable comparison between products. It ensures that products with lower numbers of ratings have less weight in the ranking.

Understanding the Bayesian average

The Bayesian average adjusts the average rating of products whose rating counts fall below a threshold. Suppose the threshold amount is calculated to be 100. That means average ratings with less than 100 ratings get adjusted, while average ratings with more than 100 ratings change only very slightly. This threshold amount of 100 is called a confidence number, because it gives you confidence that averages with 100 or more ratings are more reliable than averages with less than 100 ratings.

This confidence number derives from the catalog’s distribution of rating counts and the average rating of all products. By factoring in ratings counts and averages from the whole catalog, the Bayesian average has the following effect on an item’s individual average rating:

For an item with a fewer than average quantity of ratings, the Bayesian average lowers its artificially high rating by weighing it down (slightly) to the lower catalog average.
For an item with a lot of ratings (that is, more than the threshold), the Bayesian average doesn’t change its rating average by a significant amount.

Installation

You can install the Bayesian average into your project using the Composer package manager:

composer require assisted-mindfulness/bayesian-average

Usage

Create an instance of the AssistedMindfulness\BayesianAverage\BayesianAverage class.

// Item with a large number of ratings
$itemLargeRating = collect(range(0, 500))->transform(fn () => random_int(4, 5));
$itemLargeRatingAverage = $itemLargeRating->avg();
$itemLargeRatingCount = $itemLargeRating->count();

$c = 100; // Confidence number
$m = 3.5; // Catalog average

$bayes = new BayesianAverage();
$bayes
    ->setConfidenceNumber($c)
    ->setAverageRatingOfAllElements($m);

$bayes->getAverage($itemLargeRatingAverage, $itemLargeRatingCount); // ~4.3

The confidence number is set by one of the following methods:

setConfidenceNumber(int|float $confidenceNumber): will set the passed argument to a confidence number.
setConfidenceNumberForEvenOrOdd(int $count, callable $even, callable $odd): In case of an even $count (number of elements), will set the result of executing $even as a confidence number; if $count is odd, the confidence mean will be set to the result of running $odd.

Example

Computing the Bayesian mean using an array as an example.

[5, 4, 3, 4, 3, 2, 4, 3], 'ratings_count' => 8, ], [ 'name' => "Item B", "ratings" => [4, 5, 5, 5, 5, 5, 5, 5, 4], 'ratings_count' => 9, ], [ 'name' => "Item C", "ratings" => [5], 'ratings_count' => 1, ], ]); $allRatingsCount = $data->sum('ratings_count'); $sum = $data->sum(fn($item) => array_sum($item['ratings'])); $bayes = new BayesianAverage($allRatingsCount, $sum); $bayes->setConfidenceNumber(1); $average = array_sum($data[0]['ratings']) / count($data[0]['ratings']); // 3.5 $bayes_avg = $bayes->getAverage($average, count($data[0]['ratings'])); // 3.5802469135802 ">

$data = collect([
    [
        'name'          => "Item A",
        "ratings"       => [5, 4, 3, 4, 3, 2, 4, 3],
        'ratings_count' => 8,
    ],
    [
        'name'          => "Item B",
        "ratings"       => [4, 5, 5, 5, 5, 5, 5, 5, 4],
        'ratings_count' => 9,
    ],
    [
        'name'          => "Item C",
        "ratings"       => [5],
        'ratings_count' => 1,
    ],
]);

$allRatingsCount = $data->sum('ratings_count');
$sum = $data->sum(fn($item) => array_sum($item['ratings']));

$bayes = new BayesianAverage($allRatingsCount, $sum);

$bayes->setConfidenceNumber(1);

$average = array_sum($data[0]['ratings']) / count($data[0]['ratings']); // 3.5
$bayes_avg = $bayes->getAverage($average, count($data[0]['ratings'])); // 3.5802469135802

Consider using the setConfidenceNumberForEvenOrOdd method. Let's set the lower quartile as the confidence number.

$bayes = new BayesianAverage($allRatingsCount, $sum);

$bayes->setConfidenceNumberForEvenOrOdd($data->count(), function ($position) use ($data) {
    $item = $data->sortBy('ratings_count')->values()->get($position / 2);

    return $item['ratings_count'];
}, function ($position) use ($data) {
    $item1 = $data->sortBy('ratings_count')->values()->get(($position + 1) / 2);
    $item2 = $data->sortBy('ratings_count')->values()->get(($position - 1) / 2);

    return ($item1['ratings_count'] + $item2['ratings_count']) / 2;
});

$data->each(function ($item) use ($bayes, $sum, $allRatingsCount) {
    $average = array_sum($item['ratings']) / count($item['ratings']);
    $bayes_avg = $bayes->getAverage($average, count($item['ratings']));
    
    printf('Average = %s, Bayesian  average = %s', $average, $bayes_avg);
});

/*
 *Average = 3.5, Bayesian  average = 3.5802469135802 
 *Average = 4.7777777777778, Bayesian  average = 4.7222222222222 
 *Average = 5, Bayesian  average = 4.6111111111111 
 */

License

The MIT License (MIT). Please see License File for more information.

You might also like...

This package makes it easy to add early access mode to your existing application.

This package makes it easy to add early access mode to your existing application. This is useful for when you want to launch a product and need to gat

174 Nov 26, 2022

Guest to Customer for Magento2 - Quickly and easily convert existing guest checkout customers to registered customers.

Guest to Customer for Magento 2.0 For Magento 2.0.x, 2.1.x, 2.2.x, 2.3.x and 2.4.x In general E-commerce, shoppers do not like to create an account du

66 Oct 7, 2022

A Composer Package which installs the PhantomJS binary (Linux, Windows, Mac) into /bin of your project.

phantomjs-installer A Composer package which installs the PhantomJS binary (Linux, Windows, Mac) into /bin of your project. Table of Contents Installa

149 Nov 8, 2022

Adds a header to every response to try and twart Google's usage of your site in it's FLoC tracking method.

Laravel No FLoC This package will add the Permissions-Policy: interest-cohort=() to try and twart Google's usage of your site in it's FLoC tracking me

11 Jul 14, 2022

Easily decorate your method calls with laravel-decorator package

🎄 Laravel Decorator Decorator pattern in laravel apps Made with ❤️ for smart clean coders A try to port "decorator" feature from python language to l

126 Jan 1, 2023

With the help of the Laravel eCommerce CashU Payment Gateway, the admin can integrate the CashU payment method in the Bagisto store.

Introduction Bagisto CashU Payment add-on allow customers to pay for others using CashU payment gateway. Requirements: Bagisto: v1.3.2 Installation wi

2 Aug 22, 2022

Symfony bundle for class/method memoization

Symfony service memoization bundle This bundle provides memoization for your services - every time you call the same method with the same arguments a

16 Oct 31, 2022

Shortest Path - have a function ShortestPath (strArr) take strArr which will be an array of strings which models a non-looping Graph.

Have the function ShortestPath(strArr) take strArr which will be an array of strings which models a non-looping Graph

1 Feb 5, 2022

A research raw data repository for researchers of Arba Minch University built using Codeigniter which follows MVC architecture. The front-end is build using Bootstrap.

Arba Minch University Dataset Repository This system is a research dataset repository for Arba Minch University researchers and is build using Codeign

8 Jul 1, 2022

Releases(1.0.0)

1.0.0(Sep 19, 2022)
What's Changed

Added main class, tests, updated description by @SadElephant in https://github.com/Assisted-Mindfulness/bayesian-average/pull/1

New Contributors

@SadElephant made their first contribution in https://github.com/Assisted-Mindfulness/bayesian-average/pull/1

Full Changelog: https://github.com/Assisted-Mindfulness/bayesian-average/commits/1.0.0
Source code(tar.gz)
Source code(zip)

A Bayesian average is a method of estimating the mean of a population using outside information, especially a pre-existing belief, which is factored into the calculation

Related tags

Overview

Bayesian Average

Understanding the Bayesian average

Installation

Usage

Example

License

You might also like...

This package makes it easy to add early access mode to your existing application.

Guest to Customer for Magento2 - Quickly and easily convert existing guest checkout customers to registered customers.

A Composer Package which installs the PhantomJS binary (Linux, Windows, Mac) into /bin of your project.

Adds a header to every response to try and twart Google's usage of your site in it's FLoC tracking method.

Easily decorate your method calls with laravel-decorator package

With the help of the Laravel eCommerce CashU Payment Gateway, the admin can integrate the CashU payment method in the Bagisto store.

Symfony bundle for class/method memoization

Shortest Path - have a function ShortestPath (strArr) take strArr which will be an array of strings which models a non-looping Graph.

A research raw data repository for researchers of Arba Minch University built using Codeigniter which follows MVC architecture. The front-end is build using Bootstrap.

Releases(1.0.0)

1.0.0(Sep 19, 2022)

What's Changed

New Contributors

Owner

Assisted Mindfulness

A sampling profiler for PHP written in PHP, which reads information about running PHP VM from outside of the process.

A composer plugin, to install differenty types of composer packages in custom directories outside the default composer default installation path which is in the vendor folder.

Enable method chaining or fluent expressions for any value and method.

Simplifies GST calculation and operations

CRUD Build a system to insert student name information, grade the class name, and edit and delete this information

Add scalar type hints and return types to existing PHP projects using PHPDoc annotations

A Magento community sourced security pre-flight checklist.

Install an execute script of specify quality tools to your git pre-commit hook, and it executes only for changed files

Simple user settings facade for Hyperf. Settings are stored as JSON in a single database column, so you can easily add it to an existing table.