Mathematics AI SL's Sample Internal Assessment

Mathematics AI SL's Sample Internal Assessment

Investigation on the Variation of Probability of Getting the Maximum Share of a Red Packet in WeChat at a particular drawing based on five different raw data sets (each of width 30)

5/7
5/7
11 mins read
11 mins read
Candidate Name: N/A
Candidate Number: N/A
Session: N/A
Word count: 2,080

Table of content

Rationale

“Money won is twice as sweet as money earned.” A well-known proverb used in the gambling stands for the fact that the money earned in gambling or by the luck feels more exciting and exaggerating than the hard-earned money of business or service. In our last family trip to Hong Kong during the Chinese New Year, we have planned a stay in Macao for two nights. It was my first interaction with gambling when we visited the biggest casino in China – The Venetian Macao. I came across the power of gambling which may either take one to the height of lifestyle, or to the world of depression due to huge of debts. However, the game of roulette has significantly drawn my attention. It was my belief that there is a scientific approach, if followed, may lead to winning the game.

 

After my holiday, I went deeper into procedure of the game – Roulette. After studying about the game from various articles, research projects that were previously carried on prediction of correct calls, my interest took a shape of a project. It was in 2017, when I was introduced with concepts and applications of probability in mathematics. It has a significantly contribution behind the mathematical exploration of the analysis on determination of corrected calls in Roulette. My project went well but the Future Prospect of the project was analysis on Red Envelope of WeChat.

 

This was the moment when I came across the concept of Red Envelope. However, due to some circumstances further work was not carried on at that time. But recently my interest developed again on this field.

 

I have done a thorough research on the feature of Red Envelope on WeChat. I have gone through a number of research articles and journals where I learnt the algorithm which is followed by WeChat to offer the sum of money in each drawings of Red Envelope. Moreover, I have analyzed several data sets on disbursing cash in each drawing from several newspapers and news articles. However, the answer to the question which was partially answered in the last project on Roulette, was not answered this time, in case of Red Envelope – which term will disburse the maximum amount of money?

 

Heaped with worries, I decided to research and find the answer to my query. This IA is about the same.

Aim

The main motive of this exploration is to determine the variation in probability of determining the maximum share drawn at each drawing. This is to determine a correlation between the number of drawing from the Red Envelope and the amount drawn in each drawing so that any relationship can be derived which may lead to determination of term offering the maximum share.

Research question

What is the variation in probability of determining the maximum share of a Red Packet in WeChat at a particular drawing based on five different raw data sets (each of width 30)?

Background information

What is red envelope in reference with wechat

Red Envelope is a feature offered in Chinese multipurpose, social media, messaging and mobile payment app made by Tencent – WeChat. This feature acts as a metaphor to the famous tradition in China – gifting red envelope during any occasion, specially, Chinese New Year. Each envelope usually contains some amount of money which is gifted to the friends and family members as their love, affection, relationship and often as a vode of thanks. WeChat added a feature naming Red Envelope which offers the same, virtually.

 

Here, a person can send a red envelope with a fixed amount of money in Chinese Yuan (CNY) in a WeChat group. The app gives the user, the liberty to set his desired amount of money and the number of drawings that could be made. Once the settings are done and the envelope is sent, the other members of the group will be able to draw from the envelope. It should be noted that the money disbursed in each drawing is not pre-determined and works on an algorithm. Neither the user who sent the envelope, nor the other members of the group can pre-determine the amount of money disbursed in each drawing.

 

Once the number of drawings set by the user is reached, no more members can draw money from that red envelope and the envelope will be terminated. The money received by the members will be directly deposited in their bank accounts linked with WeChat. It should be noted that the total amount of money sent in the envelope will be disbursed only if the number of drawings is reached. The envelope remains valid only for a day. Thus, if the total number of drawings is not reached by the end of the day, the remaining amount of money is refunded to the user.

What is the procedure of distribution of share in red envelope?

As discussed in the previous sub-heading, the distribution of share is determined by an algorithm. The amount of money disbursed in each drawing ranges widely. There is a coefficient of maximum share which determines the range of money which can be disbursed in each drawing. Usually, this value is equal to 2.1. Thus, in case of each drawing, the amount of money to be disbursed ranges between 0.01 CNY and average residual amount of money times the coefficient of maximum disbursement. The average residual amount of money is defined as follows:

 

Average Residual = \(\frac{Residual\ Amount\ of\ Money}{Residual\ Time\ of\ Drawings}\)

What are the money limits per envelope and the number of members allowed to draw from each envelope?

The maximum amount of money that could be added in a red envelope is 200 CNY and the maximum number of drawings could be set to 100.

Basic concept of probability used in this IA

The probability of occurrence of an event is:

 

Probability (P)\(\frac{Number\ of\ Favourable\ Outcome}{Total\ Number\ of\ Sample\ Spaces}\)

Regression correlation coefficient

Regression correlation coefficient is a tool to measure the strength of the correlation between the independent variable and the dependent variable. The set of values (x1,y1), (x2,y2), (xn,yn) are used to find the value of r as stated by the formula below:

 

r\(\frac{n\big(\sum xy)-(\sum x)(\sum y)}{\sqrt{[n\sum x^2-\big(\sum x\big)^2][n\sum y^2-\big(\sum y\big)^2]}}\)

 

In the above-mentioned formula, x is the value of independent variable of each observation, y is the value of dependent variable of each observation, xy is the value of the product of the independent and the dependent variable of each observation, n is the number of observation and denotes the sum of all the observation of the mentioned variable.

 

By squaring the value of r, the value of the regression coefficient (r2) will be achieved. The value of r2 lies between 0 and 1 where 1 signifies maximum correlation whereas 0 signifies null correlation.

Chi squared test

Chi squared test is a kind of analysis which predicts the existence of any correlation between an independent variable and a dependent variable. The Chi squared value of any given set of data is firstly calculated. Now, based on the type of data, for example, paired data or independent data, the Chi squared value is checked in the Chi squared table which further predicts the existence of any correlation.

 

The formula of Chi squared value is given below:

 

X2 value = Σ \(\frac{(O_i-E_i)^2}{E_i}\)

 

Here, Oi is the observed value, Ei is the expected value, denotes the sum of all the observation of the mentioned variable.

 

Now, the Chi squared value is checked in Chi squared table which predicts the existence of any correlation. The Chi squared table is shown below:

df0.9950.990.9750.950.900.100.050.0250.010.005

1

------0.0010.0040.0162.7063.8415.0246.6357.879

2

0.0100.0200.0510.1030.2114.6055.9917.3789.21010.597

3

0.0720.1150.2160.3520.5846.2517.8159.34811.34512.838

4

0.2070.2970.4840.7111.0647.7799.48811.14313.27714.860

5

0.4120.5540.8311.1451.6109.23611.07012.83315.08616.750

6

0.6760.8721.2371.6352.20410.64512.59214.44916.81218.548

7

0.9891.2391.6902.1672.83312.01714.06716.01318.47520.278

8

1.3441.6462.1802.7333.49013.36215.50717.53520.09021.955

9

1.7352.0882.7003.3254.16814.68416.91919.02321.66623.589

10

2.1562.5583.2473.9404.86515.98718.30720.48323.20925.188

11

2.6033.0533.8164.5755.57817.27519.67521.92024.72526.757

12

3.0743.5714.4045.2266.30418.54921.02623.33726.21728.300

13

3.5654.1075.0095.8927.04219.81222.36224.73627.68829.819

14

4.0754.6605.6296.5717.79021.06423.68526.11929.14131.319

15

4.6015.2296.2627.2618.54722.30724.99627.48830.57832.801

16

5.1425.8126.9087.9629.31223.54226.29628.84532.00034.267

17

5.6976.4087.5648.67210.08524.76927.58730.19133.40935.718

18

6.2657.0158.2319.39010.86525.98928.86931.52634.80537.156

19

6.8447.6338.90710.11711.65127.20430.14432.85236.19138.582

20

7.4348.2609.59110.85112.44328.41231.41034.17037.56639.997

21

8.0348.89710.28311.59113.24029.61532.67135.47938.93241.401

22

8.6439.54210.98212.33814.04130.81333.92436.78140.28942.796

23

9.26010.19611.68913.09114.84832.00735.17238.07641.63844.181

24

9.88610.85612.40113.84815.65933.19636.41539.36442.98045.559

25

10.52011.52413.12014.61116.47334.38237.65240.64644.31446.928

26

11.16012.19813.84415.37917.29235.56338.88541.92345.64248.290

27

11.80812.87914.57316.15118.11436.74140.11343.19546.96349.645

28

12.46113.56515.30816.92818.93937.91641.33744.46148.27850.993

29

13.12114.25616.04717.70819.76839.08742.55745.72249.58852.336

30

13.78714.95316.79118.49320.59940.25643.77346.97950.89253.672

40

20.70722.16424.43326.50929.05151.80555.75859.34263.69166.766

50

27.99129.70732.35734.76437.68963.16767.50571.42076.15479.490

60

35.53437.48540.48243.18846.45974.39779.08283.29888.37991.952

70

43.27545.44248.75851.73955.32985.52790.53195.023100.425104.215

80

51.17253.54057.15360.39164.27896.578101.879106.629112.329116.321

90

59.19661.75465.64769.12673.291107.565113.145118.136124.116128.299

100

67.32870.06574.22277.92982.358118.498124.342129.561135.807140.169

Figure 1

Python programming – a very brief idea

Python is a high-level programming language which is used to serve several purposes in the domain of information technology. In context with this exploration, python programming language can be used to develop a prototype of the feature of red envelope only with respect to the amount of money that should be disbursed.

Hypothesis

Null hypothesis

It is assumed that there does not exist any correlation between the number of drawing and probability of getting the maximum share.

Alternate hypothesis

It is assumed that there exists a correlation between the number of drawing and probability of getting the maximum share.

Data collection

Source of data

A data sheet has been prepared based on several news articles, reports and surveys in money disbursed in each drawing in Red Envelope WeChat. It has been possible to record the data of number of drawing and amount as these amounts directly reflect in bank statement.

 

Justification of the Source and Interval of Raw Data

Data sheet has been prepared based on the amount of money added in the red envelope by the contributor. In all the data sets, the number of drawings is set to 30. This is treated as a controlled variable to keep a uniformity to study the correlation. The amount of money added in each trial is increased linearly at an interval of 30. This is done to ignore more complex calculations as the number of drawings is kept fixed to 30.

Raw data table

TermAmount of Money disbursed (in CNY)
11.23
21.43
30.97
40.45
50.78
60.56
71.23
84.51
90.98
100.23
110.34
120.21
132.34
140.98
151.00
160.65
171.02
180.56
190.98
200.34
214.23
220.34
230.16
240.54
251.02
260.94
270.56
280.45
290.45
300.52

Figure 2 - Table On Raw Data Table For Disbursement Of Money In 30 Drawings When Total Amount Of Money Is 30 CNY

TermAmount of Money disbursed (in CNY)
10.65
21.65
32.34
42.65
51.8
62.00
71.78
81.43
91.76
103.54
111.34
120.65
130.34
141.23
151.65
162.43
171.43
181.76
192.34
202.98
212.34
227.65
232.43
242.98
252.34
261.54
272.00
281.23
291.18
300.56

Figure 3 - Table On Raw Data Table For Disbursement Of Money In 30 Drawings When Total Amount Of Money Is 60 CNY

TermAmount of Money disbursed (in CNY)
13
23.23
32.34
42.65
53.87
63.45
72.12
81.54
93.76
103.98
113.56
121.23
130.45
140.34
151.54
162.34
171.23
183.45
193
201.32
211.54
223.65
233.87
242.56
252.54
2610.43
2710.54
282.34
291.65
302.48

Figure 4 - Table On Raw Data Table For Disbursement Of Money In 30 Drawings When Total Amount Of Money Is 90 CNY

TermAmount of Money disbursed (in CNY)
13.54
21.76
34.65
47.65
55.43
63.34
72.43
82.64
92.76
103.54
112.43
122.98
131.65
141.65
151.54
160.98
170.76
182.54
192.86
201.54
213.65
224.87
234.65
248.76
2515.43
262.54
2710.98
283.54
294.54
304.37

Figure 5 - Table On Raw Data Table For Disbursement Of Money In 30 Drawings When Total Amount Of Money Is 120 CNY

TermAmount of Money disbursed (in CNY)
15.03
25.02
35.76
44.36
55.03
65.32
75.65
84.87
95.34
105.76
114.67
125.09
135.87
144.56
154.56
164.87
175.67
185.23
195.67
205.98
214.87
224.67
234.56
244.98
255.87
265.89
275.03
284.45
294.34
301.03

Figure 6 - Table On Raw Data Table For Disbursement Of Money In 30 Drawings When Total Amount Of Money Is 150 CNY

Processed data table

Figure 7 - Table On Processed Data Table For Disbursement Of Money When Total Amount Of Money Is 30 CNY

Figure 8 - Table On Processed Data Table For Disbursement Of Money When Total Amount Of Money Is 60 CNY