Philosophical Analytics: Blackjack: A Paradox

Introduction

A friend of mine asked me to simulate blackjack deals to verify what he believed would be a new winning strategy in blackjack. If you don't know the rules of blackjack, see here for the Wikipedia page. The basic idea he had was that, assuming the dealer hits on 16 or below and stays on 17 or above, the face up cards that lead to a dealer bust should be different from what is typically assumed.

The typical assumption is that the next card is always a 10 in value, as 10s are the most common point value (16/52 are worth 10 points). Assuming this and the dealer rules, we would expect the dealer to bust the most on face up 2s through 6s. Why? Because if a dealer gets a 6 face up, we believe him to have a 10 face down, which adds to 16, which means he has to hit. But getting another 10 gives him 26 which is a bust. Similarly down to 2 (2->12->22 = bust).

However, while it is true that 10 is the most common point value for a card, the average point value in the deck is about 7 (higher or lower depending on if the Ace is an 11 or 1 in any hand). So we should expect any hidden cards to average about 7 points. But if this is true, then the dealer should bust with a different set of face up cards. For instance, if we assume the average to be 7, then we would expect 2, 8, and 9 to be the bust cards (2->9->16->23; 8->15->22; 9->16->23). If you choose a different average calculation, you can get a different set of face up bust cards.

So which is it? As it turns out, the original basic strategy of assuming the next card is a 10 is the correct approach. But why? This leads to the following paradox as both approaches seem intuitively correct:

The paradox: If the average value of a card in blackjack is about 7, why is the basic winning strategy to assume that the next card is a 10?

Data Exploration

The dealer busts about 28.6% of the time and stays about 71.4% of the time. The dealer does in fact bust most frequently on face up 2s-6s. See the table below (J=11, Q=12, K=13, A=14). This is based on 100,000 simulations. 5s and 6s are highest at about 43% of the time:

Face Up	Busted	Stay	Busted/Total
2	2671	4964	0.350
3	2828	4742	0.374
4	3136	4522	0.410
5	3338	4320	0.436
6	3294	4445	0.426
7	2047	5720	0.264
8	1817	5945	0.234
9	1771	5880	0.231
10	1670	6005	0.218
11	1623	6106	0.210
12	1679	5950	0.220
13	1716	6002	0.222
14	1012	6797	0.130

So the original basic strategy is correct that the bust cards are 2-6. And there is a significant break between 6 and 7. But why is this the case? Let's do some exploring.

Averages:

The average value of the cards in a dealer's hand when he busts is 7.03. When he stays, its 7.61. Breaking this down by face up card, we have the below table:

Face Up	Busted	Stay	Difference
2	6.076	5.548	0.529
3	6.383	5.696	0.688
4	6.726	5.814	0.911
5	7.052	5.877	1.175
6	7.500	6.350	1.150
7	7.162	7.380	-0.217
8	7.283	7.878	-0.596
9	7.467	8.359	-0.892
10	7.579	8.869	-1.290
11	7.597	8.878	-1.281
12	7.570	8.869	-1.299
13	7.586	8.888	-1.302
14	5.419	8.504	-3.085

Interestingly, the bust averages for 2-6 are higher than the stay averages, but the opposite is true for 7-14. Why is this the case? Perhaps another table of the average count of cards will help:

Face Up	Busted	Stay	Difference
2	3.946	3.517	0.429
3	3.783	3.422	0.361
4	3.606	3.338	0.269
5	3.478	3.284	0.194
6	3.296	3.047	0.249
7	3.399	2.580	0.819
8	3.317	2.467	0.850
9	3.233	2.366	0.867
10	3.174	2.257	0.917
11	3.169	2.255	0.915
12	3.184	2.256	0.928
13	3.172	2.255	0.917
14	4.454	2.520	1.934

Busts on average require more cards than stays, which makes sense, because a bust goes over 21 while a stay is between 17-21. On average, it takes more cards to get a higher value, hence, getting over 21 takes more cards than getting between 17 and 21.

However, there is a big break in average counts of cards between face up 2s-6s and face up 7s-14s that matches the break in average values of cards. Why should this exist?

If we start with the assumption that the next card (or any hidden card) is a 10, then this is to be expected. With that assumption, a face up 6 or below is a total of 16 or below (hit) while a 7 face up is a total of 17 (stay). A hit requires one more card at least, so it makes sense that a large break occurs between 6 and 7.

Also, the higher the face up card, the more likely a stay is with only 2 cards, meaning that each card is on average larger than a similar stay with 3 cards. For example, a stay of 17 with a 10 and 7 averages 8.5 per card while a stay of 17 with an 8, 2, and 7 averages 5.7 per card. As the face up card gets smaller, the amount of cards needed to get to a stay or bust increases, since the cards are worth less, hence, the averages get smaller.

But this isn't the whole story. The first table really comes from the combination of the second table and another table, the average hand value for the dealer by face up card. We can see that the busted averages are largely the same with the highest value at 6. The stay averages are also similar with a notable low value at 7. The difference between the busted total value and the stay total value shows a peak of 5.6 at a face up value of 7, and this decreases to either side.

This makes sense, since a 7 face up (and a hidden 10) is the lowest possible stay, while a 6 face up (with a hidden 10) is the highest hit. So we should expect the greatest difference between busts and stays to be right around the 6 and 7 divide.

Face Up	Busted	Stay	Difference
2	23.325	18.917	4.408
3	23.403	18.974	4.429
4	23.570	18.967	4.603
5	23.843	18.920	4.923
6	24.159	18.761	5.398
7	23.740	18.100	5.640
8	23.692	18.466	5.226
9	23.732	18.865	4.867
10	23.765	19.263	4.502
11	23.792	19.276	4.516
12	23.783	19.261	4.522
13	23.784	19.276	4.507
14	23.712	19.528	4.185

So a large difference in the middle combined with decreasing average counts of cards in the dealers hand makes for a switch in the difference of average card values at the 6/7 divide. And all of this makes sense when we expect the next (or hidden) card to have a value of 10.

Counts of 10s:

Is assuming that the next (or hidden) card is a 10 in value a good assumption? In a way, yes. 71% of dealer hands have a 10 in them. Thus, it is reasonable to assume that the dealer will have a 10 face down or at least coming as a third card. That is, most of the time, the dealer will have a 10 in the hand.

Here is a more detailed breakdown:

Count of 10s	Busted	Stay
0	4154	24520
1	15002	37876
2	9446	9002

The dealer busts only 4% of the time without any 10s. That is, only 15% of busts lack a 10 valued card. That means that 85% of busts have at least one 10 involved. With stays, only 34% of them lack a 10.

If we remove the possibility of getting a 10 valued card as the face up, then 59% of hands will still have a 10 valued card hidden or coming. In greater detail:

Count of 10s	Busted	Stay
0	4154	24520
1	12587	22815
2	5173	0

81% of these busts have a 10 involved. Stays are split nearly 50-50 on having a 10 or not. When a 10 is involved, 43% of the time the dealer busts.

Clearly, one should expect the dealer to get a 10 valued card.

Combining Averages with 10s

Suppose the dealer gets one 10 valued card (not the face up), which happens 51% of the time. What happens then?

Let's assume the other card besides the face up card is a 7 (the average). If the 10 comes first, then a bust comes on face up 5 and 6, which are the most likely to bust. If the 10 comes second, then 5-9 are busts.
If the other card is an 8 (the median), a bust happens on 4-6 with the 10 coming first, and 4-8 when it comes second.
If the other card is a 10 (the mode) (7% of the time), then a bust happens on 2-6.

So we see that busting on 5-6 is very likely in all circumstances, and then depending on what we take to be the best representative of the set (mean vs. median vs. mode), additional face up values lead to busting.

When we combine all of these together, it is not surprising that we do get the following result of bust averages by face up value, now ordered from most likely to least:

Face Up	Busted	Stay	Busted/Total
5	3338	4320	0.436
6	3294	4445	0.426
4	3136	4522	0.410
3	2828	4742	0.374
2	2671	4964	0.350
7	2047	5720	0.264
8	1817	5945	0.234
9	1771	5880	0.231
13	1716	6002	0.222
12	1679	5950	0.220
10	1670	6005	0.218
11	1623	6106	0.210
14	1012	6797	0.130

Still, if we had to pick one number to represent the set of values, it seems that 10 (the mode) is the best number to do so, and not the mean or the median. This best aligns with the face up value busts. But why?

Resolving the Paradox: A Statistical Lesson

If assuming that a hidden card is a 10 in value is the right approach, why is assuming that the next card is a 7 in value a bad approach? In other words, why does the mode take priority over the mean in representing the distribution of values of cards in blackjack?

The answer lies in the distribution of values in blackjack. A hidden assumption in my friend's approach is that the average value of the cards was from an approximately normal distribution. If that were the case, then we could expect the next card to be, on average, a 7, because in fact, the average card would be about a 7.

However, blackjack values are not normally distributed. There are 4 times as many 10s as there are of any other card value. As a result, using the average value of the cards to represent any hidden card values is misleading. This is a misuse of the mean to represent the sample.

To make this point more obvious, suppose we played a game in which every card was worth 200 points except for the 2 of clubs and 2 of spades, which were both worth -5000 points. The average value of the cards in this set is 0 points. However, most of the time, the next card played will be worth 200 points. And no card is in fact worth 0 points.

In this circumstance, it makes sense to treat every next card as being worth 200 points, knowing full well that 2 of the cards will be worth much less. Why? Because in fact, most of the time, the next card will be 200 points. The best strategy for winning the game will likely assume that every hidden card is worth 200 points.

Conclusion

Going back to blackjack, and given the foregoing, the best simple assumption does seem to be that any hidden card is a 10 in value. If we make this a bit more complex, the assumption becomes that the dealer will get at least one 10 valued card. So if this is not the face up card, we should expect it to be the face down card or a hit card. This, combined with what we have discovered about the mean and median, leads us view face up cards 2-6 as bust cards, with 5s and 6s being especially important.

The statistical lesson to be learned is that one must be careful in using the mean (or median or mode) to represent the dataset. Do a sanity check. Look at the data. Does it make sense, or does it mischaracterize the data? Using the word "expected" helps. Does the mean represent what I expect any unknown value to be, or is the mode or something else a better representation of what is expected?

Context is also important. Saying that the average value of a data set is X really does not tell us much about that dataset. Putting that into the context of the distribution, standard deviation, skew, mode, median, and other various statistical measures and descriptions can help us understand what the data is really like, and how we can best use that data to make good decisions.

It has been said that there are three kinds of lies: "lies, damned lies, and statistics." Make sure your statistics do not misrepresent the true nature of the data and become worse than damned lies.

Philosophical Analytics

Pages

Wednesday, February 17, 2016

Blackjack: A Paradox

Introduction

Data Exploration

Averages:

Counts of 10s:

Combining Averages with 10s

Resolving the Paradox: A Statistical Lesson

Conclusion

No comments:

Post a Comment

Blog Archive

Popular Posts

Labels