Formel zum Würfeln (non-brute force)

Zunächst bin ich mir nicht sicher, wo diese Frage veröffentlicht werden soll. Ich frage, ob ein Statistikproblem NP-Complete ist und ob es nicht programmgesteuert gelöst werden soll. Ich poste es hier, weil das Statistikproblem der Mittelpunkt ist.

Ich versuche eine bessere Formel zu finden, um ein Problem zu lösen. Das Problem ist: Wenn ich 4W6 (4 gewöhnliche 6-seitige Würfel) habe und alle auf einmal würfle, entferne ich einen Würfel mit der niedrigsten Zahl (genannt "Fallenlassen") und summiere dann die verbleibenden 3, was ist die Wahrscheinlichkeit für jedes mögliche Ergebnis ? Ich weiß, die Antwort lautet:

Sum (Frequency): Probability
3   (1):         0.0007716049
4   (4):         0.0030864198
5   (10):        0.0077160494
6   (21):        0.0162037037
7   (38):        0.0293209877
8   (62):        0.0478395062
9   (91):        0.0702160494
10  (122):       0.0941358025
11  (148):       0.1141975309
12  (167):       0.1288580247
13  (172):       0.1327160494
14  (160):       0.1234567901
15  (131):       0.1010802469
16  (94):        0.0725308642
17  (54):        0.0416666667
18  (21):        0.0162037037

Der Durchschnitt liegt bei 12,24 und die Standardabweichung bei 2,847.

Ich habe die obige Antwort mit roher Gewalt gefunden und weiß nicht, wie oder ob es eine Formel dafür gibt. Ich vermute, dass dieses Problem NP-vollständig ist und daher nur mit brachialer Gewalt gelöst werden kann. Es könnte möglich sein, alle Wahrscheinlichkeiten von 3d6 (3 normale 6-seitige Würfel) zu erhalten und diese dann nach oben zu verschieben. Dies wäre schneller als rohe Gewalt, weil ich eine schnelle Formel habe, wenn alle Würfel behalten werden.

Ich habe die Formel so programmiert, dass alle Würfel im College bleiben. Ich hatte meinen Statistikprofessor danach gefragt und er fand diese Seite , die er mir dann erklärte. Es gibt einen großen Leistungsunterschied zwischen dieser Formel und Brute Force: 50W6 dauerte 20 Sekunden, aber der niedrigste Absturz von 8W6 war nach 40 Sekunden zu verzeichnen (Chrom hat nicht mehr genügend Arbeitsspeicher).

Ist das Problem NP-Complete? Wenn ja, legen Sie bitte einen Beweis vor, wenn nein, legen Sie bitte eine gewaltfreie Formel zur Lösung vor.

Beachten Sie, dass ich nicht viel über NP-Complete weiß, also denke ich vielleicht an NP, NP-Hard oder etwas anderes. Der Beweis für die NP-Vollständigkeit ist für mich nutzlos. Der einzige Grund, warum ich danach frage, ist, zu verhindern, dass Menschen raten. Und bitte machen Sie mit, es ist lange her, dass ich daran gearbeitet habe: Ich kann mich nicht mehr an Statistiken erinnern, so wie ich das möglicherweise lösen muss.

Im Idealfall suche ich nach einer allgemeineren Formel für die Anzahl X von Würfeln mit Y Seiten, wenn N von ihnen fallengelassen werden, aber ich beginne mit etwas viel Einfacherem.

Bearbeiten:

Ich würde auch die Formel vorziehen, um Frequenzen auszugeben, aber es ist akzeptabel, nur Wahrscheinlichkeiten auszugeben.

Für Interessierte habe ich whubers Antwort in JavaScript auf programmiert meinem GitHub (bei diesem Commit verwenden nur die Tests tatsächlich die definierten Funktionen).

dice np SkySpiral7
quelle

Das ist eine interessante Frage. Ich denke, es sollte hier zum Thema werden. Danke für deine Rücksicht.

gung - Wiedereinsetzung von Monica

Obwohl die Einstellung interessant ist, haben Sie noch keine beantwortbare Frage gestellt: Die Idee der NP-Vollständigkeit hängt von einer Klasse von Problemen ab, während Sie nur eine beschrieben haben. Genau wie soll es verallgemeinern? Obwohl Sie darauf hinweisen, dass die Anzahl der Würfel variieren kann, sind verschiedene zusätzliche Optionen möglich, die unterschiedliche Antworten liefern können: Sie können die Anzahl der Gesichter, die Werte auf den Gesichtern, die Anzahl der Würfel und die Anzahl der abgeworfenen Würfel ändern auf verschiedene Weise mit verschiedenen Beziehungen zwischen ihnen.

Whuber

@whuber Sie kennt keine Komplexitätstheorie, aber ich denke, es ist klar, dass sie nach der Familie der Probleme fragt, die durch die Änderung der Anzahl der Würfel entstehen. Ich denke auch, dass ich einen effizienten Algorithmus dafür habe.

Andy Jones

@Andy Ich sehe am Ende, dass sie nach "einer allgemeineren Formel für die Anzahl X von Würfeln mit Y Seiten, wenn N von ihnen fallen gelassen werden" fragt.

whuber

@whuber Hah! Anscheinend nicht so klar wie ich damals dachte. Es tut mir leid.

Andy Jones

Antworten:

Lösung

Es gebe $n=4$ Würfel, die den Ergebnissen gleiche Chancen geben $1, 2, \ldots, d=6$ . Sei $K$ das Minimum der Werte, wenn alle $n$ Würfel unabhängig voneinander geworfen werden.

Betrachten wir die Verteilung der Summe aller $n$ - Werte abhängig $K$ . Sei $X$ diese Summe. Die Erzeugungsfunktion für die Anzahl der Möglichkeiten, einen gegebenen Wert von $X$ , vorausgesetzt, das Minimum ist mindestens $k$ , ist

\begin{matrix} (1) & f_{(n, d, k)} (x) = x^{k} + x^{k + 1} + \dots + x^{d} = x^{k} \frac{1 - x^{d - k + 1}}{1 - x} . \end{matrix}

$f_{(n,d,k)}(x) = x^k+x^{k+1} + \cdots + x^d = x^k\frac{1-x^{d-k+1}}{1-x}.\tag{1}$

Da die Würfel unabhängig sind, ist die Erzeugungsfunktion für die Anzahl von Wegen, um Werte von zu bilden, wobei alle Würfel Werte von oder größer zeigen, gleich $X$ $n$ $k$

\begin{matrix} (2) & f_{(n, d, k)} (x)^{n} = x^{k n} {(\frac{1 - x^{d - k + 1}}{1 - x})}^{n} . \end{matrix}

$f_{(n,d,k)}(x)^n = x^{kn}\left(\frac{1-x^{d-k+1}}{1-x}\right)^n.\tag{2}$

Diese Erzeugungsfunktion enthält Terme für die Ereignisse, bei denen überschreitet , so dass wir sie abziehen müssen. Daher wird die Erzeugungsfunktion für die Anzahl von Arten von Werten zu bilden , , da , ist , $K$ $k$ $X$ $K=k$

\begin{matrix} (3) & f_{(n, d, k)} (x)^{n} - f_{(n, d, k + 1)} (x)^{n} . \end{matrix}

$f_{(n,d,k)}(x)^n - f_{(n,d,k+1)}(x)^n.\tag{3}$

Feststellung , dass die Summe der höchsten Werte ist die Summe aller Werte minus dem kleinsten gleich . Die Erzeugungsfunktion muss daher durch geteilt werden . Es wird eine wahrscheinlichkeitserzeugende Funktion durch Multiplikation mit der gemeinsamen Wahrscheinlichkeit einer beliebigen Kombination von Würfeln : $n-1$ $X-K$ $k$ $(1/d)^n$

\begin{matrix} (4) & d^{- n} \sum_{k = 1}^{d} x^{- k} (f_{(n, d, k)} (x)^{n} - f_{(n, d, k + 1)} (x)^{n}) . \end{matrix}

$d^{-n}\sum_{k=1}^dx^{-k}\left(f_{(n,d,k)}(x)^n - f_{(n,d,k+1)}(x)^n\right).\tag{4}$

Da alle Polynomprodukte und Potenzen in -Operationen berechnet werden können (sie sind Faltungen und können daher mit der diskreten schnellen Fouriertransformation ausgeführt werden), ist der gesamte Rechenaufwand $O(n\log n)$ . Insbesondere handeltes sich um einen polynomiellen Zeitalgorithmus. $O(k\,n\log n)$

Beispiel

Lassen Sie uns das Beispiel in der Frage mit und . $n=4$ $d=6$

Formel für die PGF von bedingt durch ergibt $(1)$ $X$ $K\ge k$

\begin{aligned} f_{(4, 6, 1)} (x) & = x + x^{2} + x^{3} + x^{4} + x^{5} + x^{6} \\ f_{(4, 6, 2)} (x) & = x^{2} + x^{3} + x^{4} + x^{5} + x^{6} \\ \dots \\ f_{(4, 6, 5)} (x) & = x^{5} + x^{6} \\ f_{(4, 6, 6)} (x) & = x^{6} \\ f_{(4, 6, 7)} (x) & = 0. \end{aligned}

$\eqalign{ f_{(4,6,1)}(x) &= x+x^2+x^3+x^4+x^5+x^6 \\ f_{(4,6,2)}(x) &= x^2+x^3+x^4+x^5+x^6 \\ \ldots \\ f_{(4,6,5)}(x) &= x^5+x^6 \\ f_{(4,6,6)}(x) &= x^6 \\ f_{(4,6,7)}(x) &= 0. }$

Das Erhöhen auf die Potenz wie in Formel ergibt $n=4$ $(2)$

\begin{aligned} f_{(4, 6, 1)} (x)^{4} & = x^{4} + 4 x^{5} + 10 x^{6} + \dots + 4 x^{23} + x^{24} \\ f_{(4, 6, 2)} (x)^{4} & = x^{8} + 4 x^{9} + 10 x^{10} + \dots + 4 x^{23} + x^{24} \\ \dots \\ f_{(4, 6, 5)} (x)^{4} & = x^{20} + 4 x^{21} + 6 x^{22} + 4 x^{23} + x^{24} \\ f_{(4, 6, 6)} (x)^{4} & = x^{24} \\ f_{(4, 6, 7)} (x)^{4} & = 0 \end{aligned}

$\eqalign{ f_{(4,6,1)}(x)^4 &= x^4 + 4x^5 + 10 x^6 + \cdots + 4x^{23} + x^{24} \\ f_{(4,6,2)}(x)^4 &= x^8 + 4x^9 + 10x^{10}+ \cdots + 4x^{23} + x^{24} \\ \ldots \\ f_{(4,6,5)}(x)^4 &=x^{20} + 4 x^{21} + 6 x^{22} + 4x^{23} +x^{24}\\ f_{(4,6,6)}(x)^4 &= x^{24}\\ f_{(4,6,7)}(x)^4 &= 0 }$

Their successive differences in formula $(3)$ are

\begin{aligned} f_{(4, 6, 1)} (x)^{4} - f_{(4, 6, 2)} (x)^{4} & = x^{4} + 4 x^{5} + 10 x^{6} + \dots + 12 x^{18} + 4 x^{19} \\ f_{(4, 6, 2)} (x)^{4} - f_{(4, 6, 3)} (x)^{4} & = x^{8} + 4 x^{9} + 10 x^{10} + \dots + 4 x^{20} \\ \dots \\ f_{(4, 6, 5)} (x)^{4} - f_{(4, 6, 6)} (x)^{4} & = x^{20} + 4 x^{21} + 6 x^{22} + 4 x^{23} \\ f_{(4, 6, 6)} (x)^{4} - f_{(4, 6, 7)} (x)^{4} & = x^{24} . \end{aligned}

$\eqalign{ f_{(4,6,1)}(x)^4 - f_{(4,6,2)}(x)^4 &= x^4 + 4x^5 + 10 x^6 + \cdots + 12 x^{18} + 4x^{19} \\ f_{(4,6,2)}(x)^4 - f_{(4,6,3)}(x)^4 &= x^8 + 4x^9 + 10x^{10} + \cdots + 4 x^{20} \\ \ldots \\ f_{(4,6,5)}(x)^4 - f_{(4,6,6)}(x)^4 &=x^{20} + 4 x^{21} + 6 x^{22} + 4x^{23} \\ f_{(4,6,6)}(x)^4 - f_{(4,6,7)}(x)^4 &= x^{24}. }$

The resulting sum in formula $(4)$ is

6^{- 4} (x^{3} + 4 x^{4} + 10 x^{5} + 21 x^{6} + 38 x^{7} + 62 x^{8} + 91 x^{9} + 122 x^{10} + 148 x^{11} + 167 x^{12} + 172 x^{13} + 160 x^{14} + 131 x^{15} + 94 x^{16} + 54 x^{17} + 21 x^{18}) .

$6^{-4}\left(x^3 + 4x^4 + 10x^5 + 21x^6 + 38x^7 + 62x^8 + 91x^9 + 122x^{10} + 148x^{11} + \\167x^{12} + 172x^{13} + 160x^{14} + 131x^{15} + 94x^{16} + 54x^{17} + 21x^{18}\right).$

For example, the chance that the top three dice sum to $14$ is the coefficient of $x^{14}$ , equal to

6^{- 4} \times 160 = 10 / 81 = 0.123 456 790 123 456 \dots .

$6^{-4}\times 160 = 10/81 = 0.123\,456\,790\,123\,456\,\ldots.$

It is in perfect agreement with the probabilities quoted in the question.

By the way, the mean (as calculated from this result) is $15869/1296 \approx 12.244598765\ldots$ and the standard deviation is $\sqrt{13\,612\,487/1\,679\,616}\approx 2.8468444$ .

A similar (unoptimized) calculation for $n=400$ dice instead of $n=4$ took less than a half a second, supporting the contention that this is not a computationally demanding algorithm. Here is a plot of the main part of the distribution:

Since the minimum $K$ is highly likely to equal $1$ and the sum $X$ will be extremely close to having a Normal $(400\times 7/2, 400\times 35/12)$ distribution (whose mean is $1400$ and standard deviation is approximately $34.1565$ ), the mean must be extremely close to $1400-1=1399$ and the standard deviation extremely close to $34.16$ . This nicely describes the plot, indicating it is likely correct. In fact, the exact calculation gives a mean of around $2.13\times 10^{-32}$ greater than $1399$ and a standard deviation around $1.24\times 10^{-31}$ less than $\sqrt{400\times 35/12}$ .

whuber
quelle

Your answer is fast and is correct so I've marked it as the answer. Also in an edit I said it would also be nice to have frequencies if possible. For that you don't need to edit your answer since I can see that the 6^-4 multiplier is used to convert from frequency to probability.

SkySpiral7

Edit: @SkySpiral has had trouble getting the below formula to work. I currently don't have time to work out what the issue is, so if you're reading this it's best to proceed under the assumption it's incorrect.

I'm not sure about the general problem with varying numbers of dice, sides, and drops, but I think I can see an efficient algorithm for the drop-1 case. The qualifier is that I'm not completely sure that it's correct, but right now I can't see any flaws.

Let's start by not dropping any dice. Suppose $X_n$ represents the $n$ th die, and suppose $Y_n$ represents the sum of $n$ dice. Then

p (Y_{n} = a) = \sum_{k} p (Y_{n - 1} = a - k) p (X_{n} = k)

$p(Y_n = a) = \sum_k p(Y_{n-1} = a - k)p(X_n=k)$

Now suppose $Z_n$ is the sum of $n$ dice when one die is dropped. Then

p (Z_{n} = a) = p (n th die is the smallest) p (Y_{n - 1} = a) + p (n th die is not the smallest) \sum_{k} p (Z_{n - 1} = a - k) p (X_{n} = k)

$p(Z_n = a) = p(\text{$n$th die is the smallest})p(Y_{n-1} = a) + \\ p(\text{$n$th die is not the smallest})\sum_k p(Z_{n-1} = a - k)p(X_n=k)$

If we define $M_n$ to be distribution of the minimum of $n$ dies, then

p (Z_{n} = a) = p (X_{n} \leq M_{n - 1}) p (Y_{n - 1} = a | X_{n} \leq M_{n - 1}) + p (X_{n} > M_{n - 1}) \sum_{k} p (Z_{n - 1} = a - k) p (X_{n} = k | X_{n} > M_{n - 1})

$p(Z_n = a) = p(X_n \leq M_{n-1})p(Y_{n-1} = a | X_n \leq M_{n-1}) + \\ p(X_n > M_{n-1})\sum_k p(Z_{n-1} = a - k)p(X_n=k | X_n > M_{n-1})$

and we can calculate $M_n$ using

p (M_{n} = a) = p (X_{n} \leq M_{n - 1}) p (X_{n} = a | X_{n} \leq M_{n - 1}) + p (X_{n} > M_{n - 1}) p (M_{n - 1} = a | X_{n} > M_{n - 1})

$p(M_n = a) = p(X_n \leq M_{n-1})p(X_n = a |X_n \leq M_{n-1}) + p(X_n > M_{n-1})p(M_{n-1} = a|X_n > M_{n-1})$

Anyway, together this all suggests a dynamic programming algorithm based on $Y_n, Z_n$ and $M_n$ . Should be quadratic in $n$ .

edit: A comment has been raised on how to calculate $p(X_n \leq M_{n-1})$ . Since $X_n, M_{n-1}$ can each only take on one of six values, we can just sum over all possibilities:

p (X_{n} \leq M_{n - 1}) = \sum_{a, b} p (X_{n} = a, M_{n - 1} = b, a \leq b)

$p(X_n \leq M_{n-1}) = \sum_{a,b} p(X_n = a, M_{n-1} = b, a \leq b)$

Similarly, $p(X_n = k | X_n > M_{n-1})$ can be calculated by applying Bayes rule then summing over the possible values of $X_n, M_{n-1}$ .

Andy Jones
quelle

+1 This looks correct and you said that's it's quadratic. But it's been a few years since I took statistics (I'm primarily a programmer). So I'd like to fully understand this before marking it as the answer. Also I see you have p(nth is the smallest die) does this include if nth is tied with the smallest? Such as rolling all 3s.

SkySpiral7

Good catch. If the

n

$n$ th die rolled is the same as the current minimum, we can regard that die as the one to be dropped. In which case the distribution is

Y_{n - 1}

$Y_{n-1}$ . I've swapped some

(<)

$(<)$ s for

(\leq)

$(\leq)$ s to reflect this.

Andy Jones

Thank you. If I understand this correctly I think your formulas are the answer. However I don't know how to calculate p(X(n) > M(n-1)) (or the negation of it) or p(X(n)=k|X(n) > M(n-1)) so I can't use this answer yet. I'll mark this as the answer but I'd like more information. Can you edit your answer to explain these or should I post it as another question?

SkySpiral7

Edited my answer.

Andy Jones

Sorry I know it's been a year and a half but I've finally gotten around to implementing this formula into code. However the p(Z(n)=a) formula appears incorrect. Suppose 2 dice with 2 sides (drop lowest), what are the chances of the result being 1? The chance of X(n) being the smallest or tied is 3/4 and p(Y(n-1)=1) is 1/2 so that Z(n) returns at least 3/8 even though the correct answer is 1/4. The Z formula looks correct to me and I don't know how to fix it. So if it's not too much to ask: what do you think?

SkySpiral7

I have a reasonably efficient algorithm for this that, on testing, seems to match results of pure brute force while relying less heavily on enumerating all possibilities. It's actually more generalized than the above problem of 4d6, drop 1.

Some notation first: Let $X_NdY$ indicate that you are rolling $X$ dice with $Y$ faces (integer values $1$ to $Y$ ), and considering only the highest $N$ dice rolled. The output is a sequence of dice values, e.g. $4_3d6$ yields $3, 4, 5$ if you rolled $1, 3, 4, 5$ on the four dice. (Note that I'm calling it a "sequence," but the order is not important here, particularly since all we care about in the end is the sum of the sequence.)

The probability $P(X_NdY = S)$ (or more specifically, $P(4_3d6 = S)$ ) is a simplified version of the original problem, where we are only considering a specific set of dice, and not all possible sets that add up to a given sum.

Suppose $S$ has $k$ distinct values, $s_0, s_1, ..., s_k$ , such that $s_i > s_{i+1}$ , and each $s_i$ has a count of $c_i$ . For example, if $S = 3, 4, 4, 5$ , then $(s_0,c_0) = (5,1)$ , $(s_1,c_1) = (4,2)$ , and $(s_2,c_2) = (3,1)$ .

You can calculate $P(X_NdY = S)$ in the following way:

P (X_{N} d Y = S) = \frac{(\prod_{i = 0}^{k - 1} (\binom{X - \sum_{h = 0}^{i - 1} c_{h}}{c_{i}})) (\sum_{j = 0}^{X - N} (\binom{c_{k} + X - N}{c_{k} + X - N - j}) (s_{k} - 1)^{j})}{Y^{X}}

$P(X_NdY = S) = \frac{ \left( \prod_{i=0}^{k-1} {X - \sum_{h=0}^{i-1} c_h \choose c_i} \right) \left( \sum_{j=0}^{X-N} { c_k+X-N \choose c_k+X-N-j} (s_k-1)^j \right)}{ Y^X }$

That's pretty messy, I know.

The product expression $\prod_{i=0}^{k-1}$ is iterating through all but the lowest of the values in $S$ , and calculating all the ways those values may be distributed among the dice. For $s_0$ , that's just $X \choose c_i$ , but for $s_1$ , we have to remove the $c_0$ dice that have already been set aside for $s_0$ , and likewise for $s_i$ you must remove $\sum_{h=0}^{i-1}c_h$ .

The sum expression $\sum_{j=0}^{X-N}$ is iterating through all the possibilities of how many of the dropped dice were equal to $s_k$ , since that affects the possible combinations for the un-dropped dice with $s_k$ as their value.

By example, let's consider $P[4_3d6=(5,4,4)]$ :

(s_{1}, c_{1}) = (5, 1)

$(s_1, c_1) = (5, 1)$

(s_{2}, c_{2}) = (4, 2)

$(s_2, c_2) = (4, 2)$

So using the formula above:

P [4_{3} d 6 = (5, 4, 4)] = \frac{(\binom{4}{1}) ((\binom{3}{3}) \cdot 3^{0} + (\binom{3}{2}) \cdot 3^{1})}{6^{4}} = \frac{5}{162} = 0.0 \bar{308641975}

$P[4_3d6=(5,4,4)] \\ = \frac{ {4 \choose 1} \left( {3 \choose 3} \cdot 3^0 + {3 \choose 2} \cdot 3^1 \right) }{ 6^4 } \\ = \frac{5}{162} = 0.0\overline{308641975}$

The formula breaks down on a domain issue when $s_k=1$ and $j=0$ in the summation, leading to a first term of $0^0$ , which is indeterminate and needs to be treated as $1$ . In such a case, a summation is not actually necessary at all, and can be omitted, since all the dropped dice will also have a value of $s_k = 1$ .

Now here's where I do need to rely on some brute force. The original problem was to calculate the probability of the sum being some value, and $X_NdY$ represents the individual dice left after dropping. This means you must add up the probabilities for all possible sequences $S$ (ignoring ordering) whose sum is the given value. Perhaps there is a formula to calculate this across all such values of $S$ at once, but I haven't even tried broaching that yet.

I've implemented this in Python first, and the above is an attempt to express it mathematically. My Python algorithm is accurate and reasonably efficient. There are some optimizations that could be made for the case of calculating the entire distribution of $\sum X_NdY$ , and maybe I'll do that later.

Riley John Gibbs
quelle

As a programmer it might be easier for me to understand your Python code (although I've never used Python so it might be the same). Posting the code here is off topic but you could post a link to github etc.

SkySpiral7

Your answer may be correct and it seems to reduce the complexity from O(Y^X) to O((Y+X-1)!/(X!*(Y-1)!)) but it still isn't as efficient as whuber's answer of O(c*X*log(X)). Thanks for your answer though +1.

SkySpiral7