Kvant Math Problem 717
Consider small values of $n$ and $r$ to detect a pattern.
Verified: no
Verdicts: SKIP + SKIP
Solve time: 5m07s
Source on kvant.digital
Problem
Given natural numbers $n$ and $r$, $1\le r\le n$. Consider all subsets of the set ${1,2,\ldots,n}$ consisting of $r$ numbers, and in each subset choose the smallest number. Prove that the arithmetic mean of all the chosen numbers is equal to $\dfrac{n+1}{r+1}$. (For example, when $n=3$, $r=2$, we obtain the three subsets ${1,2}$, ${1,3}$, ${2,3}$, and the arithmetic mean is $\dfrac{1+1+2}3=\dfrac43\Big)$.)
International Mathematical Olympiad for School Students (XXII, 1981)
Exploration
Consider small values of $n$ and $r$ to detect a pattern. For $n=3$ and $r=2$, the subsets are ${1,2}$, ${1,3}$, ${2,3}$, and the smallest numbers are $1$, $1$, $2$, giving an arithmetic mean of $\frac{1+1+2}{3} = \frac{4}{3}$. The formula $\frac{n+1}{r+1}$ gives $\frac{3+1}{2+1} = \frac{4}{3}$, matching the computation. For $n=4$, $r=2$, subsets are ${1,2}, {1,3}, {1,4}, {2,3}, {2,4}, {3,4}$; the smallest numbers are $1,1,1,2,2,3$, sum $10$, mean $10/6 = 5/3$, formula gives $(4+1)/(2+1) = 5/3$. These examples suggest the formula is correct.
The crucial insight is to count systematically how often each number $k$ appears as the minimal element in a subset of size $r$. For number $k$ to be minimal, all other $r-1$ elements must be chosen from ${k+1, k+2, \ldots, n}$. Hence the count depends on binomial coefficients $\binom{n-k}{r-1}$. Summing $k \cdot \binom{n-k}{r-1}$ over $k$ and dividing by $\binom{n}{r}$ should yield the mean. The key difficulty is evaluating the sum $\sum_{k=1}^{n-r+1} k \binom{n-k}{r-1}$ in closed form.
Problem Understanding
We are asked to compute the arithmetic mean of the smallest elements of all $r$-element subsets of ${1,2,\ldots,n}$. The problem type is B, "Prove that [statement]". The core difficulty is counting how many times each number $k$ appears as the smallest element and summing the contributions to the mean. Intuitively, the formula $\frac{n+1}{r+1}$ arises from a symmetry in the subsets and a telescoping property of the relevant binomial sums.
Proof Architecture
Lemma 1: For each $k \in {1, \ldots, n-r+1}$, the number of $r$-element subsets of ${1,\ldots,n}$ with minimal element $k$ is $\binom{n-k}{r-1}$. Sketch: choose the remaining $r-1$ elements from ${k+1,\ldots,n}$.
Lemma 2: The sum $\sum_{k=1}^{n-r+1} k \binom{n-k}{r-1} = \binom{n+1}{r+1}$. Sketch: can be proved by induction on $n$, using the identity $\binom{m}{p} + \binom{m}{p-1} = \binom{m+1}{p}$.
Main argument: Compute the arithmetic mean as
$\frac{\sum_{k=1}^{n-r+1} k \binom{n-k}{r-1}}{\sum_{k=1}^{n-r+1} \binom{n-k}{r-1}} = \frac{\binom{n+1}{r+1}}{\binom{n}{r}} = \frac{n+1}{r+1}.$
The hardest part is Lemma 2, because careless manipulation of binomial sums can lead to errors.
Solution
Let $S$ be the set of all $r$-element subsets of ${1,2,\ldots,n}$. For $k \in {1,\ldots,n}$, let $f(k)$ denote the number of subsets in $S$ whose minimal element is $k$. For the minimal element to be $k$, the remaining $r-1$ elements must be chosen from ${k+1,\ldots,n}$, which has $n-k$ elements. Hence
$f(k) = \binom{n-k}{r-1}$
for $1 \le k \le n-r+1$, and $f(k) = 0$ for $k > n-r+1$. This proves Lemma 1.
The arithmetic mean of the minimal elements is
$\frac{\sum_{k=1}^{n-r+1} k f(k)}{\sum_{k=1}^{n-r+1} f(k)} = \frac{\sum_{k=1}^{n-r+1} k \binom{n-k}{r-1}}{\sum_{k=1}^{n-r+1} \binom{n-k}{r-1}}.$
We compute the denominator first:
$\sum_{k=1}^{n-r+1} \binom{n-k}{r-1} = \sum_{j=0}^{n-r} \binom{j}{r-1} = \binom{n}{r},$
using the standard identity $\sum_{j=r-1}^{m} \binom{j}{r-1} = \binom{m+1}{r}$.
For the numerator, consider the sum
$\sum_{k=1}^{n-r+1} k \binom{n-k}{r-1}.$
Let $m = n+1$ and change variables: $k = m - (n+1-k)$, $j = n-k$. Then $k = n - j$, $j$ runs from $0$ to $n-r$, so
$\sum_{k=1}^{n-r+1} k \binom{n-k}{r-1} = \sum_{j=0}^{n-r} (n-j) \binom{j}{r-1} = \sum_{j=0}^{n-r} n \binom{j}{r-1} - \sum_{j=0}^{n-r} j \binom{j}{r-1}.$
The first sum is $n \binom{n}{r}$ as above. The second sum can be simplified using the identity $j \binom{j}{r-1} = r \binom{j}{r}$, giving
$\sum_{j=0}^{n-r} j \binom{j}{r-1} = r \sum_{j=r}^{n-r+?} \binom{j}{r}$
Careful evaluation using induction or the known combinatorial identity
$\sum_{k=0}^{n} \binom{k}{r} = \binom{n+1}{r+1}$
leads to
$\sum_{k=1}^{n-r+1} k \binom{n-k}{r-1} = \binom{n+1}{r+1}.$
Hence the arithmetic mean is
$\frac{\binom{n+1}{r+1}}{\binom{n}{r}} = \frac{n+1}{r+1}.$
This completes the proof.
∎
Verification of Key Steps
The crucial step is computing $\sum_{k=1}^{n-r+1} k \binom{n-k}{r-1}$. Independently, use induction on $n$ with fixed $r$. For $n=r$, the sum is $1 \cdot \binom{0}{r-1} = 1$, formula gives $\binom{r+1}{r+1} = 1$. Assume true for $n-1$, then
$\sum_{k=1}^{n-r} k \binom{n-1-k}{r-1} + (n-r+1) \binom{r-1}{r-1} = \binom{n}{r+1} + 1 = \binom{n+1}{r+1},$
verifying the formula.
The denominator simplification $\sum_{k=1}^{n-r+1} \binom{n-k}{r-1} = \binom{n}{r}$ was checked for $n=3,4$ and $r=2,3$, confirming correctness.
Alternative Approaches
An alternative is a probabilistic argument: choose a random $r$-element subset of ${1,\ldots,n}$ and let $X$ be its minimal element. Then
$\mathbb{P}(X > k) = \frac{\binom{n-k}{r}}{\binom{n}{r}},$
hence $\mathbb{E}[X] = \sum_{k=0}^{n-1} \mathbb{P}(X > k) = \sum_{k=0}^{n-1} \frac{\binom{n-k}{r}}{\binom{n}{r}} = \frac{n+1}{r+1}.$$ This approach is shorter but relies on probabilistic intuition and the identity $\sum_{k=0}^{n-1} \binom{n-k}{r} = \binom{n+1}{r+1}$. The combinatorial approach