vectors of any inner product space (real or complex) [3]
The Cauchy-Schwarz inequality was originally expressed in terms of sequences
of numbers [1]. The continuous analogue is in terms
of two integrable
functions[4].
In terms of random variables
Given any two random variables X and Y,
E[XY]2≤E[X2]E[Y2]
with equality holding iff aX+bY=0 for some constants a and b,
at least one non-zero
(i.e. X and Y are linearly dependent).
Proof
If either E[X2]=0 or E[Y2]=0 then E[XY]=0.
Otherwise define
X^:=E[X2]X and Y^:=E[Y2]Y
for which E[X^2]=E[Y^2]=1.
The proof follows from the product of two numbers always being less than or
equal to the average of their squares
0E[X^Y^]E[X2]E[Y2]E[XY]E[XY]2≤E[(X^−Y^)2]≤E[2X^2+Y^2]≤21+1≤E[X2]E[Y2]
If both sides of the inequality are equal, linear dependence follows since
either X=0 or Y=0 or
E[X2]1X+E[Y2]1Y=X^−Y^=0
If X and Y are linearly dependent, either X=kY or Y=kX for some
constant k, either way both sides of the inequality are equal.
QED
In terms of vectors of a real inner product space
The probabilistic proof can be generalized to any real inner product space as
shown in [1].
Given any vectors x,y from a real inner product
space,
the Cauchy-Schwarz inequality is
⟨x,y⟩≤∥x∥∥y∥
with equality holding iff x and y are linearly dependent.
Proof
If either ∥x∥=0 or ∥y∥=0 then ⟨x,y⟩=0.
Otherwise define
x^:=∥x∥x and y^:=∥y∥y
for which ∥x^∥=∥y^∥=1.
02⟨x^,y^⟩2∥x∥∥y∥⟨x,y⟩⟨x,y⟩≤⟨x^−y^,x^−y^⟩≤⟨x^,x^⟩+⟨y^,y^⟩≤1+1≤∥x∥∥y∥
If both sides of the inequality are equal, linear dependence follows since
either x=0 or y=0 or
∥x∥1x+∥y∥1y=x^−y^=0
If x and y are linearly dependent, either x=λy or y=λx for some scaler λ, either way both sides of the inequality are
equal.
QED
In terms of vectors of an inner product space
This section considers the Cauchy-Schwarz inequality for vectors of a real or
complex inner product space.
The proof for real inner produce spaces does not work for complex inner
product spaces because ⟨y,x⟩=⟨x,y⟩ (complex
conjugate).
The proof is effectively the same as the previous proof for real inner
product spaces. But the normalized vectors x^ and y^ must be
“rotated” in the complex plane so that both sides of the inequality remain
real. This rotation will be done via a multiplier α.
Proof
Let x^ and y^ be defined as in the proof for real inner product
spaces.
If ⟨x^,y^⟩=0 the inequality holds, otherwise let
α:=∣⟨x^,y^⟩∣⟨y^,x^⟩
for which the following convenient properties hold
αα=1=ααα2⟨x^,y^⟩=∣⟨x^,y^⟩∣⟨y^,x^⟩⟨x^,y^⟩=∣⟨x^,y^⟩∣∣⟨x^,y^⟩∣2=∣⟨x^,y^⟩∣=α2⟨y^,x^⟩
The proof proceeds like with a real inner product space but using α,
02∣⟨x^,y^⟩∣2∥x∥∥y∥∣⟨x,y⟩∣∣⟨x,y⟩∣≤⟨αx^−αy^,αx^−αy^⟩=αα⟨x^,x^⟩−α2⟨x^,y^⟩−α2⟨y^,x^⟩+αα⟨y^,y^⟩≤⟨x^,x^⟩+⟨y^,y^⟩≤1+1≤∥x∥∥y∥
If both sides of the inequality are equal, linear dependence follows since
either x=0 or y=0 or
∥x∥αx+∥y∥αy=αx^−αy^=0
If x and y are linearly dependent, either x=λy or y=λx for some scaler λ, either way both sides of the inequality are
equal.
QED
References
1.
Steele JM. The Cauchy-Schwarz master class: An introduction to the art of mathematical inequalities. Cambridge ; New York: Cambridge University Press; 2004.
2.
DeGroot MH, Schervish MJ. Probability and statistics. 3rd ed. Boston: Addison-Wesley; 2002.