Description
Problems
Theory
1. Suppsoe ฮ : ๐ณ ร๐ณ โ R is a PDS kernel. Prove that โ๐ฅ, ๐ฅโฒ โ ๐ณ it holds that |ฮ(๐ฅ, ๐ฅโฒ
)|
2 โค ฮ(๐ฅ, ๐ฅ)ฮ(๐ฅ
โฒ
, ๐ฅโฒ
).
2. Given a kernel ๐พ on ๐ณ define its normalized version as
๐พยฏ (๐ฅ, ๐ฅโฒ
) =
โง
โชโจ
โชโฉ
0 if ๐พ(๐ฅ, ๐ฅ) = 0 or ๐พ(๐ฅ
โฒ
, ๐ฅโฒ
) = 0
๐พ(๐ฅ, ๐ฅโฒ
)
โ๏ธ
๐พ(๐ฅ, ๐ฅ)
โ๏ธ
๐พ(๐ฅ
โฒ
, ๐ฅโฒ)
Otherwise.
Show that if ๐พ is PDS then so is ๐พยฏ .
3. Show that the following kernels on R
๐ are PDS:
โข Polynomial kernel: ๐พ(๐ฅ, ๐ฅโฒ
) = (๏ธ
๐ฅ
๐ ๐ฅ
โฒ + ๐
)๏ธ๐ผ
for ๐ > 0 and ๐ผ โ N.
โข Exponential kernel: ๐พ(๐ฅ, ๐ฅโฒ
) = exp(๐ฅ
๐ ๐ฅ
โฒ
).
โข RBF kernel: ๐พ(๐ฅ, ๐ฅโฒ
) = exp(โ๐พ
2โ๐ฅ โ ๐ฅ
โฒโ
2
2
).
4. Let ฮฉ โ R
๐ and let {๐๐}
๐
๐=1 be a sequence of continuous functions on ฮฉ and {๐๐}
๐
๐=1 a sequence of
non-negative numbers. Show that ๐พ(๐ฅ, ๐ฅโฒ
) = โ๏ธ๐
๐=1 ๐๐๐๐ (๐ฅ)๐๐ (๐ฅ
โฒ
) is a PDS kernel on ฮฉ.
5. Show that: (i) if ๐พ and ๐พโฒ are two reproducing kernels for an RKHS โ, then they have to be the
same. (ii) the RKHS of a PDS kernel ๐พ is unique.
Computation
Download the MNIST training and test .csv files from Canvas and load them on your computer. I suggest
you use Python or MATLAB for this excercise.
โข Use Principle Component Analysis (PCA) on the training set to reduce the dimension of your input.
How many modes do you need to preserve 95% of the variance in the training set?
โข Extract the digits 1 and 9 from the training set. Use kernel regression to design and train a classifier
to distinguish these digits using three different kernels of your choosing (I suggest RBF, Polynomila,
and linear). It is a good idea to use PCA to reduce your input dimensions here. Also, you may use
cross validation to tune your kernel/regularization/nugget parameters if you need them. Present the
training and test error of your method.
โข Repeat the above experiment for the digits (3, 8), (1, 7), and (5, 2).
โข Write a report of a maximum of four pages, outlining your results and findings.