Klanke, Stefan: Learning manifolds with the Parametrized Self-Organizing Map and Unsupervised Kernel Regression. 2007
Inhalt
- Introduction
- Motivation
- Outline and contributions
- Supervised learning
- Statistical model
- Empirical error minimization and regularization
- Bias variance dilemma
- Model selection
- Unsupervised learning
- Further concepts
- Related Methods
- Vector quantization and clustering
- Principal Component Analysis
- Auto-associative neural networks
- The Self-Organizing Map
- Principal curves
- Principal surfaces
- Pointwise embedding methods
- Multi-dimensional scaling
- The Sammon mapping
- Curvilinear Component Analysis
- Curvilinear Distance Analysis
- Nonlinear spectral embedding methods
- The Parametrized Self-Organizing Map and extensions
- Original formulation
- Application in kinematics learning
- PSOM+ extensions
- PSOM+ model of PA-10 kinematics
- Unsupervised learning of manifolds with the PSOM+
- Discussion
- Unsupervised Kernel Regression
- The Nadaraya-Watson estimator
- Derivation of UKR
- Regularization approaches
- Optimizing a UKR model
- Gradient of the reconstruction error
- Spectral initialization
- Homotopy-based optimization
- Projection of new data
- Summary of the procedure
- Feature space variant
- Experiments
- ``Noisy spiral'' data
- Homotopy and penalty terms: S-shaped triangle data
- USPS handwritten digits
- "Oil flow" data
- Discussion
- Extensions to Unsupervised Kernel Regression
- UKR with general loss functions
- Loss functions used in this work
- Including general loss functions with UKR
- Optimization scheme for the -insensitive loss
- Experiments
- Relation to feature space UKR
- Discussion
- UKR with leave-K-out cross-validation
- Landmark UKR
- Unsupervised Local Polynomial Regression
- Conclusion
- Mathematical notation
- UKR gradient and computational complexity
- Derivative calculations for Unsupervised Local Polynomial Regression
- Gradient of the reconstruction error
- Local Constant Estimate
- Local Linear Estimate
- Local Quadratic Estimate (w/o cross-terms)
- Local Quadratic Estimate (q=2)
- Gradient with respect to the scale
- Gradient calculation for the landmark variant
- Jacobi matrix for projecting new data
- References
