
INDEX
No free lunch theorem, 115
Noise-contrastive estimation, 620
Non-parametric model, 113
Norm, ix, 38
Normal distribution, 62, 63, 124
Normal equations, 108, 111, 233
Normalized initialization, 302
Numerical differentiation, see finite differences
Object detection, 451
Object recognition, 451
Objective function, 81
OMP-k, see orthogonal matching pursuit
One-shot learning, 536
Operation, 203
Optimization, 79, 81
Orthodox statistics, see frequentist statistics
Orthogonal matching pursuit, 26, 253
Orthogonal matrix, 41
Orthogonality, 40
Output layer, 167
Parallel distributed processing, 17
Parameter initialization, 299, 405
Parameter sharing, 250, 334, 372, 374, 388
Parameter tying, see parameter sharing
Parametric model, 113
Parametric ReLU, 192
Partial derivative, 83
Partition function, 566, 604, 669
PCA, see principal components analysis
PCD, see stochastic maximum likelihood
Perceptron, 15, 26
Persistent contrastive divergence, see stochastic maximum likelihood
Perturbation analysis, see reparametrization trick
Point estimator, 121
Policy, 478
Pooling, 329, 685
Positive definite, 88
Positive phase, 468, 605, 608, 656, 668
Precision, 422
Precision (of a normal distribution), 62, 64
Predictive sparse decomposition, 521
Preprocessing, 451
Pretraining, 322, 526
Primary visual cortex, 363
Principal components analysis, 47, 146, 147, 488, 631
Prior probability distribution, 134
Probabilistic max pooling, 685
Probabilistic PCA, 488, 489, 632
Probability density function, 57
Probability distribution, 55
Probability mass function, 55
Probability mass function estimation, 102
Product of experts, 568
Product rule of probability, see chain rule of probability
PSD, see predictive sparse decomposition
Pseudolikelihood, 615
Quadrature pair, 369
Quasi-Newton condition, 315
Quasi-Newton methods, 315
Radial basis function, 195
Random search, 433
Random variable, 55
Ratio matching, 618
RBF, 195
RBM, see restricted Boltzmann machine
Recall, 422
Receptive field, 336
Recommender systems, 475
Rectified linear unit, 171, 192, 424, 505
Recurrent network, 26
Recurrent neural network, 377
Regression, 99
Regularization, 119, 177, 227, 429
Regularizer, 118
REINFORCE, 691
Reinforcement learning, 24, 105, 478, 691
Relational database, 481
Reparametrization trick, 690
Representation learning, 3
Representational capacity, 113