
INDEX
Conditional Computation in Neural Nets,
365
Conditional independence, vi, 52
Conditional probability, 51
Connectionism, 15, 338
Connectionist temporal classiļ¬cation, 318
consistency, 118
Constrained optimization, 85
Context-speciļ¬c independence, 385
Continuation methods, 245
Contractive auto-encoder, 427, 476
Contractive autoencoders, 407
Contrast, 345
Contrastive divergence, 482, 531, 534
Convolution, 247, 538
Convolutional network, 14
Convolutional neural network, 247
Coordinate descent, 240, 534
Correlation, 53
Cost function, see objective function
Covariance, vi, 53
Covariance matrix, 54
Cross entropy, 119, 156
Cross-correlation, 249
Cross-validation, 109
CTC, see connectionist temporal classiļ¬ca-
tion
Curriculum-learning, 245
curse of dimensionality, 135
Cyc, 2
D-separation, 384
DAE, see denoising auto-encoder
Data generating distribution, 100
Data generating process, 100
Data parallelism, 341
Dataset, 94
Dataset augmentation, 345, 349
DBM, see deep Boltzmann machine
Decision trees, 449
Decoder, 4
Deep belief network, 23, 500, 512, 521, 539
Deep Blue, 2
Deep Boltzmann machine, 20, 23, 500, 512,
524, 534, 539
Deep learning, 1, 5
Denoising auto-encoder, 421
Denoising autoencoders, 180
Denoising score matching, 490
Density estimation, 93
Derivative, vi, 76
Design matrix,
boldindex96
Detector layer, 255
Diagonal matrix, 36
Dirac delta function, 60
Directed graphical model, 66, 376
Directional derivative, 80
Distributed Representation, 448
Distributed representation, 15
domain adaptation, 438
Dot product, 30
Doubly block circulant matrix, 249
Dream sleep, 481, 510
DropConnect, 220
Dropout, 180, 217, 333, 334, 534
Dynamic structure, 343
Dynamically structured networks, 343
E-step, 503
Early stopping, 166, 207, 209ā212
EBM, see energy-based model
Echo state network, 20, 23, 305
Eļ¬ective number of parameters, 195
Eļ¬ciency, 121
Eigendecomposition, 37
Eigenvalue, 37
Eigenvector, 37
ELBO, see evidence lower bound
Element-wise product, see Hadamard prod-
uct, see Hadamard product
EM, see expectation maximization
Embedding, 464
Empirical distribution, 60
Empirical risk, 226
Empirical risk minimization, 226
Encoder, 4
Energy function, 382
Energy-based model, 382, 524
Ensemble methods, 215
Epoch, 227, 236
Equality constraint, 86
594