
INDEX
Conditional computation, see dynamic structure
Conditional independence, vi, 55
Conditional probability, 53
Connectionism, 15, 373
Connectionist temporal classification, 346
Consistency, 122
Constrained optimization, 88
Context-specific independence, 423
Continuation methods, 269
Contractive auto-encoder, 465, 514
Contractive autoencoders, 445
Contrast, 382
Contrastive divergence, 520, 569, 572
Convolution, 272, 576
Convolutional network, 14
Convolutional neural network, 222, 272
Coordinate descent, 261, 572
Correlation, 56
Cost function, see objective function
Covariance, vi, 55
Covariance matrix, 56
Cross entropy, 59, 163
Cross-correlation, 274
Cross-validation, 113
CTC, see connectionist temporal classification
Curriculum-learning, 271
Curse of dimensionality, 143
Cyc, 2
D-separation, 422
DAE, see denoising auto-encoder
Data generating distribution, 103, 122
Data generating process, 103
Data parallelism, 376
Dataset, 97
Dataset augmentation, 382, 387
DBM, see deep Boltzmann machine
Decision tree, 136
Decision trees, 487
Decoder, 4
Deep belief network, 24, 538, 550, 559, 577
Deep Blue, 2
Deep Boltzmann machine, 21, 24, 538, 550, 562, 572, 577
Deep learning, 1, 5
Denoising auto-encoder, 459
Denoising autoencoders, 186
Denoising score matching, 528
Density estimation, 96
Derivative, vi, 79
Design matrix, 99
Detector layer, 280
Diagonal matrix, 36
Dirac delta function, 63
Directed graphical model, 69, 414
Directional derivative, 83
Distributed representation, 15, 486
Domain adaptation, 476
Dot product, 31
Double exponential distribution, see Laplace distribution
Doubly block circulant matrix, 276
Dream sleep, 519, 548
DropConnect, 229
Dropout, 186, 226, 364, 365, 572
Dynamic structure, 378
E-step, 541
Early stopping, 215–219
EBM, see energy-based model
Echo state network, 21, 24, 331
Effective number of parameters, 201
Efficiency, 125
Eigendecomposition, 37
Eigenvalue, 38
Eigenvector, 38
ELBO, see evidence lower bound
Element-wise product, see Hadamard product
EM, see expectation maximization
Embedding, 502
Empirical distribution, 63
Empirical risk, 235
Empirical risk minimization, 235