This part of the book summarizes the state of modern deep learning as it is

used to solve practical applications.

Deep learning has a long history and many aspirations. Several approaches

have been proposed that have yet to entirely bear fruit. Several ambitious goals

have yet to be realized. These less-developed branches of deep learning appear in

the ﬁnal part of the book.

This part focuses only on those approaches that are essentially working tech-

nologies that are already used heavily in industry.

Modern deep learning provides a very powerful framework for supervised learn-

ing. By adding more layers and more units within a layer, a deep network can

represent functions of increasing complexity. Most tasks that consist of mapping

an input vector to an output vector, and that are easy for a person to do rapidly,

can be accomplished via deep learning, given suﬃciently large model and dataset

of labeled training examples. Other tasks, that can not be described as associat-

ing one vector to another, or that are diﬃcult enough that to do them a person

would require time to think and reﬂect, remain beyond the scope of deep learning

for now.

This part of the book describes the core parametric function approximation

technology that is behind nearly all modern practical applications of deep learn-

ing. Our description includes details such as, how to eﬃciently model speciﬁc

kinds of inputs, how to process image inputs with convolutional networks as well

as how to process sequence inputs with recurrent and recursive networks. More-

over, we provide guidance for how to preprocess the data for various tasks and

how to choose the values of the various settings that govern the behavior of these

algorithms.

These chapters are the most important for a practitioner – someone who wants

to begin implementing and using deep learning algorithms to solve real-world

problems today.

148