
CHAPTER 5. MACHINE LEARNING BASICS
learning has recently begun to have an important impact on this kind of
task (Sutskever et al., 2014a; Bahdanau et al., 2014).
• Structured output tasks involve any task where the output is a vector con-
taining important relationships between the different elements. This is a
broad category, and includes the transcription and translation tasks de-
scribed above, but also many other tasks. One example is parsing—mapping
a natural language sentence into a tree that describes its grammatical struc-
ture and the relative role of its constituents. See Collobert (2011) for an
example of deep learning applied to a parsing task. Another example is
pixel-wise segmentation of images, where the computer program assigns ev-
ery pixel in an image to a specific category. For example, deep learning can
be used to annotate the locations of roads in aerial photographs (Mnih and
Hinton, 2010). The output need not have its form mirror the structure of
the input as closely as in these annotation-style tasks. For example, in an
image captioning, the computer program observes an image and outputs a
natural language sentence describing the image (Kiros et al., 2014a,b; Mao
et al., 2015; Vinyals et al., 2015; Donahue et al., 2014; Karpathy and Li,
2015; Fang et al., 2015; Xu et al., 2015a). These tasks are called struc-
tured output tasks because the program must output several values that
are all tightly inter-related. For example, the words produced by an image
captioning program must form a valid sentence.
• Anomaly detection: In this type of task, the computer program sifts through
a set of events or objects, and flags some of them as being unusual or atypi-
cal. An example of an anomaly detection task is credit card fraud detection.
By modeling your purchasing habits, a credit card company can detect mis-
use of your cards. If a thief steals your credit card or credit card information,
the thief’s purchases will often come from a different probability distribution
over purchase types than your own. The credit card company can prevent
fraud by placing a hold on an account as soon as that card has been used
for an uncharacteristic purchase.
• Synthesis and sampling: In this type of task, the machine learning algorithm
is asked to generate new examples that are similar to those in the training
data. This can be useful for media applications where it can be expensive
or boring for an artist to generate large volumes of content by hand. For ex-
ample, video games can automatically generate textures for large objects or
landscapes, rather than requiring an artist to manually label each pixel (Luo
et al., 2013). In some cases, we want the sampling or synthesis procedure
to generate some specific kind of output given the input. For example, in a
speech synthesis task, we provide a written sentence and ask the program
95