A multi-layer perceptron (MLP) consists of at least three sets of nodes: an input layer, one or more hidden layers, and an output layer. Each node, except for the input nodes, is a neuron that uses a nonlinear activation function. The multiple layers and non-linearities allow a trained MLP to distinguish data that is not linearly separable.
In this example, we create an MLP that classifies handwritten digits from the MNIST dataset.
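Before the model sees them, the 28×28 MNIST images are flattened into vectors and the digit labels are one-hot encoded. The sketch below shows roughly what that preparation can look like; it assumes the MLDatasets package for the data and is illustrative rather than the exact code from mlp_mnist.jl.

```julia
# Minimal sketch of loading and reshaping MNIST for an MLP
# (assumes MLDatasets; not the exact code from mlp_mnist.jl).
using Flux, MLDatasets

# Load the training split: 28×28 grayscale images and digit labels 0–9.
train_data = MLDatasets.MNIST(split=:train)

# Flatten each 28×28 image into a 784-element column vector,
# and one-hot encode the labels for a 10-class classifier.
x_train = Flux.flatten(train_data.features)          # 784 × 60000 matrix
y_train = Flux.onehotbatch(train_data.targets, 0:9)  # 10 × 60000 one-hot matrix
```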
Our model uses the simplest Flux layers, namely Dense and Chain. It applies softmax to its outputs and uses crossentropy as the loss function.
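The sketch below shows what such a model might look like in Flux; the layer sizes (784 inputs, 32 hidden units, 10 outputs) are assumptions for illustration and may differ from the settings in mlp_mnist.jl.

```julia
# Rough sketch of the model and its loss; layer sizes are assumptions,
# not necessarily those used in mlp_mnist.jl.
using Flux

model = Chain(
    Dense(784 => 32, relu),  # hidden layer with a nonlinear activation
    Dense(32 => 10),         # output layer, one neuron per digit class
    softmax,                 # turn the outputs into class probabilities
)

# Crossentropy compares the predicted probabilities with one-hot labels.
loss(x, y) = Flux.crossentropy(model(x), y)
```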
For simplicity, this model does not use a graphics card, since an ordinary CPU is fast enough. See, for example, the LeNet convolutional network for GPU usage.
You can copy and paste the example into the Julia REPL to see what each part does. Or you can run it all at once from the terminal, like this:
```bash
cd vision/mlp_mnist
julia --project mlp_mnist.jl
```