Trained Deep Neural Network

From GM-RKB

(Redirected from Trained Deep NNet)

Jump to navigation Jump to search

A Trained Deep Neural Network is a trained neural network that is a deep neural network.

Context:
- It can (often) be an Overparameterized Model.
Example(s):
- an ONNX Format NNet, such as a Trained GPT-2 ONNX Model (GPT-2) [1].
- …
Counter-Example(s):
- a Trained Gradient Boosted Tree.
- …
See: Over-Parameterized Model, Neural Network File.

References

2020

https://github.com/onnx/models
- QUOTE: ... The ONNX Model Zoo is a collection of pre-trained, state-of-the-art models in the ONNX format contributed by community members like you. Accompanying each model are Jupyter notebooks for model training and running inference with the trained model. The notebooks are written in Python and include links to the training dataset as well as references to the original paper that describes the model architecture. ...

2019

(Frankle & Carbin, 2019) ⇒ Jonathan Frankle, and Michael Carbin. (2019). “The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks.” In: International Conference on Learning Representations.
- QUOTE: ... In practice, neural networks tend to be dramatically overparameterized. Distillation (Ba & Caruana, 2014; Hinton et al., 2015) and pruning (LeCun et al., 1990; Han et al., 2015) rely on the fact that parameters can be reduced while preserving accuracy. Even with sufficient capacity to memorize training data, networks naturally learn simpler functions (Zhang et al., 2016; Neyshabur et al., 2014; Arpit et al., 2017). Contemporary experience (Bengio et al., 2006; Hinton et al., 2015; Zhang et al., 2016) and Figure 1 suggest that overparameterized networks are easier to train. We show that dense networks contain sparse subnetworks capable of learning on their own starting from their original initializations. Several other research directions aim to train small or sparse networks. ...

Retrieved from "http://www.gabormelli.com/RKB/index.php?title=Trained_Deep_Neural_Network&oldid=765908"

Concept