
tf-study-journal

a study journal for TensorFlow

study list

  1. Intro to TensorFlow for Deep Learning

March 19, 2020

ITTFDL: Lesson 5

  1. Memorizing is not learning! — 6 tricks to prevent overfitting in machine learning
  • Early Stopping: In this method, we track the loss on the validation set during the training phase and use it to determine when to stop training such that the model is accurate but not overfitting.
  • Image Augmentation: Artificially boosting the number of images in our training set by applying random image transformations to the existing images in the training set.
  • Dropout: Removing a random selection of a fixed number of neurons in a neural network during training. (All three tricks are sketched in the code below.)
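
A minimal tf.keras sketch (TF 2.x) of how these three tricks might look together; the image size, dropout rate, augmentation parameters, and the 'data/train' path are illustrative assumptions, not values from the course notebooks.

```python
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, (3, 3), activation='relu',
                           input_shape=(150, 150, 3)),
    tf.keras.layers.MaxPooling2D(2, 2),
    tf.keras.layers.Flatten(),
    # Dropout: randomly drop 20% of this layer's inputs on each training step.
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(2, activation='softmax'),
])
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

# Image augmentation: random transformations of the existing training images.
datagen = tf.keras.preprocessing.image.ImageDataGenerator(
    rescale=1.0 / 255, rotation_range=40,
    horizontal_flip=True, zoom_range=0.2)

# Early stopping: halt when validation loss stops improving for 3 epochs.
early_stop = tf.keras.callbacks.EarlyStopping(monitor='val_loss', patience=3)

# train_gen = datagen.flow_from_directory('data/train',  # hypothetical path
#                                         target_size=(150, 150),
#                                         class_mode='sparse')
# model.fit(train_gen, validation_data=val_gen, epochs=50,
#           callbacks=[early_stop])
```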

March 18, 2020

ITTFDL: Lesson 4

  1. A convolution is the process of applying a filter (“kernel”) to an image. Max pooling is the process of reducing the size of the image through downsampling.
  2. The model's compile step (see the CNN sketch after this list):
    • Loss function — An algorithm for measuring how far the model's outputs are from the desired output. The goal of training is to minimize this measure of loss.
    • Optimizer — An algorithm for adjusting the inner parameters of the model in order to minimize loss.
    • Metrics — Used to monitor the training and testing steps. The following example uses accuracy, the fraction of the images that are correctly classified.
  3. A Comprehensive Guide to Convolutional Neural Networks — the ELI5 way
  • CNNs: Convolutional neural network. That is, a network which has at least one convolutional layer. A typical CNN also includes other types of layers, such as pooling layers and dense layers.
  • Convolution: The process of applying a kernel (filter) to an image
  • Kernel / filter: A matrix which is smaller than the input, used to transform the input into chunks
  • Padding: Adding pixels of some value, usually 0, around the input image
  • Pooling: The process of reducing the size of an image through downsampling. There are several types of pooling layers. For example, average pooling converts many values into a single value by taking the average. However, max pooling is the most common.
  • Max pooling: A pooling process in which many values are converted into a single value by taking the maximum value from among them.
  • Stride: The number of pixels to slide the kernel (filter) across the image.
  • Downsampling: The act of reducing the size of an image
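
A minimal sketch (TF 2.x) tying the terms above to tf.keras layers, ending with the compile step from item 2; the 28x28 grayscale input and layer sizes are assumptions, not course code.

```python
import tensorflow as tf

model = tf.keras.Sequential([
    # Convolution: 32 kernels (filters) of size 3x3 slide across the image
    # with a stride of 1 pixel; 'same' padding adds zeros around the border
    # so the output keeps the input's height and width.
    tf.keras.layers.Conv2D(32, (3, 3), strides=1, padding='same',
                           activation='relu', input_shape=(28, 28, 1)),
    # Max pooling: downsample by keeping the maximum of each 2x2 window.
    tf.keras.layers.MaxPooling2D(pool_size=(2, 2), strides=2),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation='softmax'),
])

# The compile step: loss function, optimizer, and metrics.
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
model.summary()
```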

March 17, 2020

ITTFDL: Lesson 3

  1. The Rectified Linear Unit (ReLU): The ReLU function gives an output of 0 if the input is negative or zero; if the input is positive, the output is equal to the input. ReLU gives the network the ability to solve nonlinear problems.
  2. Rectified Linear Units (ReLU) in Deep Learning
  3. Build, Compile, Train (BCT; sketched in code after this list)
  • Flattening: The process of converting a 2-d image into a 1-d vector
  • ReLU: An activation function that allows a model to solve nonlinear problems
  • Softmax: A function that provides probabilities for each possible output class
  • Classification: A machine learning model used for distinguishing among two or more output categories
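
A minimal sketch (TF 2.x) of the BCT terms above; the 28x28 input and the 128-unit hidden layer are assumptions, not the lesson's exact numbers.

```python
import tensorflow as tf

model = tf.keras.Sequential([
    # Flattening: turn each 28x28 image (2-d) into a 784-element vector (1-d).
    tf.keras.layers.Flatten(input_shape=(28, 28, 1)),
    # ReLU: output max(0, x), letting the model fit nonlinear patterns.
    tf.keras.layers.Dense(128, activation=tf.nn.relu),
    # Softmax: one probability per output class, summing to 1.
    tf.keras.layers.Dense(10, activation=tf.nn.softmax),
])
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
```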

March 16, 2020

Reduce Loss

  1. The gradient descent algorithm then calculates the gradient of the loss curve at the starting point. The gradient of the loss is equal to the derivative (slope) of the curve, and tells you which way is "warmer" or "colder." When there are multiple weights, the gradient is a vector of partial derivatives with respect to the weights.
  2. Note that a gradient is a vector, so it has both of the following characteristics: 1) a direction and 2) a magnitude (see the sketch below).
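
A minimal sketch (TF 2.x) of one gradient descent step; the tiny linear model, data, and learning rate are illustrative assumptions.

```python
import tensorflow as tf

w = tf.Variable([1.0, -2.0])       # two weights
b = tf.Variable(0.0)               # one bias
x = tf.constant([[1.0, 2.0]])      # a single training example
y_true = tf.constant([[5.0]])
learning_rate = 0.1

with tf.GradientTape() as tape:
    y_pred = tf.matmul(x, tf.reshape(w, (2, 1))) + b
    loss = tf.reduce_mean(tf.square(y_true - y_pred))  # MSE loss

# The gradient is a vector of partial derivatives, one per weight; it has
# a direction (which way the loss increases) and a magnitude.
grad_w, grad_b = tape.gradient(loss, [w, b])

# Step "downhill": nudge each variable against its gradient.
w.assign_sub(learning_rate * grad_w)
b.assign_sub(learning_rate * grad_b)
print(grad_w.numpy(), grad_b.numpy(), loss.numpy())
```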

March 14, 2020

Intro to TensorFlow for Deep Learning

  1. Create a model:
    • Build a layer
    • Assemble layers into the model
  2. Compile the model, with loss and optimizer functions
  3. Train the model (all three steps are sketched below)
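
A minimal sketch (TF 2.x) of the three steps, using the classic Celsius-to-Fahrenheit toy regression as an assumed example task.

```python
import numpy as np
import tensorflow as tf

celsius = np.array([-40, -10, 0, 8, 15, 22, 38], dtype=float)
fahrenheit = np.array([-40, 14, 32, 46, 59, 72, 100], dtype=float)

# 1. Create a model: build a layer, then assemble layers into the model.
layer = tf.keras.layers.Dense(units=1, input_shape=[1])
model = tf.keras.Sequential([layer])

# 2. Compile the model, with loss and optimizer functions.
model.compile(loss='mean_squared_error',
              optimizer=tf.keras.optimizers.Adam(0.1))

# 3. Train the model.
model.fit(celsius, fahrenheit, epochs=500, verbose=0)
print(model.predict(np.array([100.0])))  # should be close to 212
```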

By now you should know what the following terms are:

  • Feature: The input(s) to our model
  • Example: An input/output pair used for training
  • Labels: The desired output for a given example; what the model is trained to produce
  • Layer: A collection of nodes connected together within a neural network.
  • Model: The representation of your neural network
  • Dense and Fully Connected (FC): Each node in one layer is connected to each node in the previous layer.
  • Weights and biases: The internal variables of the model
  • Loss: The discrepancy between the desired output and the actual output
  • MSE: Mean squared error, a type of loss function that counts a small number of large discrepancies as worse than a large number of small ones.
  • Gradient Descent: An algorithm that changes the internal variables a bit at a time to gradually reduce the loss function.
  • Optimizer: A specific implementation of the gradient descent algorithm. (There are many variants; this course uses only the “Adam” optimizer, short for Adaptive Moment Estimation, a widely used default choice.)
  • Learning rate: The “step size” for loss improvement during gradient descent.
  • Batch: The set of examples used in one iteration (one gradient update) of training
  • Epoch: A full pass over the entire training dataset
  • Forward pass: The computation of output values from input
  • Backward pass (backpropagation): The calculation of internal variable adjustments according to the optimizer algorithm, starting from the output layer and working back through each layer to the input.


License: MIT License