kanhua / a-cv-engineer-prepares

Curated list of the learning resources of a computer vision engineer

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

A (Tier-5) Computer Vision Engineer Prepares

I am trying to put together a list that in my opinion is essential to a computer vision engineer. As I am still working towards a higher tier of a engineer (see below for the definition what "rank" refers to here), I don't really know what really makes a CV engineer with Tier higher than 3. I will try to share it once I know.

Digression on the title

(Chinese: 一個電腦視覺工程師的自我修養) This title was come from "An Actor Prepares". Because of Steven Chu's movie "[King of Comedy](https://en.wikipedia.org/wiki/King_of_Comedy_(film)", this book title become well-known in Chinese culture zone.

Ranking computer scientists or engineers

"Landau genius scale" or "Landau's ranking" [link] is a ranking system of physicists maintained by Lev Landau.

In "The Essence of computing" by Wu Jun (Chinese, currently there's no English version yet), Wu adopted similar concepts to ranking computer scientist or engineers by their impacts.

  1. Tier 1: Being able to create a whole new industry or lay out a foundation of an scientific already. Wu ranked Donald E. Knuth as this tier.
  2. Tier 2: Being able to make significant contributions to the theories or realize the products that other engineers cannot
  3. Tier 3: Being able to an unsolved problems and make successful engineering products in the markets
  4. Tier 4: Being able to lead a team to solve engineering problem using state-of-the art solutions.
  5. Tier 5: Being able to solve an engineering problem independently.

Deep learning

Concepts

Basics of machine learning

  • Machine learning based classifier
  • Regularization
  • Stochastic Gradient Decent

Basic concepts of neural network

  • Concepts neural networks
  • Fully-connected layers
  • Convolutional network
  • Backpropagation
  • Optimizer

More advanced concepts

  • Dropout
  • Batch normalization
  • Attention

Architectures for solving the specific topics

  • Well-known architectures: such as VGG or ResNet
  • Image classification
  • Fine-tuning
  • Object detection
  • Image Segmentation

Learning materials

Mini projects to try

Dataset that is small enough and already hosted on Kaggle, so that it is possible to leverage on Kaggle's free GPUs.

Computer Vision

Good Practices of Software Engineering

Soft skills

Conducting literature review

Reporting

About

Curated list of the learning resources of a computer vision engineer

License:MIT License