xxradon/darkflow

Update

Demo in webcam is available!. Use option --demo :)

YOLOv1 is up and running:

v1.0: yolo-full 1.1GB, yolo-small 376MB, yolo-tiny 180MB
v1.1: yolov1 789MB, tiny-yolo 108MB, tiny-coco 268MB, yolo-coco 937MB

YOLOv2 forward pass is up and running:

yolo 270MB, tiny-yolo-voc 63 MB.

TODO: training YOLOv2

Intro

This repo aims at building a tensorflow version of darknet framework, where YOLO (real time object detection & classification) is created. In fact, darkflow has a tensorflow backend and is compatible with both darknet files including binary .weights and configuration .cfg - who looks something like this:

...

[convolutional]
batch_normalize = 1
size = 3
stride = 1
pad = 1
activation = leaky

[maxpool]

[connected]
output = 4096
activation = linear

...

With darkflow, you can enjoy such ease of designing a deepnet as well as the rich and powerful ecosystem of tools that tensorflow has to offer. Currently, darkflow is built to sufficiently run YOLO(v1). For other net structures, new code will be added in a plug-and-play manner. Take a look at net/yolo/ or net/vanilla and see how this task should be quite simple.

Regarding bridging Darknet and Tensorflow for YOLO, there are currently some available repos online such as this and this. Unfortunately, they only provide hard-coded routines that allows translating YOLO's full/small/tiny configurations from Darknet to Tensorflow, and only build the forward pass. The awaited backward part is still not committed (because it needs a loss evaluation).

This is understandable since building the loss op of YOLO in tensorflow is not a trivial task, it requires careful computational considerations. But hey, I've got time to do that. Namely, we are now able to create new configurations and train them in GPU/CPU mode. Moreover, YOLO would not be completed if it is not running real-time (preferably on mobile devices), darkflow also allows saving the trained net to a constant protobuf object that can be used in Tensorflow C++ interface.

How to use it

Parsing the annotations

Skip this if you are not training or fine-tuning anything (you simply want to forward flow a trained net)

The only thing to do is specifying the classes you want to work with, write them down in the labels.txt file. For example, if you want to work with only 3 classes tvmonitor, person, pottedplant; edit labels.txt as follows

tvmonitor
person
pottedplant

And that's it. darkflow will take care of the parsing whenever necessary.

Design the net

Skip this if you are working with one of the three original configurations since they are already there.

In this step you create a configuration [config_name].cfg and put it inside cfg/. Take a look at some of the available configs there to know the syntax.

Note that these files, besides being descriptions of the net structures, also store technical specifications that is read by Darknet framework (e.g. learning rate, batch size, epoch number). darkflow therefore, ignore these Darknet specifications.

Flowing the graph using `flow`

# Have a look at its options
./flow --h

First, let's take a closer look at one of a very useful option --load

# 1. Load yolo-tiny.weights
./flow --model cfg/yolo-tiny.cfg --load bin/yolo-tiny.weights

# 2. To completely initialize a model, leave the --load option
./flow --model cfg/yolo-3c.cfg

# 3. It is useful to reuse the first identical layers of tiny for 3c
./flow --model cfg/yolo-3c.cfg --load bin/yolo-tiny.weights
# this will print out which layers are reused, which are initialized

More on --load later. All of the above flow commands essentially perform forward pass of the net. In fact, they flow all input images from default folder test/ through the net and draw predictions into test/out/. We can always specify more parameters for such forward passes, such as detection threshold, batch size, test folder, etc. Below is one example where the forward pass is told to utilize 100% GPU capacity:

# Forward all images in test/ using tiny yolo and 100% GPU usage
./flow --test test/ --model cfg/yolo-tiny.cfg --load bin/yolo-tiny.weights --gpu 1.0

Training is simple as you only have to add option --train like below:

# Initialize yolo-3c from yolo-tiny, then train the net on 100% GPU:
./flow --model cfg/yolo-3c.cfg --load bin/yolo-tiny.weights --train --gpu 1.0

# Completely initialize yolo-3c and train it with ADAM optimizer
./flow --model cfg/yolo-3c.cfg --train --trainer adam

During training, the script will occasionally save intermediate results into Tensorflow checkpoints, stored in ckpt/. Only the 20 most recent point are kept, you can change this number in the keep option, if keep = 0, no intermediate result is omitted.

To resume to any checkpoint before performing training/testing, use --load [checkpoint_num] option, if checkpoint_num < 0, darkflow will load the most recent save by parsing ckpt/checkpoint. Here are a few examples:

# Resume the most recent checkpoint for training
./flow --train --model cfg/yolo-3c.cfg --load -1

# Test with checkpoint at step 1500
./flow --model cfg/yolo-3c.cfg --load 1500

# Fine tuning yolo-tiny from the original one
./flow --train --model cfg/yolo-tiny.cfg --load bin/yolo-tiny.weights

You can even initialize new nets from ckpt files with --load:

./flow --train --model cfg/yolo-2c.cfg --load ckpt/yolo-3c-1500
# recollected and initialized layers will be printed to console

Migrating the graph to C++ and Objective-C++

Now this is the tricky part since there is no official support for loading variables in C++ API. Some suggest adding assigning ops from variable to constants into the graph and save it down as a .pb (protobuf) file like this. However this will double the necessary size of this file (or even triple if there is training ops), which is very undesirable in, say, building mobile applications.

There is an official way to do the same thing using this script provided in Tensorflow. I did not have the time to check its implementation and performance, however doing so would certainly require running on a separate script. darkflow allows freezing the graph on the fly, either during training or testing, without doubling/tripling the necessary size.

## Saving the lastest checkpoint to protobuf file
./flow --model cfg/yolo-3c.cfg --load -1 --savepb

For further usage of this protobuf file, please refer to the official documentation of Tensorflow on C++ API here. To run it on, say, iOS application, simply add the file to Bundle Resources and update the path to this file inside source code.

That's all

xxradon / darkflow

Update

Intro

How to use it

Parsing the annotations

Design the net

Flowing the graph using `flow`

Migrating the graph to C++ and Objective-C++

About

Languages

Update

Intro

How to use it

Parsing the annotations

Design the net

Flowing the graph using flow

Migrating the graph to C++ and Objective-C++

About

Languages

Flowing the graph using `flow`