`Deep Gradient Learning for Efficient Camouflaged Object Detection`

Authors: Ge-Peng Ji, Deng-Ping Fan, Yu-Cheng Chou, Dengxin Dai, Alexander Liniger, & Luc Van Gool.

This repository contains the source code, prediction results, and evaluation toolbox of our Deep Gradient Network, also called DGNet. The technical report could be found at arXiv.

1. Features

Figure 1: We present the scatter relationship between the performance weighted F-measure and parameters of all competitors on CAMO-Test. These scatters are in various colors for better visual recognition and are also corresponding to the histogram (Right). The larger size of the coloured scatter point, the heavier the model parameter. (Right) We also report the parallel histogram comparison of model's parameters, MACs, and performance.

Novel supervision. We propose to excavate the texture information via learning the object level gradient rather than using boundary-supervised or uncertainty-aware modeling.
Simple but efficient. We decouple all the heavy designs as much as we can, yielding a simple but efficient framework. We hope this framework could be served as a baseline learning paradigm for the COD field.
Best trade-off. Our vision is to achieve new SOTA with the best performance-efficiency trade-off on existing cutting-edge COD benchmarks.

2. 🔥 NEWS 🔥

[2022/05/30] 🔥 We release the implementation of DGNet with different AI frameworks: Pytorch-based and Jittor-based.
[2022/05/30] Thank @Katsuya Hyodo for adding our model into PINTO. This is a repository for storing models that have been inter-converted between various frameworks (e.g., TensorFlow, PyTorch, ONNX).
[2022/05/25] Releasing the codebase of DGNet (Pytorch) and whole COD benchmarking results (20 models).
[2022/05/23] Creating repository.

This project is still work in progress, and we invite all to contribute in making it more acessible and useful. If you have any questions about our paper, feel free to contact me via e-mail (gepengai.ji@gmail.com & johnson111788@gmail.com & dengpfan@gmail.com). And if you are using our code and evaluation toolbox for your research, please cite this paper (BibTeX).

3. Proposed Framework

3.1. Overview

Figure 2: Overall pipeline of the proposed DGNet, It consists of two connected learning branches, i.e., context encoder and texture encoder. Then, we introduce a gradient-induced transition (GIT) to collaboratively aggregate the feature that is derived from the above two encoders. Finally, a neighbor connected decoder (NCD [1]) is adopted to generate the prediction.

Figure 3: Illustration of the proposed gradient-induced transition (GIT). It use a soft grouping strategy to provide parallel nonlinear projections at multiple fine-grained sub-spaces, which enables the network to probe multi-source representations jointly.

References of neighbor connected decoder (NCD) benchmark works [1] Concealed Object Detection. TPAMI, 2022.

3.2. Usage

The training and testing experiments are conducted using PyTorch and Jittor libraries with a single GeForce RTX TITAN GPU. Note that we only report the results of the Pytorch-based DGNet in our manuscript.

For the Pytorch usage, please refer to our pytorch_lib.
For the Jittor usage, please refer to our jittor_lib.

3.3 Evaluation

One-key evaluation is written in MATLAB code ./eval/matlab/, please follow this the instructions in ./eval/matlab/main.m and just run it to generate the evaluation results in ./eval-result/.

3.4 COD Benchmark Results:

The prediction of our DGNet and DGNet-S can be found in Pytorch / Jitror. The whole benchmark results can be found at OneDrive. Here are quantitative performance comparison from three perspectives.

Figure 4: Quantitative results in terms of full metrics for cutting-edge competitors, including 8 SOD-related and 12 COD-related, on three test datasets: NC4K-Test, CAMO-Test, and COD10K-Test. @R means the ranking of the current metric, and Mean@R indicates the mean ranking of all metrics.

Figure 5: Super-classes (i.e., Amphibian, Aquatic, Flying, Terrestrial, and Other) on the COD10K-Test of the proposed methods (DGNet & DGNet-S) and other 20 competitors. Symbol \uparrow indicates the higher the score, the better, and symbol \downarrow indicates the lower, the better. The best score is marked with bold.

Figure 6: Sub-class results on COD10K-Test of 12 COD-related and 8 SOD-related baselines in terms of structure measure (\mathcal{S}_\alpha), where Am., Aq., Fl., Te., and Ot. represent Amphibian, Aquatic, Flying, Terrestrial, and Other, respectively. CDL., GP.Fish, and LS.Dragon denote Crocodile, and GhostPipeFish, LeafySeaDragon, respectively. The best score is marked with bold.

4. Citation

Please cite our paper if you find the work useful:

@article{ji2022gradient,
  title={Deep Gradient Learning for Efficient Camouflaged Object Detection},
  author={Ji, Ge-Peng and Fan, Deng-Ping and Chou, Yu-Cheng and Dai, Dengxin and Liniger, Alexander and Van Gool, Luc},
  journal={arXiv},
  year={2022}
}

mczhuge / DGNet