OverFeat is a Convolutional Network-based image classifier and feature extractor.

Last update: Dec 8, 2022

Related tags

Deep Learning OverFeat

Overview

OverFeat

OverFeat is a Convolutional Network-based image classifier and feature extractor.

OverFeat was trained on the ImageNet dataset and participated in the ImageNet 2013 competition.

This package allows researchers to use OverFeat to recognize images and extract features.

A library with C++ source code is provided for running the OverFeat convolutional network, together with wrappers in various scripting languages (Python, Lua, Matlab coming soon).

OverFeat was trained with the Torch7 package ( http://www.torch.ch ). The OverFeat package provides tools to run the network in a standalone fashion. The training code is not distributed at this time.

CREDITS, LICENSE, CITATION

OverFeat is Copyright NYU 2013. Authors of the present package are Michael Mathieu, Pierre Sermanet, and Yann LeCun.

The OverFeat system is by Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, and Yann LeCun.

Please refer to the LICENSE file in the same directory as the present file for licensing information.

If you use OverFeat in your research, please cite the following paper:

"OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks", Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, Yann LeCun http://arxiv.org/abs/1312.6229

INSTALLATION:

Download the archive from http://cilvr.cs.nyu.edu/doku.php?id=software:overfeat:start

Extract the files:

tar xvf overfeat-vXX.tgz
cd overfeat

Overfeat uses external weight files. Since these files are large and do not change often, they are not included in the archive. We provide a script to automatically download the weights :

./download_weights.py

The weight files should be in the folder data/default in the overfeat directory.

Overfeat can run without BLAS, however it would be very slow. We strongly advice you to install openblas on linux (on MacOS, Accelerate should be available without any installation). On Ubuntu/Debian you should compile it (it might take a while, but it is worth it) :

sudo apt-get install build-essential gcc g++ gfortran git libgfortran3
cd /tmp
git clone https://github.com/xianyi/OpenBLAS.git
cd OpenBLAS
make NO_AFFINITY=1 USE_OPENMP=1
sudo make install

For some reason, on 32 bits Ubuntu, libgfortran doesn't create the correct symlink. If you have issues linking with libgfortran, locate where libgfortran is installed (for instance /usr/lib/i386-linux-gnu) and create the correct symlink :

cd 
   
    
sudo ln -sf libgfortran.so.3 libgfortran.so

The precompiled binaries use BLAS. If you don't want to (or can't, for some reason) use BLAS, you must recompile overfeat.

RUNNING THE PRE-COMPILED BINARIES

Pre-compiled binaries are provided for Ubuntu Linux (32 bits and 64 bits) and Mac OS. The pre-requisites are python and imagemagick, which are installed by default on most popular Linux distros.

Important note: OverFeat compiled from source on your computer will run faster than the pre-compiled binaries.

Example of image classification, printing the 6 highest-scoring categories:

bin/YOUR_OS/overfeat -n 6 samples/bee.jpg

where YOUR_OS can be either linux_64, linux_32, or macos.

Running the webcam demo:

bin/YOUR_OS/webcam

GPU PRE-COMPILED BINARIES (EXPERIMENTAL)

We are providing precompiled binaries to run overfeat on GPU. Because the code is not released yet, we do not provide the source for now. The GPU release is experimental and for now only runs on linux 64bits. It requires a Nvidia GPU with CUDA architecture >= 2.0 (that covers all recent GPUs from Nvidia).

You will need openblas to run the GPU binaries.

The binaries are located in

bin/linux_64/cuda

And work the same way as the CPU versions. You can include the static library the same way as the CPU version.

COMPILING FROM SOURCE

Install dependencies : python, imagemagick, git, gcc, cmake (pkg-config and opencv required for the webcam demo). On Ubuntu/Debian :

apt-get install g++ git python imagemagick cmake

For the webcam demo :

apt-get install pkg-config libopencv-dev libopencv-highgui-dev

Here are the instructions to build the OverFeat library and tools:

Go to the src folder :

cd src

Build the tensor library (TH), OverFeat and the command-line tools:

make all

Build the webcam demo (OpenCV required) :

make cam

On Mac OS, the default gcc doesn't support OpenMP. We strongly recommend to install a gcc version with OpenMP support. With MacPort :

sudo port install gcc48

Which will provide g++-mp-48 . If you don't install this version, you will have to change the two corresponding lines in the Makefile.

UPDATING

A git repository is provided with the archive. You can update by typing

git pull

from the overfeat directory.

HIGH LEVEL INTERFACE:

The feature extractor requires a weight file, containing the weights of the network. We provide a weight file located in data/default/net_weight . The software we provide should be able to locate it automatically. In case it doesn't, the option -d can be used to manually provide a path.

Overfeat can use two sizes of network. By default, it uses the smaller one. For more accuracy, the option -l can be used to use a larger, but slower, network.

CLASSIFICATION:

In order to get the top (by default, =5) classes from a number of images :

bin/linux_64/overfeat [-n 
   
    ] [-d 
    
     ] [-l] path_to_image1 [path_to_image2 [path_to_image3 [... ] ] ]

To use overfeat online (feeding an image stream), feed its stdin stream with a sequence of ppm images (ended by end of file ('\0') character). In this case, please use option -p. For instance :

convert image1.jpg image2.jpg -resize 231x231 ppm:- | ./overfeat [-n 
   
    ] [-d 
    
     ] [-l] -p

Please note that to get the classes from an image, the image size should be 231x231. The image will be cropped if one dimension is larger than 231, and the network won't be able to work if both dimension are larger. For feature extraction without classification, it can be any size greater or equal to 231x231 for the small network, and 221x221 for the large network .

FEATURE EXTRACTION:

In order to extract the features instead of classifying, use -f option. For instance :

bin/linux_64/overfeat [-d 
   
    ] [-l] -f image1.png image2.jpg

It is compatible with option -p. The option -L (overrides -f) can be used to return the output of any layer. For instance

bin/linux_64/overfeat [-d 
   
    ] [-l] -L 12 image1.png

returns the output of layer 12. The option -f corresponds to layer 19 for the small layer and 22 for the large one.

It writes the features on stdout as a sequence. Each feature starts with three integers separated by spaces, the first is the number of features (n), the second is the number of rows (h) and the last is the number of columns (w). It is followed by a end of line ('\n') character. Then follows nhw floating point numbers (written in ascii) separated by spaces. The feature is the first dimension (so that to obtain the next feature, you must add wh to your index), followed by the row (to obtain the next row, add w to your index). That means that if you want the features corresponding to the top-left window, you need to read pixels ih*w for i=0..4095 .

The output is going to be a 3D tensor. The first dimension correspond to the features, while dimensions 2 and 3 are spatial (y and x respectively). The spatial dimension is reduced at each layer, and with the default network, using option -f, the output has size nFeatures * h * w where

for the small network,
- nFeatures = 4096
- h = ((H-11)/4 + 1)/8-6
- w = ((W-11)/4 + 1)/8-6
for the large network,
- nFeatures = 4096
- h = ((H-7)/2 + 1)/18-5
- w = ((W-7)/2 + 1)/18-5 if the input has size 3HW . Each pixel in the feature map corresponds to a localized window in the input. With the small network, the windows are 231x231 pixels, overlapping so that the i-th window begins at pixel 32i, while for the large network, the windows are 221x221, and the i-th window begins at pixel 36i.

WEBCAM:

We provide a live classifier based on the webcam. It reads images from the webcam, and displays the most likely classes along with the probabilities. It can be run with

bin/linux_64/webcam [-d 
   
    ] [-l] [-w 
    
     ]

BATCH:

We also provide an easy way to process a whole folder :

./bin/linux_64/overfeat_batch [-d 
   
    ] [-l] -i 
    
      -o

It process each image in the input folder and produces a corresponding file in the output directory, containing the features,in the same format as before.

EXAMPLES:

Classify image samples/bee.jpg, getting the 3 most likely classes :

bin/linux_64/overfeat -n 3 samples/bee.jpg

Extract features from samples/pliers.jpg with the large network :

bin/linux_64/overfeat -f -l samples/pliers.jpg

Extract the features from all files in samples :

./bin/linux_64/overfeat_batch -i samples -o samples_features

Run the webcam demo with the large network :

bin/linux_64/webcam -l

ADVANCED:

The true program is actually overfeatcmd, where overfeat is only a python script calling overfeatcmd. overfeatcmd is not designed to be used by itself, but can be if necessary. It taked three arguments :

bin/linux_64/overfeatcmd

If is positive, it is, as before, the number of top classes to display. If is nonpositive, the features are going to be the output. The option specifies from which layer the features are obtained (by default, =16, corresponding to the last layer before the classifier). corresponds to the size of the network : 0 for small, 1 for large.

APIs:

C++:

The library is written in C++. It consists of one static library named liboverfeat.a . The corresponding header is overfeat.hpp . It uses the low level torch tensor library (TH). Sample code can be found in overfeatcmd.cpp and webcam.cpp.

The library provides several functions in the namespace overfeat :

void init(const std::string & weight_file_path, int net_idx) : This function must be called once before using the feature extractor. It reads the weights and must be passed a path to the weight files. It must also be passed the size of the network (net_idx), which should be 0, or 1, respectively for small or large networks. Note that the weight file must correspond to the size of the network.

void free() : This function releases the ressources and should be called when the feature extractor is no longer used.

THTensor* fprop(THTensor* input) : This is the main function. It takes an image stored in a THTensor* and runs the network on it. It returns a pointer to a THTensor containing the output of the classifier. If the input is 3HW, the output is going to be nClasses * h * w, where

for the small network :

nClasses = 1000

h = ((H-11)/4 + 1)/8 - 6

w = ((W-11)/4 + 1)/8 - 6

for the large network :

nClasses = 1000

h = ((H-7)/2 + 1)/18 - 5

w = ((W-7)/2 + 1)/18 - 5 Each pixel of the output corresponds to a 231x231 window on the input for the small network, and 221x221 for the large network. The windows overlap in the same way as described earlier for the feature extraction. Each class gets a score, but they are not probabilities (they are not normalized).

THTensor* get_output(int i) : Once fprop has been computed, this function returns the output of any layer. For instance, in the default network, layer 16 corresponds to the final features before the classifier.

int get_n_layers() : Returns the total number of layers of the network.

void soft_max(THTensor* input, THTensor* output) : This function converts the output to probabilities. It only works if h = w = 1 (only one output pixel).

std::string get_class_name(int i) : This function returns the string corresponding to the i-th class.

std::vector > get_top_classes(THTensor* probas, int n) : Given a vector with nClasses elements containing scores or probabilities, this function returns the names of the top n classes, along with their score/probabilities.

When compiling code using liboverfeat.a, the code must also be linked against libTH.a, the tensor library. The file libTH.a will have been produced when compiling torch.

Torch7:

We have bindings for torch, in the directory API/torch. The file API/torch/README contains more details.

Python:

The bindings for python are in API/python. See API/python/README .

Comments

Issue with libopenblas.so.0

If followed your directions for installation on Ubuntu 12.04 LTS (via AWS) and am getting the following error when trying to run the example:

$ bin/linux_64/overfeat samples/bee.jpg
sh: 1: convert: not found
/socialq/overfeat/bin/linux_64/overfeatcmd: error while loading shared libraries: libopenblas.so.0: cannot open shared object file: No such file or directory

opened by josephmisiti 25

Fix error with extra_link_args in Python API.
While trying to compile the Python extension, I encountered the same issue as the two people here:

https://groups.google.com/d/msg/overfeat/nzcWFEdAMD0/OS069TrBhfsJ

Here's a pull request that fixes the problem. Thanks!
opened by dnouri 0
cropping a square from the webcam (assuming w>h)

Salut Pierre,

Instead of just resizing to square, this first crops the webcam to a centered square. No deformation => better performance for demos with the webcam. :)

Cheers, Gabriel

opened by syhw 0
Inconsistency of the layer number

In webpage http://cilvr.nyu.edu/doku.php?id=software:overfeat:start, it says "The option -f corresponds to layer 18 for the small network, and 21 for the large one." However in https://github.com/sermanet/OverFeat/blob/master/src/overfeat#L54, 19 is for small, and 22 for the large. While in README.md, it says "The option -f corresponds to layer 21 for the small layer and 24 for the large one."

I am totally confused with this inconsistency. Could you please fix them?

opened by zhongwen 0

All Outputs of Network are the Same (Python API)

Hi, I receive the same output for every input image I give the Overfeat model using the Python API. Minimal code to reproduce:

>>> overfeat.init(path_to_weights, 0)
>>> img = np.random.rand(3, 231, 231).astype(np.float32)
>>> r1 = overfeat.fprop(img)
>>> img = np.random.rand(3, 231, 231).astype(np.float32)
>>> r2 = overfeat.fprop(img)
>>> all(r1 == r2)
True

opened by craymichael 2

Error with installing Python api

When I run python setup.py install in Terminal Show the following error

chenwei@Ubuntu14:~/project/OverFeat/overfeat (2)/API/python$ python setup.py install /usr/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'install_requires' warnings.warn(msg) running install running build running build_ext building 'overfeat' extension creating build creating build/temp.linux-x86_64-2.7 x86_64-linux-gnu-gcc -pthread -fno-strict-aliasing -DNDEBUG -g -fwrapv -O2 -Wall -Wstrict-prototypes -fPIC -I../../src -I/usr/local/lib/python2.7/dist-packages/numpy/core/include -I/usr/include/python2.7 -c overfeatmodule.cpp -o build/temp.linux-x86_64-2.7/overfeatmodule.o -fopenmp cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++ [enabled by default] In file included from ../../src/THTensor.hpp:5:0, from ../../src/overfeat.hpp:4, from overfeatmodule.cpp:3: ../../src/TH/TH.h:4:23: fatal error: THGeneral.h: 没有那个文件或目录 #include "THGeneral.h" ^ compilation terminated. error: command 'x86_64-linux-gnu-gcc' failed with exit status 1

opened by Cw-zero 2
How multi-scale CNN selects final output map

I read a few days ago about multi-scale CNN (in OverFeat method), which you can access to presentation via this link. You performed CNN on different scales of an image and then combine all output maps. You said inside of that presentation:

Classification performed at 6 scales at test time, but only 1 scale at run time .

So my question is: If we use 6 different scales of CNN architecture, then we have different convolution layers in every scale (I guess so). So how in OverFeat You use just 1 scale in run time? if we use specific scale then how we can access to other feature extractor of different scales?, and I see in the article You combine feature maps of different scales but I can't figure out how this process performed.

Thanks

opened by robosina 0
Invalid Image Format error in overfeat_batch

I am using the Overfeat tool to extract image features. However, when I input a set of JPG images using the batch file "overfeat_batch", I got the following error, "Invalid image format (must be 'P6')". I checked the code and saw the step which converts JPG images to PPM images, but I still cannot figure out why this convertion is working well for the JPG images given in the sample folder, but not working properly for my JPG images. Is there any requirement for the image format? Thanks.

opened by sancharidan 0
Process for saving weights to file

How were the weights saved to disk? Were they extracted from Torch using getParameters then saved as a torch object using torch.save? I'm interested because I've trained my own smaller network and would like to load the weights from torch/lua training in C++.

opened by nmante 2

OverFeat is a Convolutional Network-based image classifier and feature extractor.

Related tags

Overview

OverFeat

CREDITS, LICENSE, CITATION

INSTALLATION:

HIGH LEVEL INTERFACE:

APIs:

Comments

Issue with libopenblas.so.0

Fix error with extra_link_args in Python API.

cropping a square from the webcam (assuming w>h)

Inconsistency of the layer number

All Outputs of Network are the Same (Python API)

Error with installing Python api

How multi-scale CNN selects final output map

Invalid Image Format error in overfeat_batch

Process for saving weights to file

Owner

Static Features Classifier - A static features classifier for Point-Could clusters using an Attention-RNN model

A method that utilized Generative Adversarial Network (GAN) to interpret the black-box deep image classifier models by PyTorch.

An Image compression simulator that uses Source Extractor and Monte Carlo methods to examine the post compressive effects different compression algorithms have.

PyTorch implementation of Graph Convolutional Networks in Feature Space for Image Deblurring and Super-resolution, IJCNN 2021.

Minimal But Practical Image Classifier Pipline Using Pytorch, Finetune on ResNet18, Got 99% Accuracy on Own Small Datasets.

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

This is a model made out of Neural Network specifically a Convolutional Neural Network model

This repository implements and evaluates convolutional networks on the Möbius strip as toy model instantiations of Coordinate Independent Convolutional Networks.

A sketch extractor for anime/illustration.

Table-Extractor 表格抽取

Snscrape-jsonl-urls-extractor - Extracts urls from jsonl produced by snscrape

Scripts for training an AI to play the endless runner Subway Surfers using a supervised machine learning approach by imitation and a convolutional neural network (CNN) for image classification

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network.

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network

paper: Hyperspectral Remote Sensing Image Classification Using Deep Convolutional Capsule Network

This project demonstrates the use of neural networks and computer vision to create a classifier that interprets the Brazilian Sign Language.

People movement type classifier with YOLOv4 detection and SORT tracking.