GitHub - yifita/action.sr_cnn: code for Two-Stream SR-CNNs for Action Recognition in Videos 2016

BMVC2016 Two-Stream SR-CNNs for Action Recognition in Videos

Prerequisites

Caffe

clone and build caffe from here. This caffe version is based on Limin Wang's fork [1] contains merge_batch and weighted_sum layer. In addition it exposed some protected caffe functions in the matlab interface to emulate iter_size in matlab.
modify caffe_mex.m to the corresponding caffe matlab interface directory

Optical Flow

extract optical flow with Limin's flow extractor

Bounding Boxes

We extracted 118 objects' bounding boxes in all video frames using Faster-RCNN [2] (retraining is required) and obtained filtered bounding boxes taking consideration of temporal coherency and motion saliency.
The extracted and processed bounding boxes for ucf-101 can be downloaded here. Place the downloaded mat files under imdb/cache.
If you wish to extract the bounding boxes yourself, you need to be able to run Ren Shaoqing's Faster-RCNN (most codes are migrated into this repository with minor modifications and more comments)
- First generate raw object detection using faster_rcnn_{dataset}.m
- Then use action/prepare_rois_context.m to process bounding boxes as described in the paper.

Test

datasets

create dataset.mat using imdb/get_{name}_dataset.m (Directories may need to be adjusted!) An example of generated ucf_dataset.mat

models

models/srcnn/{stream} contains model prototxt files
model weights can be downloaded in the following links

Stream person+scene (the final proposed model in the paper)

spatial split1 split2 split3

flow split1 split2 split3
the reported two-stream results in the paper are yielded from summing spatial and temporal classification scores using weight 1 : 3.
other models mentioned in the paper experiments can be provided if the demand is large.

run

in matlab

% test spatial
test_spatial('model_path', path_to_weights, 'split', 1)

% test flow
`test_flow('model_path', path_to_weights, 'split', 1)`

Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems (pp. 91-99).

Wang, L., Xiong, Y., Wang, Z., & Qiao, Y. (2015). Towards good practices for very deep two-stream convnets. arXiv preprint arXiv:1507.02159.

Citation

Please cite the following if you find the code useful.

@inproceedings{wang2016two,
  title={Two-Stream SR-CNNs for Action Recognition in Videos},
  author={Yifan, Wang and Song, Jie and Wang, Limin and Van Gool, Luc and Hilliges, Otmar},
  year={2016},
  organization={BMVC}
}

Contact

Yifan Wang: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
+Dataset		+Dataset
+Faster_RCNN_Train		+Faster_RCNN_Train
+Model		+Model
action		action
functions		functions
imdb		imdb
models		models
util		util
.gitignore		.gitignore
LICENSE		LICENSE
caffe_mex.m		caffe_mex.m
faster_rcnn_build.m		faster_rcnn_build.m
faster_rcnn_hmdb.m		faster_rcnn_hmdb.m
faster_rcnn_human_jhmdb.m		faster_rcnn_human_jhmdb.m
faster_rcnn_ucf.m		faster_rcnn_ucf.m
readme.md		readme.md
test_flow.m		test_flow.m
test_spatial.m		test_spatial.m
train_faster_rcnn_voc_ilsvrc.m		train_faster_rcnn_voc_ilsvrc.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BMVC2016 Two-Stream SR-CNNs for Action Recognition in Videos

Prerequisites

Caffe

Optical Flow

Bounding Boxes

Test

datasets

models

run

Citation

Contact

About

Releases

Packages

Languages

Stream	person+scene (the final proposed model in the paper)
spatial	split1 split2 split3
flow	split1 split2 split3

License

yifita/action.sr_cnn

Folders and files

Latest commit

History

Repository files navigation

BMVC2016 Two-Stream SR-CNNs for Action Recognition in Videos

Prerequisites

Caffe

Optical Flow

Bounding Boxes

Test

datasets

models

run

Citation

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages