Long-term Recurrent Convolutional Networks

http://jeffdonahue.com/lrcn/
        This is the project page for Long-term Recurrent Convolutional Networks (LRCN), a class of models that unifies the state of the art in visual and sequence learning. LRCN was accepted as an oral presentation at CVPR 2015. See our arXiv report for details on our approach.


https://arxiv.org/abs/1411.4389
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
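The LRCN idea described above — per-frame visual features from a convolutional network fed through an LSTM that emits a prediction at every time step — can be sketched in a few lines of numpy. This is a minimal illustrative sketch only: the stub linear "CNN", the layer sizes, and all names are assumptions for demonstration, not the paper's actual architecture or the released Caffe code.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class LRCNSketch:
    """Toy LRCN: per-frame feature extractor -> LSTM -> per-step scores.

    The feature extractor is a random linear map standing in for a CNN;
    every parameter here is an illustrative assumption.
    """

    def __init__(self, frame_dim, feat_dim, hidden_dim, num_classes):
        # Stub "CNN": a fixed linear projection of raw frame pixels.
        self.W_cnn = rng.normal(0, 0.1, (frame_dim, feat_dim))
        # LSTM parameters, with input/forget/output/cell gates stacked.
        self.W = rng.normal(0, 0.1, (feat_dim + hidden_dim, 4 * hidden_dim))
        self.b = np.zeros(4 * hidden_dim)
        # Per-time-step classifier on top of the hidden state.
        self.W_out = rng.normal(0, 0.1, (hidden_dim, num_classes))
        self.hidden_dim = hidden_dim

    def forward(self, frames):
        # frames: (T, frame_dim) -> per-step class scores (T, num_classes)
        h = np.zeros(self.hidden_dim)
        c = np.zeros(self.hidden_dim)
        outputs = []
        for x_t in frames:
            feat = np.tanh(x_t @ self.W_cnn)           # per-frame feature
            z = np.concatenate([feat, h]) @ self.W + self.b
            i, f, o, g = np.split(z, 4)                # gate pre-activations
            c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
            h = sigmoid(o) * np.tanh(c)
            outputs.append(h @ self.W_out)             # score every time step
        return np.stack(outputs)

model = LRCNSketch(frame_dim=64, feat_dim=32, hidden_dim=16, num_classes=5)
scores = model.forward(rng.normal(size=(10, 64)))      # a 10-frame clip
print(scores.shape)  # (10, 5): one class-score vector per frame
```

Emitting an output at every step is what lets the same recurrent core serve both sequence-level tasks (activity recognition, by pooling or reading the final step) and sequence-output tasks (captioning and description, by decoding step by step).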

Code
        We have created a Pull Request to the official BVLC Caffe repository which adds support for RNNs and LSTMs, and provides an example of training an LRCN model for image captioning on the COCO dataset. To use the code before it is merged into the official Caffe repository, check out the recurrent branch of Jeff Donahue's Caffe fork at git@github.com:jeffdonahue/caffe.git. Instructions for replicating the activity recognition experiments are available at the Activity Recognition page. We will update this page as the code is officially released and the video description code becomes available.
https://github.com/BVLC/caffe/pull/2033
https://people.eecs.berkeley.edu/~lisa_anne/LRCN_video

Example Results - Video description (multiple sentences)
        Scrambled egg
        Preparing onions
        Making hot dog - partial failure case

        http://youtu.be/w2iV8gt5cd4
        http://youtu.be/9VH8bn7ikbw
        http://youtu.be/nsoWwROh-7g


TACoS Multi-Level Corpus
https://www.mpi-inf.mpg.de/departments/computer-vision-and-multimodal-computing/research/vision-and-language/tacos-multi-level-corpus/

This site hosts the TACoS Multi-Level corpus presented in Coherent Multi-Sentence Video Description with Variable Level of Detail.

        This research was supported by the Berkeley vision group and BVLC. To cite LRCN with BibTeX, use:

  @inproceedings{lrcn2014,
    Author = {Jeff Donahue and Lisa Anne Hendricks and Sergio Guadarrama
              and Marcus Rohrbach and Subhashini Venugopalan and Kate Saenko
              and Trevor Darrell},
    Title = {Long-term Recurrent Convolutional Networks
             for Visual Recognition and Description},
    Year = {2015},
    Booktitle = {CVPR}
  }

