Ross B. Girshick URL-优快云博客

Ross Girshick是一位在Facebook AI Research工作的研究科学家，专注于计算机视觉、AI和机器学习领域的研究，特别关注构建用于目标检测和识别的模型。他的研究兴趣包括在中到大规模数据集上理解图像内容，并且致力于开发快速、实践有效的系统。在博士期间，他参与了多项研究，包括在PASCAL VOC目标检测挑战赛中的优秀表现。

Ross B. Girshick
Research Scientist
Facebook AI Research (FAIR)
~~Researcher~~
~~Microsoft Research, Redmond~~
~~Postdoctoral fellow~~
~~University of California, Berkeley, EECS~~
r......@eecs.berkeley.edu
cv / google scholar / Ph.D. thesis
papers: arXiv / journal / conference
MSR website

from: http://www.cs.berkeley.edu/~rbg/

About me

I finished my Ph.D. in computer vision at The University of Chicago under the supervision of Pedro Felzenszwalb in April 2012. Then, I spent two unbelievably wonderful years as a postdoc at UC Berkeleyunder Jitendra Malik. From Berkeley, I spent just over one year as a Researcher at Microsoft Research, Redmond. Now, I'm off on a new adventure as a Research Scientist with the terrific group of researchers and engineers in Facebook AI Research (FAIR).

My main research interests are in computer vision, AI, and machine learning. I'm particularly focused on building models for object detection and recognition. These models aim to incorporate the "right" biases so that machine learning algorithms can understand image content from moderate to large-scale datasets. I always have an eye towards fast systems that work well in practice.

During my Ph.D., I spent time as a research intern at Microsoft Research Cambridge, UK working on human pose estimation from (Kinect) depth images. I also participated in several first-place entries into the PASCAL VOC object detection challenge, and was awarded a "lifetime achievement" prize for my work on deformable part models. I think this refers to the lifetime of the PASCAL challenge—and not mine!

News

Sean Bell led our team, together with Kavita Bala and Larry Zitnick, to 3rd place in the 2015 MS COCO object detection challenge! Sean also won the prize for the best student-led entry. Check out our tech report describing the ION (inside-outside network) detector:
Faster R-CNN: paper / Python code / Matlab code
Fast R-CNN: paper / code

Project pages

Faster R-CNN github repository (Python source code for training and using Faster R-CNN)
Fast R-CNN github repository (Python source code for training and using Fast R-CNN)
R-CNN github repository (Matlab source code for training and using R-CNN)
<a href="http://www.eecs.berkeley.edu/Research/Projects/CS/vision/shape/sds/" <="" a="" style="margin: 0pt; padding: 0pt; border: 0pt none; font-family: inherit; font-style: inherit; font-weight: inherit; outline: invert none 0pt; vertical-align: baseline; color: rgb(23, 114, 208); text-decoration: none;">SDS: Simultaneous Detection and Segmentation
R-CNN (and more!) for RGB-D data
LSDA: Large Scale Detection through Adaptation
Deformable Part Models (DPM) (voc-release5 — dissertation code / includes cascade and NIPS grammar model)
Cascade object detection with deformable part models (add-on code for voc-release4.01)

arXiv tech reports

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick
arXiv [cs.CV]
abstract / bibtex

Object Detection Networks on Convolutional Feature Maps
Shaoqing Ren, Kaiming He, Ross Girshick, Xiangyu Zhang, Jian Sun
arXiv [cs.CV]
bibtex

Journal papers

Region-based Convolutional Networks for Accurate Object Detection and Segmentation

Region-based Convolutional Networks for Accurate Object Detection and Semantic Segmentation
R. Girshick, J. Donahue, T. Darrell, J. Malik
IEEE Transactions on Pattern Analysis and Machine Intelligence (accepted May 18, 2015)
abstract / code / CVPR'14 version

Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation
Saurabh Gupta, Pablo Arbeláez, Ross Girshick, Jitendra Malik
International Journal of Computer Vision (IJCV), 2014
code / bibtex

Efficient Human Pose Estimation from Single Depth Images
J. Shotton, R. Girshick, A. Fitzgibbon, T. Sharp, M. Cook, M. Finocchio, R. Moore, P. Kohli, A. Criminisi, A. Kipman, A. Blake
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, No. 12, Dec. 2013
abstract / bibtex

Object Detection with Discriminatively Trained Part Based Models ^†
P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 32, No. 9, Sep. 2010
abstract / PAMI code / latest code (voc-release5) / bibtex

Conference papers

2015

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun
Neural Information Processing Systems (NIPS), 2015
Python code / Matlab code / bibtex

Fast R-CNN
Ross Girshick
IEEE International Conference on Computer Vision (ICCV), 2015
oral presentation
code / slides / bibtex

Contextual Action Recognition with R*CNN
Georgia Gkioxari, Ross Girshick, Jitendra Malik
IEEE International Conference on Computer Vision (ICCV), 2015
code / bibtex

Actions and Attributes from Wholes and Parts
Georgia Gkioxari, Ross Girshick, Jitendra Malik
IEEE International Conference on Computer Vision (ICCV), 2015
bibtex

Hypercolumns for object segmentation and fine-grained localization
Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
oral presentation
bibtex

Aligning 3D models to RGB-D images of cluttered scenes
Saurabh Gupta, Pablo Arbeláez, Ross Girshick, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
bibtex

Deformable Part Models are Convolutional Neural Networks
Ross Girshick, Forrest Iandola, Trevor Darrell, Jitendra Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
bibtex

2014

LSDA: Large Scale Detection through Adaptation
Judy Hoffman, Sergio Guadarrama, Eric Tzeng, Ronghang Hu, Jeff Donahue, Ross Girshick, Trevor Darrell, Kate Saenko
Neural Information Processing Systems (NIPS), 2014
project, code, models / bibtex

Simultaneous Detection and Segmentation, ECCV 2014

Simultaneous Detection and Segmentation
Bharath Hariharan, Pablo Arbeláez, Ross Girshick, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
project page (with code) / bibtex

Learning Rich Features from RGB-D Images for Object Detection and Segmentation, ECCV 2014

Learning Rich Features from RGB-D Images for Object Detection and Segmentation
Saurabh Gupta, Ross Girshick, Pablo Arbeláez, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
code, models, and data / bibtex

Analyzing the Performance of Multilayer Neural Networks for Object Recognition, ECCV 2014

Analyzing the Performance of Multilayer Neural Networks for Object Recognition
Pulkit Agrawal, Ross Girshick, Jitendra Malik
European Conference on Computer Vision (ECCV), 2014
bibtex

Part-based R-CNNs for Fine-grained Category Detection, ECCV 2014

Part-based R-CNNs for Fine-grained Category Detection
Ning Zhang, Jeff Donahue, Ross Girshick, Trevor Darrell
European Conference on Computer Vision (ECCV), 2014
oral presentation
bibtex

On learning to localize objects with minimal supervision, ICML 2014

On Learning to Localize Objects with Minimal Supervision
Hyun Oh Song, Ross Girshick, Stefanie Jegelka, Julien Mairal, Zaid Harchaoui, Trevor Darrell
International Conference on Machine Learning (ICML), 2014
code / bibtex

Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR 2014

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
R. Girshick, J. Donahue, T. Darrell, J. Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
oral presentation
arXiv tech report (includes ImageNet results) / supplement / code / poster / slides / bibtex

Using k-poselets for Detecting People and Localizing their Keypoints, CVPR 2014

Using k-poselets for Detecting People and Localizing their Keypoints
G. Gkioxari*, B. Hariharan*, R. Girshick, J. Malik
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
* equal contribution
project page / code / github / bibtex

Understanding Objects in Detail with Fine-grained Attributes, CVPR 2014

Understanding Objects in Detail with Fine-grained Attributes
A. Vedaldi, S. Mahendran, S. Tsogkas, S. Maji, R. Girshick, J. Kannala, E. Rahtu, I. Kokkinos, M. B. Blaschko, D. Weiss, B. Taskar, K. Simonyan, N. Saphra, S. Mohamed
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
dataset / bibtex

2013

Training Deformable Part Models with Decorrelated Features
R. Girshick, J. Malik
IEEE International Conference on Computer Vision (ICCV), 2013
supplement / LM-LLDA DPM training code / bibtex

Discriminatively activated sparselets, ICML 2013

Discriminatively Activated Sparselets
R. Girshick*, H. O. Song*, T. Darrell
International Conference on Machine Learning (ICML), 2013
oral presentation
supplement / Caltech-101 demo code / bibtex

2012 and earlier

Sparselet Models for Efficient Multiclass Object Detection
H.O. Song, S. Zickler, T. Althoff, R. Girshick, M. Fritz, C. Geyer, P. Felzenszwalb, T. Darrell
European Conference on Computer Vision (ECCV), 2012
code / bibtex

Object detection with grammar models, NIPS 2011

Object Detection with Grammar Models
R. Girshick, P. Felzenszwalb, D. McAllester
Neural Information Processing Systems (NIPS), 2011
spotlight video / code (voc-release5) / bibtex

Efficient regression of general-activity human poses from depth images, ICCV 2011

Efficient Regression of General-Activity Human Poses from Depth Images
R. Girshick, J. Shotton, P. Kohli, A. Criminisi, A. Fitzgibbon
IEEE International Conference on Computer Vision (ICCV), 2011
supplement / video / bibtex

Cascade object detection with deformable part models, CVPR 2010

Cascade Object Detection with Deformable Part Models ^†
P. Felzenszwalb, R. Girshick, D. McAllester
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010
oral presentation
slides (pdf) / slides (keynote) / talk / code (voc-release5) / bibtex

Visibility constraints on features of 3D objects, CVPR 2009

Visibility Constraints on Features of 3D Objects ^†
R. Basri, P. Felzenszwalb, R. Girshick, D. Jacobs, C. Klivans
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009
bibtex

Simulating chinese brush painting, SIGGRAPH poster 2004

Simulating Chinese Brush Painting: the Parametric Hairy Brush
R. Girshick
ACM SIGGRAPH Posters, 2004
Session: Nonphotorealistic Animation and Rendering
bibtex

^†Authors listed alphabetically