DeepLab to UberNet: from task-specific to task-agnostic deep learning in computer vision
Date: September 26, 2016, 13:30-15:00
University of Amsterdam
Science Park 904
1098 XH Amsterdam
Over the last few years Convolutional Neural Networks (CNNs) have been shown to deliver excellent results in a broad range of low- and high-level vision tasks, spanning effectively the whole spectrum of computer vision problems.
In this talk we will present recent research progress along two complementary directions.
In the first part we will present research efforts on integrating established computer vision ideas with CNNs, thereby allowing us to incorporate task-specific domain knowledge in CNNs. We will present CNN-based adaptations of structured prediction techniques that use discrete (DenseCRF, as in DeepLab) and continuous (Deep Gaussian CRF) energy-based formulations, and will also present methods to incorporate ideas from multi-scale processing, Multiple-Instance Learning and Spectral Clustering into CNNs.
In the second part of the talk we will turn to designing a generic architecture that can tackle a multitude of tasks jointly, with the aim of building a ‘Swiss knife’ for vision. We call this network an ‘UberNet’ to underline its overarching nature. We will introduce techniques that allow us to train an UberNet on datasets with diverse annotations while also handling the memory limitations of current hardware. The proposed architecture is able to jointly address (a) boundary detection, (b) saliency detection, (c) normal estimation, (d) semantic segmentation, (e) human part segmentation, (f) human boundary detection, and (g) region proposal generation and object detection in 0.7 seconds per frame, with a level of performance that is comparable to the current state of the art on these tasks.
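As a rough illustration of what training on datasets with diverse annotations can involve, the sketch below sums per-task losses for one image and skips every task for which that image has no ground truth. This is not the UberNet training procedure: the task names, the squared-error loss, and the function `masked_multitask_loss` are all hypothetical and chosen only to keep the example self-contained.

```python
from typing import Dict, List, Optional, Sequence, Tuple


def squared_error(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def masked_multitask_loss(
    predictions: Dict[str, Sequence[float]],
    annotations: Dict[str, Optional[Sequence[float]]],
    weights: Optional[Dict[str, float]] = None,
) -> Tuple[float, List[str]]:
    """Sum weighted per-task losses, skipping tasks without annotations.

    A task counts as unannotated when its key is missing from
    `annotations` or maps to None. Returns the total loss and the
    list of tasks that contributed to it.
    """
    total = 0.0
    used: List[str] = []
    for task, pred in predictions.items():
        target = annotations.get(task)
        if target is None:
            continue  # no ground truth for this task on this image
        weight = 1.0 if weights is None else weights.get(task, 1.0)
        total += weight * squared_error(pred, target)
        used.append(task)
    return total, used


# Hypothetical image that is annotated for boundaries only.
preds = {
    "boundaries": [0.5, 0.5],
    "saliency": [1.0, 0.0],
    "normals": [0.0, 0.0],
}
annos = {"boundaries": [1.0, 0.0], "saliency": None}
loss, used = masked_multitask_loss(preds, annos)
print(loss, used)
```

In this toy setting only the boundary task contributes to the loss, so an image from a boundary-only dataset would not push the other task outputs in any direction.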
Iasonas Kokkinos, CentraleSupelec/INRIA
Iasonas Kokkinos obtained the Diploma of Engineering in 2001 and the Ph.D. Degree in 2006 from the School of Electrical and Computer Engineering of the National Technical University of Athens in Greece, and the Habilitation Degree in 2013 from Université Paris-Est.
In 2006 he joined the University of California at Los Angeles as a postdoctoral scholar, and in 2008 he joined the faculty of the Department of Applied Mathematics of Ecole Centrale Paris (CentraleSupelec). He is currently an associate professor in the Center for Visual Computing of CentraleSupelec and is also affiliated with INRIA-Saclay in Paris. His research activity is currently focused on deep learning and efficient algorithms for object detection.
He has been awarded a young researcher grant by the French National Research Agency, serves regularly as a reviewer for all major computer vision conferences and journals, and is an associate editor for the Image and Vision Computing and the Computer Vision and Image Understanding journals.
UberNet demo: cvn.ecp.fr/ubernet
Boundary Detection: cvn.ecp.fr/personnel/iasonas/deepboundaries
I. Kokkinos, UberNet: Training a ‘Universal’ CNN for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory, arXiv, 2016.
S. Chandra and I. Kokkinos, Fast, Exact and Multi-Scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs, Proc. European Conf. on Computer Vision (ECCV), 2016.
L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, A. L. Yuille, DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs, v1: ICLR 2015, v2: arXiv, 2016.
I. Kokkinos, Pushing the Boundaries of Boundary Detection using Deep Learning, Int'l Conf. on Learning Representations (ICLR), 2016.