Blog
The latest from Google Research
Google at ICCV 2019
Monday, October 28, 2019
Posted by Andrew Helton, Editor, Google Research Communications
This week, Seoul, South Korea hosts the
International Conference on Computer Vision 2019
(ICCV 2019), one of the world's premier conferences on computer vision. As a leader in computer vision research and a Gold Sponsor, Google will have a strong presence at ICCV 2019 with over 200 Googlers in attendance, more than 40 research presentations, and involvement in the organization of a number of
workshops
and
tutorials
.
If you are attending ICCV this year, please stop by our booth. There you can chat with researchers who are actively pursuing the latest innovations in computer vision and demo some of their latest research, including the technology behind
MediaPipe
, the new
Open Images
dataset, new developments for
Google Lens
and much more.
This year Google researchers are recipients of three prestigious ICCV awards:
Distinguished Researcher Award —
Bill Freeman
, Research Scientist, Google Research
Helmholtz Prize (Test of Time Award) — ICCV 2009 paper, "
Building Rome in a Day
", by
Sameer Agarwal
,
Noah Snavely
, Ian Simon,
Steve Seitz
and Rick Szeliski
Marr Prize (Best Paper Award) — ICCV 2019 paper, "
SinGAN: Learning a Generative Model from a Single Natural Image
", by Tamar Rott Shaham,
Tali Dekel
and Tomer Michaeli
More details about the Google research being presented at ICCV 2019 can be found below (Google affiliations in
blue
).
Organizing Committee
includes:
Ming-Hsuan Yang (Program Chair)
Oral Presentations
Learning Single Camera Depth Estimation using Dual-Pixels
Rahul Garg
,
Neal Wadhwa
,
Sameer Ansari
,
Jonathan Barron
RIO: 3D Object Instance Re-Localization in Changing Indoor Environments
Johanna Wald, Armen Avetisyan, Nassir Navab,
Federico Tombari
, Matthias Niessner
ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors
Weicheng Kuo
,
Anelia Angelova
, Jitendra Malik,
Tsung-Yi Lin
PuppetGAN: Cross-Domain Image Manipulation by Demonstration
Ben Usman
,
Nick Dufour
, Kate Saenko,
Chris Bregler
COCO-GAN: Generation by Parts via Conditional Coordinating
Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen,
Da-Cheng Juan
,
Wei Wei
, Hwann-Tzong Chen
Towards Unconstrained End-to-End Text Spotting
Siyang Qin
,
Alessandro Bissaco
,
Michalis Raptis
,
Yasuhisa Fujii
,
Ying Xiao
SinGAN: Learning a Generative Model from a Single Natural Image
Tamar Rott Shaham,
Tali Dekel
, Tomer Michaeli
(ICCV 2019 Marr Prize Winner — Best Paper Award)
Generative Modeling for Small-Data Object Detection
Lanlan Liu,
Michael Muelly
, Jia Deng,
Tomas Pfister
, Li-Jia Li
Searching for MobileNetV3
Andrew Howard
,
Mark Sandler
,
Bo Chen
,
Weijun Wang
,
Liang-Chieh Chen
,
Mingxing Tan
,
Grace Chu
,
Vijay Vasudevan
,
Yukun Zhu
,
Ruoming Pang
,
Hartwig Adam
,
Quoc Le
S⁴L: Self-Supervised Semi-supervised Learning
Lucas Beyer
,
Xiaohua Zhai
,
Avital Oliver
,
Alexander Kolesnikov
Sampling-Free Epistemic Uncertainty Estimation Using Approximated Variance Propagation
Janis Postels, Francesco Ferroni, Huseyin Coskun, Nassir Navab,
Federico Tombari
Linearized Multi-sampling for Differentiable Image Transformation
Wei Jiang, Weiwei Sun,
Andrea Tagliasacchi
,
Eduard Trulls
, Kwang Moo Yi
Poster Presentations
ELF: Embedded Localisation of Features in Pre-trained CNN
Assia Benbihi
,
Matthieu Geist
,
Cedric Pradalier
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras
Ariel Gordon
,
Hanhan Li
,
Rico Jonschkowski
,
Anelia Angelova
ForkNet: Multi-branch Volumetric Semantic Completion from a Single Depth Image
Yida Wang,
David Joseph Tan
, Nassir Navab,
Federico Tombari
A Learned Representation for Scalable Vector Graphics
Raphael Gontijo Lopes
,
David Ha
,
Douglas Eck
,
Jonathon Shlens
FrameNet: Learning Local Canonical Frames of 3D Surfaces from a Single RGB Image
Jingwei Huang
,
Yichao Zhou
,
Thomas Funkhouser
,
Leonidas Guibas
Prior-Aware Neural Network for Partially-Supervised Multi-Organ Segmentation
Yuyin Zhou,
Zhe Li
, Song Bai, Xinlei Chen, Mei Han, Chong Wang, Elliot Fishman, Alan Yuille
Boundless: Generative Adversarial Networks for Image Extension
Dilip Krishnan
,
Piotr Teterwak
,
Aaron Sarna
,
Aaron Maschinot
,
Ce Liu
,
David Belanger
,
William Freeman
Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection
Keren Ye, Mingda Zhang, Adriana Kovashka, Wei Li,
Danfeng Qin
, Jesse Berent
NOTE-RCNN: NOise Tolerant Ensemble RCNN for Semi-supervised Object Detection
Jiyang Gao,
Jiang Wang
,
Shengyang Dai
,
Li-Jia Li
, Ram Nevatia
Object-Driven Multi-Layer Scene Decomposition from a Single Image
Helisa Dhamo, Nassir Navab,
Federico Tombari
Improving Adversarial Robustness via Guided Complement Entropy
Hao-Yun Chen, Jhao-Hong Liang, Shih-Chieh Chang,
Jia-Yu Pan
,
Yu-Ting Chen
,
Wei Wei
,
Da-Cheng Juan
XRAI: Better Attributions Through Regions
Andrei Kapishnikov
,
Tolga Bolukbasi
,
Fernanda Viegas
,
Michael Terry
SegSort: Segment Sorting for Semantic Segmentation
Jyh-Jing Hwang, Stella Yu, Jianbo Shi,
Maxwell Collins
, Tien-Ju Yang, Xiao Zhang,
Liang-Chieh Chen
Self-Supervised Learning with Geometric Constraints in Monocular Video: Connecting Flow, Depth, and Camera
Yuhua Chen
,
Cordelia Schmid
,
Cristian Sminchisescu
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
,
Austin Myers
,
Carl Vondrick
,
Kevin Murphy
,
Cordelia Schmid
Explaining the Ambiguity of Object Detection and 6D Pose from Visual Data
Fabian Manhardt, Diego Martín Arroyo, Christian Rupprecht, Benjamin Busam, Tolga Birdal, Nassir Navab,
Federico Tombari
Constructing Self-Motivated Pyramid Curriculums for Cross-Domain Semantic Segmentation
Qing Lian, Lixin Duan, Fengmao Lv,
Boqing Gong
Learning Shape Templates Using Structured Implicit Functions
Kyle Genova
,
Forrester Cole
,
Daniel Vlasic
,
Aaron Sarna
,
William Freeman
,
Thomas Funkhouser
Transferable Representation Learning in Vision-and-Language Navigation
Haoshuo Huang
,
Vihan Jain
,
Harsh Mehta
,
Alexander Ku
,
Gabriel Magalhaes
,
Jason Baldridge
,
Eugene Ie
Controllable Attention for Structured Layered Video Decomposition
Jean-Baptiste Alayrac
,
Joao Carreira
,
Relja Arandjelović
,
Andrew Zisserman
Pixel2Mesh++: Multi-view 3D Mesh Generation via Deformation
Chao Wen,
Yinda Zhang
, Zhuwen Li, Yanwei Fu
Beyond Cartesian Representations for Local Descriptors
Patrick Ebel, Anastasiia Mishchuk, Kwang Moo Yi, Pascal Fua,
Eduard Trulls
Domain Randomization and Pyramid Consistency: Simulation-to-Real Generalization without Accessing Target Domain Data
Xiangyu Yue, Yang Zhang, Sicheng Zhao, Alberto Sangiovanni-Vincentelli, Kurt Keutzer,
Boqing Gong
Evolving Space-Time Neural Architectures for Videos
AJ Piergiovanni
,
Anelia Angelova
,
Alexander Toshev
,
Michael Ryoo
Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images
Valentin Gabeur
, Jean-Sebastien Franco, Xavier Martin,
Cordelia Schmid
, Gregory Rogez
Multi-view Image Fusion
Marc Comino Trinidad,
Ricardo Martin-Brualla
,
Florian Kainz
,
Janne Kontkanen
EvalNorm: Estimating Batch Normalization Statistics for Evaluation
Saurabh Singh
, Abhinav Shrivastava
Attention Augmented Convolutional Networks
Irwan Bello
,
Barret Zoph
,
Quoc Le
,
Ashish Vaswani
,
Jonathon Shlens
Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams
Yuning Chai
Workshops
Low-Power Computer Vision
Organizers include:
Bo Chen
Neural Architects
Organizers include:
Barret Zoph
The 3rd YouTube-8M Large-Scale Video Understanding Workshop
Organizers include:
Paul Natsev
,
Cordelia Schmid
,
Rahul Sukthankar
,
Joonseok Lee
,
George Toderici
Should We Pre-register Experiments in Computer Vision?
Organizers include:
Jack Valmadre
Extreme Vision Modeling
Organizers include:
Rahul Sukthankar
Joint COCO and Mapillary Recognition Challenge
Organizers include:
Tsung-Yi Lin
,
Yin Cui
Open Images Challenge
Organizers include:
Vittorio Ferrari
,
Alina Kuznetsova
,
Rodrigo Benenson
,
Victor Gomes
,
Matteo Malloci
Tutorials
Meta-Learning and Metric Learning Algorithms
Organizers include:
Kevin Swersky
Labels
accessibility
ACL
ACM
Acoustic Modeling
Adaptive Data Analysis
ads
adsense
adwords
Africa
AI
AI for Social Good
Algorithms
Android
Android Wear
API
App Engine
App Inventor
April Fools
Art
Audio
Augmented Reality
Australia
Automatic Speech Recognition
AutoML
Awards
BigQuery
Cantonese
Chemistry
China
Chrome
Cloud Computing
Collaboration
Compression
Computational Imaging
Computational Photography
Computer Science
Computer Vision
conference
conferences
Conservation
correlate
Course Builder
crowd-sourcing
CVPR
Data Center
Data Discovery
data science
datasets
Deep Learning
DeepDream
DeepMind
distributed systems
Diversity
Earth Engine
economics
Education
Electronic Commerce and Algorithms
electronics
EMEA
EMNLP
Encryption
entities
Entity Salience
Environment
Europe
Exacycle
Expander
Faculty Institute
Faculty Summit
Flu Trends
Fusion Tables
gamification
Gboard
Gmail
Google Accelerated Science
Google Books
Google Brain
Google Cloud Platform
Google Docs
Google Drive
Google Genomics
Google Maps
Google Photos
Google Play Apps
Google Science Fair
Google Sheets
Google Translate
Google Trips
Google Voice Search
Google+
Government
grants
Graph
Graph Mining
Hardware
HCI
Health
High Dynamic Range Imaging
ICCV
ICLR
ICML
ICSE
Image Annotation
Image Classification
Image Processing
Inbox
India
Information Retrieval
internationalization
Internet of Things
Interspeech
IPython
Journalism
jsm
jsm2011
K-12
Kaggle
KDD
Keyboard Input
Klingon
Korean
Labs
Linear Optimization
localization
Low-Light Photography
Machine Hearing
Machine Intelligence
Machine Learning
Machine Perception
Machine Translation
Magenta
MapReduce
market algorithms
Market Research
materials science
Mixed Reality
ML
ML Fairness
MOOC
Moore's Law
Multimodal Learning
NAACL
Natural Language Processing
Natural Language Understanding
Network Management
Networks
Neural Networks
NeurIPS
Nexus
Ngram
NIPS
NLP
On-device Learning
open source
operating systems
Optical Character Recognition
optimization
osdi
osdi10
patents
Peer Review
ph.d. fellowship
PhD Fellowship
PhotoScan
Physics
PiLab
Pixel
Policy
Professional Development
Proposals
Public Data Explorer
publication
Publications
Quantum AI
Quantum Computing
Recommender Systems
Reinforcement Learning
renewable energy
Research
Research Awards
resource optimization
Responsible AI
Robotics
schema.org
Search
search ads
Security and Privacy
Self-Supervised Learning
Semantic Models
Semi-supervised Learning
SIGCOMM
SIGMOD
Site Reliability Engineering
Social Networks
Software
Sound Search
Speech
Speech Recognition
statistics
Structured Data
Style Transfer
Supervised Learning
Systems
TensorBoard
TensorFlow
TPU
Translate
trends
TTS
TV
UI
University Relations
UNIX
Unsupervised Learning
User Experience
video
Video Analysis
Virtual Reality
Vision Research
Visiting Faculty
Visualization
VLDB
Voice Search
Wiki
wikipedia
WWW
Year in Review
YouTube
Archive
2022
Jun
May
Apr
Mar
Feb
Jan
2021
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2020
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2019
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2018
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2017
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2016
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2015
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2014
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2013
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2012
Dec
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2011
Dec
Nov
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2010
Dec
Nov
Oct
Sep
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2009
Dec
Nov
Aug
Jul
Jun
May
Apr
Mar
Feb
Jan
2008
Dec
Nov
Oct
Sep
Jul
May
Apr
Mar
Feb
2007
Oct
Sep
Aug
Jul
Jun
Feb
2006
Dec
Nov
Sep
Aug
Jul
Jun
Apr
Mar
Feb
Feed
Follow @googleai
Give us feedback in our
Product Forums
.