Some interesting Computer Vision papers from ICCV 2017:
The International Conference on Computer Vision (ICCV) is one of the top-tier conferences in computer vision. This year it was held in Venice, Italy. Out of 2143 valid submissions at ICCV, 621 papers were accepted with an acceptance rate of 28.9%. Some interesting highlights include the application of reinforcement learning for object tracking, visual dialog and activity forecasting, along with the improvement in the generation as well as the applications of Generative Adversarial Networks (GANs). The proposal of novel loss functions (focal loss, range loss) to address class imbalance was exciting.
There were a lot of interesting and exciting papers. Here is my list of some of the interesting papers from ICCV 2017 categorized by applications.
Segmentation:
-
Mask R-CNN (Best paper award)
-
Segmentation-Aware Convolutional Networks Using Local Attention Masks
-
SegFlow: Joint Learning for Video Object Segmentation and Optical Flow
-
Universal Adversarial Perturbations Against Semantic Image Segmentation
Tracking:
-
Robust Object Tracking based on Temporal and Spatial Deep Networks
-
Learning Policies for Adaptive Tracking with Deep Feature Cascades
Detection:
-
Focal Loss for Dense Object Detection (Best student paper award)
-
Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors
-
Temporal Dynamic Graph LSTM for Action-driven Video Object Detection
-
Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
Optimization:
-
High Order Tensor Formulation for Convolutional Sparse Coding
-
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
-
Robust Kronecker-Decomposable Component Analysis for Low-Rank Modeling
Generative Adversarial Networks:
-
Generative Adversarial Networks Conditioned by Brain Signals
-
StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks
Face Analysis:
-
Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
-
Realistic Dynamic Facial Textures from a Single Image using GANs
-
Learning Discriminative Aggregation Network for Video-based Face Recognition
-
Range Loss for Deep Face Recognition with Long-Tailed Training Data
-
Temporal Non-Volume Preserving Approach to Facial Age-Progression and Age-Invariant Face Recognition
-
DeepCoder: Semi-parametric Variational Autoencoders for Automatic Facial Action Coding
Action Analysis:
-
Online Real-time Multiple Spatiotemporal Action Localisation and Prediction
-
Learning long-term dependencies for action recognition with a biologically-inspired deep network
-
Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions
-
TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal
General/Other interesting problems:
-
One Network to Solve Them All — Solving Linear Inverse Problems using Deep Projection Models
-
EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis
-
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
-
Towards Diverse and Natural Image Descriptions via a Conditional GAN