2019.04.14 Vision papers

 

04-10-2019

Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras
by Ariel Gordon et al

04-10-2019

Pixel-Adaptive Convolutional Neural Networks
by Hang Su et al

04-11-2019

Direct Fitting of Gaussian Mixture Models
by Leonid Keselman et al

04-11-2019

Reasoning Visual Dialogs with Structural and Partial Observations
by Zilong Zheng et al

04-11-2019

Unified Visual-Semantic Embeddings: Bridging Vision and Language with Structured Meaning Representations
by Hao Wu et al

04-11-2019

Factor Graph Attention
by Idan Schwartz et al

04-11-2019

3D Dense Face Alignment via Graph Convolution Networks
by Huawei Wei et al

04-11-2019

FrameRank: A Text Processing Approach to Video Summarization
by Zhuo Lei et al

04-10-2019

Predicting Future Pedestrian Motion in Video Sequences using Crowd Simulation
by Cliceres dal Bianco et al

04-10-2019

Mitigating Information Leakage in Image Representations: A Maximum Entropy Approach
by Proteek Chandan Roy et al

04-11-2019

FTGAN: A Fully-trained Generative Adversarial Networks for Text to Face Generation
by Xiang Chen et al

04-11-2019

A Simple Baseline for Audio-Visual Scene-Aware Dialog
by Idan Schwartz Alexander Schwing et al

04-11-2019

Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network
by Chen Li et al

04-11-2019

YUVMultiNet: Real-time YUV multi-task CNN for autonomous driving
by Thomas Boulay et al

04-11-2019

Two Body Problem: Collaborative Visual Task Completion
by Unnat Jain et al

04-10-2019

Efficient and Robust Registration on the 3D Special Euclidean Group
by Uttaran Bhattacharya et al

04-11-2019

KeyIn: Discovering Subgoal Structure with Keyframe-based Video Prediction
by Karl Pertsch et al

04-10-2019

Analyzing Dynamical Brain Functional Connectivity As Trajectories on Space of Covariance Matrices
by Mengyu Dai et al

04-10-2019

Spherical Regression: Learning Viewpoints, Surface Normals and 3D Rotations on n-Spheres
by Shuai Liao et al

04-11-2019

Variational Information Distillation for Knowledge Transfer
by Sungsoo Ahn et al

04-11-2019

Probabilistic Permutation Synchronization using the Riemannian Structure of the Birkhoff Polytope
by Tolga Birdal et al

04-10-2019

BAOD: Budget-Aware Object Detection
by Alejandro Pardo et al

04-11-2019

C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection
by Fang Wan et al

04-09-2019

SWNet: Small-World Neural Networks and Rapid Convergence
by Mojan Javaheripi et al

04-11-2019

An Empirical Study of Spatial Attention Mechanisms in Deep Networks
by Xizhou Zhu et al

04-10-2019

Attentive Action and Context Factorization
by Yang Wang et al

04-10-2019

Generalizing Monocular 3D Human Pose Estimation in the Wild
by Luyang Wang et al

04-10-2019

Weakly-Supervised White and Grey Matter Segmentation in 3D Brain Ultrasound
by Beatrice Demiray et al

04-10-2019

Instance Segmentation based Semantic Matting for Compositing Applications
by Guanqing Hu et al

04-11-2019

A Relation-Augmented Fully Convolutional Network for Semantic Segmentation in Aerial Scenes
by Lichao Mou et al

04-10-2019

Learning to Generate Synthetic Data via Compositing
by Shashank Tripathi et al

04-11-2019

An Analysis of Pre-Training on Object Detection
by Hengduo Li et al

04-09-2019

User-Controllable Multi-Texture Synthesis with Generative Adversarial Networks
by Aibek Alanov et al

04-09-2019

Foreground-aware Pyramid Reconstruction for Alignment-free Occluded Person Re-identification
by Lingxiao He et al

04-11-2019

FRNET: Flattened Residual Network for Infant MRI Skull Stripping
by Qian Zhang et al

04-11-2019

Learning Single Camera Depth Estimation using Dual-Pixels
by Rahul Garg et al

04-11-2019

Max-Sliced Wasserstein Distance and its use for GANs
by Ishan Deshpande et al

04-11-2019

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
by Georgios Pavlakos et al

04-10-2019

Sliced Wasserstein Generative Models
by Jiqing Wu et al

04-09-2019

Fast Accurate CT Metal Artifact Reduction using Data Domain Deep Learning
by Muhammad Usman Ghani et al

04-09-2019

Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving
by Jiwoong Choi et al

04-10-2019

Black-box Adversarial Attacks on Video Recognition Models
by Linxi Jiang et al

04-11-2019

Black-Box Decision based Adversarial Attack with Symmetric αα-stable Distribution
by Vignesh Srinivasan et al

04-11-2019

MAIN: Multi-Attention Instance Network for Video Segmentation
by Juan Leon Alcazar et al

04-10-2019

C3AE: Exploring the Limits of Compact Model for Age Estimation
by Chao Zhang et al

04-11-2019

Reducing Lateral Visual Biases in Displays
by Inbar Huberman et al

04-09-2019

3D Object Instance Recognition and Pose Estimation Using Triplet Loss with Dynamic Margin
by Sergey Zakharov et al

04-10-2019

CNN-Based Deep Architecture for Reinforced Concrete Delamination Segmentation Through Thermography
by Chongsheng Cheng et al

04-11-2019

Elucidating image-to-set prediction: An analysis of models, losses and datasets
by Luis Pineda et al

04-09-2019

A Non-linear Differential CNN-Rendering Module for 3D Data Enhancement
by Yonatan Svirsky et al

04-11-2019

Recurrent Space-time Graphs for Video Understanding
by Andrei Nicolicioiu et al

04-10-2019

Predicting Progression of Age-related Macular Degeneration from Fundus Images using Deep Learning
by Boris Babenko et al

04-11-2019

Improved training of binary networks for human pose estimation and image recognition
by Adrian Bulat et al

04-09-2019

Action Recognition from Single Timestamp Supervision in Untrimmed Videos
by Davide Moltisanti et al

04-11-2019

Detecting Repeating Objects using Patch Correlation Analysis
by Inbar Huberman et al

04-09-2019

Soft Conditional Computation
by Brandon Yang et al

04-09-2019

Towards Analyzing Semantic Robustness of Deep Neural Networks
by Abdullah Hamdi et al

04-11-2019

Software Based Higher Order Structural Foot Abnormality Detection Using Image Processing
by Arnesh Sen et al

04-11-2019

Learning joint reconstruction of hands and manipulated objects
by Yana Hasson et al

04-10-2019

Predicting Novel Views Using Generative Adversarial Query Network
by Phong Nguyen-Ha et al

04-10-2019

ThumbNet: One Thumbnail Image Contains All You Need for Recognition
by Chen Zhao et al

04-11-2019

Retinal Vessels Segmentation Based on Dilated Multi-Scale Convolutional Neural Network
by Yun Jiang et al

04-09-2019

Learning from Videos with Deep Convolutional LSTM Networks
by Logan Courtney et al

04-11-2019

Difficulty-aware Image Super Resolution via Deep Adaptive Dual-Network
by Jinghui Qin et al

04-10-2019

DSNet: An Efficient CNN for Road Scene Segmentation
by Ping-Rong Chen et al

04-09-2019

Intra-Ensemble in Neural Networks
by Yuan Gao et al

04-10-2019

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations
by Jiwoon Ahn et al

04-10-2019

Cross-lingual Visual Verb Sense Disambiguation
by Spandana Gella et al

04-10-2019

Actor-Critic Instance Segmentation
by Nikita Araslanov et al

04-09-2019

Automated Search for Configurations of Deep Neural Network Architectures
by Salah Ghamizi et al

04-11-2019

Reconstructing Network Inputs with Additive Perturbation Signatures
by Nick Moran et al

04-09-2019

Contextual Attention for Hand Detection in the Wild
by Supreeth Narasimhaswamy et al

04-12-2019

Incremental multi-domain learning with network latent tensor factorization
by Adrian Bulat et al

04-09-2019

Cross-Modal Self-Attention Network for Referring Image Segmentation
by Linwei Ye et al

04-09-2019

Multi-Agent Tensor Fusion for Contextual Trajectory Prediction
by Tianyang Zhao et al

04-09-2019

Label Propagation for Deep Semi-supervised Learning
by Ahmet Iscen et al

04-09-2019

Convolutional Temporal Attention Model for Video-based Person Re-identification
by Tanzila Rahman et al

04-09-2019

On zero-shot recognition of generic objects
by Tristan Hascoet et al

04-09-2019

POSEAMM: A Unified Framework for Solving Pose Problems using an Alternating Minimization Method
by Joao Campos et al

04-09-2019

Regression Concept Vectors for Bidirectional Explanations in Histopathology
by Mara Graziani et al

04-09-2019

Back to the Future: Knowledge Distillation for Human Action Anticipation
by Vinh Tran et al

04-10-2019

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
by Yunpeng Chen et al

04-10-2019

Relational Knowledge Distillation
by Wonpyo Park et al

04-12-2019

Generative Hybrid Representations for Activity Forecasting with No-Regret Learning
by Jiaqi Guan et al

04-10-2019

Next-Active-Object prediction from Egocentric Videos
by Antonino Furnari et al

04-12-2019

Towards Photographic Image Manipulation with Balanced Growing of Generative Autoencoders
by Ari Heljakka et al

04-10-2019

StegaStamp: Invisible Hyperlinks in Physical Photographs
by Matthew Tancik et al

04-09-2019

Data Priming Network for Automatic Check-Out
by Congcong Li et al

04-09-2019

Unsupervised 3D Pose Estimation with Geometric Self-Supervision
by Ching-Hang Chen et al

04-09-2019

Deep Virtual Networks for Memory Efficient Inference of Multiple Tasks
by Eunwoo Kim et al

04-09-2019

Rain Oer Me: Synthesizing real rain to derain with data distillation
by Huangxing Lin et al

04-10-2019

SOSNet: Second Order Similarity Regularization for Local Descriptor Learning
by Yurun Tian et al

04-09-2019

Towards High-fidelity Nonlinear 3D Face Morphable Model
by Luan Tran et al

04-10-2019

Semi-Supervised Graph Classification: A Hierarchical Graph Perspective
by Jia Li et al

04-10-2019

Imitating Targets from all sides: An Unsupervised Transfer Learning method for Person Re-identification
by Jiajie Tian et al

04-10-2019

Deep Learning Inversion of Electrical Resistivity Data
by Bin Liu et al

04-10-2019

H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions
by Bugra Tekin et al

04-09-2019

Adversarial Learning of Disentangled and Generalizable Representations for Visual Attributes
by James Oldfield et al

04-10-2019

Large-Scale Long-Tailed Recognition in an Open World
by Ziwei Liu et al

04-11-2019

TAFE-Net: Task-Aware Feature Embeddings for Low Shot Learning
by Xin Wang et al

04-12-2019

EvalNorm: Estimating Batch Normalization Statistics for Evaluation
by Saurabh Singh et al

04-09-2019

Context-Aware Embeddings for Automatic Art Analysis
by Noa Garcia et al

04-09-2019

High-Resolution Representations for Labeling Pixels and Regions
by Ke Sun et al

04-09-2019

Non-Lambertian Surface Shape and Reflectance Reconstruction Using Concentric Multi-Spectral Light Field
by Mingyuan Zhou et al

04-11-2019

Synthetic Examples Improve Generalization for Rare Classes
by Sara Beery et al

04-09-2019

CMIR-NET : A Deep Learning Based Model For Cross-Modal Retrieval In Remote Sensing
by Ushasi Chaudhuri et al

04-10-2019

Iterative Residual Refinement for Joint Optical Flow and Occlusion Estimation
by Junhwa Hur et al

04-09-2019

Gait Recognition via Disentangled Representation Learning
by Ziyuan Zhang et al

04-09-2019

Decorrelated Adversarial Learning for Age-Invariant Face Recognition
by Hao Wang et al

04-12-2019

Multimodal Machine Learning-based Knee Osteoarthritis Progression Prediction from Plain Radiographs and Clinical Data
by Aleksei Tiulpin et al

04-09-2019

Image Quality Assessment for Omnidirectional Cross-reference Stitching
by Kaiwen Yu et al

04-10-2019

Person Re-identification with Metric Learning using Privileged Information
by Xun Yang et al

04-11-2019

The Sound of Motions
by Hang Zhao et al

04-09-2019

Prime Sample Attention in Object Detection
by Yuhang Cao et al

04-09-2019

BoLTVOS: Box-Level Tracking for Video Object Segmentation
by Paul Voigtlaender et al

04-09-2019

Domain-Symmetric Networks for Adversarial Domain Adaptation
by Yabin Zhang et al

04-09-2019

Graphonomy: Universal Human Parsing via Graph Transfer Learning
by Ke Gong et al

04-12-2019

Cycle-Consistent Adversarial GAN: the integration of adversarial attack and defense
by Lingyun Jiang et al

04-09-2019

Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency
by Jia Li et al

04-12-2019

Evaluating the Representational Hub of Language and Vision Models
by Ravi Shekhar et al

04-09-2019

FAMNet: Joint Learning of Feature, Affinity and Multi-dimensional Assignment for Online Multiple Object Tracking
by Peng Chu et al

04-09-2019

Uncertainty Measures and Prediction Quality Rating for the Semantic Segmentation of Nested Multi Resolution Street Scene Images
by Matthias Rottmann et al

04-10-2019

Text Guided Person Image Synthesis
by Xingran Zhou et al

04-12-2019

Big but Imperceptible Adversarial Perturbations via Semantic Manipulation
by Anand Bhattad et al

04-10-2019

Heavy Rain Image Restoration: Integrating Physics Model and Conditional Adversarial Learning
by Ruotent Li et al

04-10-2019

DAVANet: Stereo Deblurring with View Aggregation
by Shangchen Zhou et al

04-09-2019

Multi-Target Embodied Question Answering
by Licheng Yu et al

04-09-2019

End-to-End Learning-Based Ultrasound Reconstruction
by Walter Simson et al

04-09-2019

Learning Across Tasks and Domains
by Pierluigi Zama Ramirez et al

04-09-2019

Segmentation of Skeletal Muscle in Thigh Dixon MRI Based on Texture Analysis
by Rafael Rodrigues et al

04-11-2019

Topological signature for periodic motion recognition
by Javier Lamar-Leon et al

04-11-2019

Cramnet: Layer-wise Deep Neural Network Compression with Knowledge Transfer from a Teacher Network
by Jon Hoffman

04-11-2019

An Introduction to Person Re-identification with Generative Adversarial Networks
by Hamed Alqahtani et al

04-12-2019

Digging Deeper into Egocentric Gaze Prediction
by Hamed R. Tavakoli et al

04-12-2019

Evaluating Robustness of Deep Image Super-Resolution against Adversarial Attacks
by Jun-Ho Choi et al

04-09-2019

Assessing Capsule Networks With Biased Data
by Bruno Ferrarini et al

04-09-2019

3DPeople: Modeling the Geometry of Dressed Humans
by Albert Pumarola et al

04-09-2019

PUNCH: Positive UNlabelled Classification based information retrieval in Hyperspectral images
by Anirban Santara et al

04-12-2019

PWOC-3D: Deep Occlusion-Aware End-to-End Scene Flow Estimation
by Rohan Saxena et al

04-10-2019

Efficient Retrieval of Logos Using Rough Set Reducts
by Ushasi Chaudhuri et al

04-11-2019

Real-Time Dense Stereo Embedded in A UAV for Road Inspection
by Rui Fan et al

04-12-2019

Face De-occlusion using 3D Morphable Model and Generative Adversarial Network
by Xiaowei Yuan et al

04-11-2019

A New Loss Function for CNN Classifier Based on Pre-defined Evenly-Distributed Class Centroids
by Qiuyu Zhu et al

04-10-2019

Active Multi-Kernel Domain Adaptation for Hyperspectral Image Classification
by Cheng Deng et al

04-09-2019

MVF-Net: Multi-View 3D Face Morphable Model Regression
by Fanzi Wu et al

04-12-2019

Unifying Heterogeneous Classifiers with Distillation
by Jayakorn Vongkulbhisal et al

04-10-2019

Egocentric Visitors Localization in Cultural Sites
by Francesco Ragusa et al

04-12-2019

Multi-View Region Adaptive Multi-temporal DMM and RGB Action Recognition
by Mahmoud Al-Faris et al

04-10-2019

Instance Segmentation of Biological Images Using Harmonic Embeddings
by Victor Kulikov et al

04-12-2019

ACE: Adapting to Changing Environments for Semantic Segmentation
by Zuxuan Wu et al

04-10-2019

Localized Trajectories for 2D and 3D Action Recognition
by Konstantinos Papadopoulos et al

04-11-2019

Compressing deep neural networks by matrix product operators
by Ze-Feng Gao et al

04-10-2019

Joint Manifold Diffusion for Combining Predictions on Decoupled Observations
by Kwang In Kim et al

04-10-2019

Diagnosis of Celiac Disease and Environmental Enteropathy on Biopsy Images Using Color Balancing on Convolutional Neural Networks
by Kamran Kowsari et al

04-11-2019

Absolute Human Pose Estimation with Depth Prediction Network
by Márton Véges et al

04-12-2019

An Empirical Evaluation Study on the Training of SDC Features for Dense Pixel Matching
by René Schuster et al

04-10-2019

Curriculum semi-supervised segmentation
by Hoel Kervadec et al

04-09-2019

Vision-model-based Real-time Localization of Unmanned Aerial Vehicle for Autonomous Structure Inspection under GPS-denied Environment
by Zhexiong Shang et al

04-09-2019

A Data Fusion Platform for Supporting Bridge Deck Condition Monitoring by Merging Aerial and Ground Inspection Imagery
by Zhexiong Shang et al

04-11-2019

Learning Digital Camera Pipeline for Extreme Low-Light Imaging
by Syed Waqas Zamir et al

04-12-2019

Prior-aware Neural Network for Partially-Supervised Multi-Organ Segmentation
by Yuyin Zhou et al

04-12-2019

Adaptive Weighting Multi-Field-of-View CNN for Semantic Segmentation in Pathology
by Hiroki Tokunaga et al

04-12-2019

Unsupervised Method to Localize Masses in Mammograms
by Bilal Ahmed Lodhi

04-12-2019

Generalized Presentation Attack Detection: a face anti-spoofing evaluation proposal
by Artur Costa-Pazo et al

04-11-2019

The iWildCam 2018 Challenge Dataset
by Sara Beery et al

04-11-2019

Automatic Pulmonary Nodule Detection in CT Scans Using Convolutional Neural Networks Based on Maximum Intensity Projection
by Sunyi Zheng et al

04-11-2019

A Light Dual-Task Neural Network for Haze Removal
by Yu Zhang et al

04-09-2019

Holistic and Comprehensive Annotation of Clinically Significant Findings on Diverse CT Images: Learning from Radiology Reports and Label Ontology
by Ke Yan et al

04-09-2019

UG2+2+ Track 2: A Collective Benchmark Effort for Evaluating and Advancing Image Understanding in Poor Visibility Environments
by Ye Yuan et al

04-10-2019

Evaluation of a Dual Convolutional Neural Network Architecture for Object-wise Anomaly Detection in Cluttered X-ray Security Imagery
by Yona Falinie A. Gaus et al

04-12-2019

Boundary-Preserved Deep Denoising of the Stochastic Resonance Enhanced Multiphoton Images
by Sheng-Yong Niu et al

04-12-2019

MAANet: Multi-view Aware Attention Networks for Image Super-Resolution
by Jingcai Guo et al

04-09-2019

Generative Models for Novelty Detection: Applications in abnormal event and situational change detection from data series
by Mahdyar Ravanbakhsh

04-12-2019

GeoCapsNet: Aerial to Ground view Image Geo-localization using Capsule Network
by Bin Sun et al

 
Craig Smith