2022.5.23 Vision papers

 

05-17-2022

AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
by Fangzhou Hong et al

05-18-2022

Masked Autoencoders As Spatiotemporal Learners
by Christoph Feichtenhofer et al

05-19-2022

Towards Unified Keyframe Propagation Models
by Patrick Esser et al

05-17-2022

Disentangling Visual Embeddings for Attributes and Objects
by Nirat Saini et al

05-18-2022

BodyMap: Learning Full-Body Dense Correspondence Map
by Anastasia Ianina et al

05-17-2022

Self-supervised Neural Articulated Shape and Appearance Models
by Fangyin Wei et al

05-19-2022

Robust and Efficient Medical Imaging with Self-Supervision
by Shekoofeh Azizi et al

05-19-2022

Oracle-MNIST: a Realistic Image Dataset for Benchmarking Machine Learning Algorithms
by Mei Wang et al

05-19-2022

TRT-ViT: TensorRT-oriented Vision Transformer
by Xin Xia et al

05-17-2022

Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
by Kuan Fang et al

05-17-2022

MATrIX -- Modality-Aware Transformer for Information eXtraction
by Thomas Delteil et al

05-18-2022

Training Vision-Language Transformers from Captions Alone
by Liangke Gui et al

05-17-2022

A CLIP-Hitchhiker's Guide to Long Video Retrieval
by Max Bain et al

05-18-2022

LeRaC: Learning Rate Curriculum
by Florinel-Alin Croitoru et al

05-17-2022

Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision Transformers
by Arda Sahiner et al

05-19-2022

Physically-Based Editing of Indoor Scene Lighting from a Single Image
by Zhengqin Li et al

05-19-2022

Let's Talk! Striking Up Conversations via Conversational Visual Question Generation
by Shih-Han Chan et al

05-18-2022

On the Limits of Evaluating Embodied Agent Model Generalization Using Validation Sets
by Hyounghun Kim et al

05-17-2022

Dark Solitons in Bose-Einstein Condensates: A Dataset for Many-body Physics Research
by Amilson R. Fritsch et al

05-18-2022

MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes
by Anton Ratnarajah et al

05-19-2022

Voxel-informed Language Grounding
by Rodolfo Corona et al

05-20-2022

Self-Supervised Depth Estimation with Isometric-Self-Sample-Based Learning
by Geonho Cha et al

05-19-2022

Domain Enhanced Arbitrary Image Style Transfer via Contrastive Learning
by Yuxin Zhang et al

05-20-2022

Pre-Train Your Loss: Easy Bayesian Transfer Learning with Informative Priors
by Ravid Shwartz-Ziv et al

05-19-2022

HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers
by Yu-Wei Chao et al

05-17-2022

Gender and Racial Bias in Visual Question Answering Datasets
by Yusuke Hirota et al

05-19-2022

Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
by Vikram Voleti et al

05-17-2022

Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep Neural Network, a Survey
by Paul Wimmer et al

05-19-2022

Image Augmentation Based Momentum Memory Intrinsic Reward for Sparse Reward Visual Scenes
by Zheng Fang et al

05-17-2022

Do Neural Networks Compress Manifolds Optimally?
by Sourbh Bhadane et al

05-17-2022

CellTypeGraph: A New Geometric Computer Vision Benchmark
by Lorenzo Cerrone et al

05-17-2022

Deep learning on rail profiles matching
by Kunqi Wang

05-17-2022

Hyperparameter Optimization with Neural Network Pruning
by Kangil Lee et al

05-19-2022

Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search
by Xiao Wang et al

05-19-2022

Bi-LSTM Scoring Based Similarity Measurement with Agglomerative Hierarchical Clustering (AHC) for Speaker Diarization
by Siddharth S. Nijhawan et al

05-18-2022

TTAPS: Test-Time Adaption by Aligning Prototypes using Self-Supervision
by Alexander Bartler et al

05-18-2022

Deep-learned orthogonal basis patterns for fast, noise-robust single-pixel imaging
by Ritz Ann Aguilar et al

05-18-2022

COVID-Net UV: An End-to-End Spatio-Temporal Deep Neural Network Architecture for Automated Diagnosis of COVID-19 Infection from Ultrasound Videos
by Hilda Azimi et al

05-18-2022

Computing the ensemble spread from deterministic weather predictions using conditional generative adversarial networks
by Rüdiger Brecht et al

05-18-2022

Deep Features for CBIR with Scarce Data using Hebbian Learning
by Gabriele Lagani et al

05-18-2022

Scalable Multi-view Clustering with Graph Filtering
by Liang Liu et al

05-17-2022

Brachial Plexus Nerve Trunk Segmentation Using Deep Learning: A Comparative Study with Doctors' Manual Segmentation
by Yu Wang et al

05-18-2022

Cross-subject Action Unit Detection with Meta Learning and Transformer-based Relation Modeling
by Jiyuan Cao et al

05-18-2022

It Isn't Sh!tposting, It's My CAT Posting
by Parthsarthi Rawat et al

05-18-2022

Transformer based multiple instance learning for weakly supervised histopathology image segmentation
by Ziniu Qian et al

05-19-2022

k-strip: A novel segmentation algorithm in k-space for the application of skull stripping
by Moritz Rempe et al

05-19-2022

Diverse Weight Averaging for Out-of-Distribution Generalization
by Alexandre Rame et al

05-17-2022

Vision Transformer Adapter for Dense Predictions
by Zhe Chen et al

05-19-2022

Discovering Dynamic Functional Brain Networks via Spatial and Channel-wise Attention
by Yiheng Liu et al

05-18-2022

Large Neural Networks Learning from Scratch with Very Few Data and without Regularization
by Christoph Linse et al

05-18-2022

Pluralistic Image Completion with Probabilistic Mixture-of-Experts
by Xiaobo Xia et al

05-18-2022

Visual Attention-based Self-supervised Absolute Depth Estimation using Geometric Priors in Autonomous Driving
by Jie Xiang et al

05-17-2022

blob loss: instance imbalance aware loss functions for semantic segmentation
by Florian Kofler et al

05-19-2022

Semi-Supervised Learning for Image Classification using Compact Networks in the BioMedical Context
by Adrián Inés et al

05-17-2022

SemiCurv: Semi-Supervised Curvilinear Structure Segmentation
by Xun Xu et al

05-18-2022

Remote Sensing Novel View Synthesis with Implicit Multiplane Representations
by Yongchang Wu et al

05-18-2022

Global Contrast Masked Autoencoders Are Powerful Pathological Representation Learners
by Hao Quan et al

05-18-2022

Constraining the Attack Space of Machine Learning Models with Distribution Clamping Preprocessing
by Ryan Feng et al

05-17-2022

K-textures, a self supervised hard clustering deep learning algorithm for satellite images segmentation
by Fabien H. Wagner et al

05-17-2022

Conditional Visual Servoing for Multi-Step Tasks
by Sergio Izquierdo et al

05-20-2022

UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes
by Alexander Kolesnikov et al

05-19-2022

Focused Adversarial Attacks
by Thomas Cilloni et al

05-18-2022

Sparse MDOD: Training End-to-End Multi-Object Detector without Bipartite Matching
by Jaeyoung Yoo et al

05-19-2022

Estimating the ultrasound attenuation coefficient using convolutional neural networks -- a feasibility study
by Piotr Jarosik et al

05-18-2022

Validation of a photogrammetric approach for the study of ancient bowed instruments
by Philémon Beghin et al

05-17-2022

Uncertainty-based Network for Few-shot Image Classification
by Minglei Yuan et al

05-17-2022

UnPWC-SVDLO: Multi-SVD on PointPWC for Unsupervised Lidar Odometry
by Yiming Tu

05-18-2022

Passive Defense Against 3D Adversarial Point Clouds Through the Lens of 3D Steganalysis
by Jiahao Zhu

05-17-2022

Computerized Tomography Pulmonary Angiography Image Simulation using Cycle Generative Adversarial Network from Chest CT imaging in Pulmonary Embolism Patients
by Chia-Hung Yang et al

05-19-2022

BabyNet: Residual Transformer Module for Birth Weight Prediction on Fetal Ultrasound Video
by Szymon Płotka et al

05-19-2022

A Topological Approach for Semi-Supervised Learning
by Adrián Inés et al

05-19-2022

CLCNet: Rethinking of Ensemble Modeling with Classification Confidence Network
by Yao-Ching Yu et al

05-18-2022

RandomMix: A mixed sample data augmentation method with multiple mixed modes
by Xiaoliang Liu et al

05-17-2022

Learnable Optimal Sequential Grouping for Video Scene Detection
by Daniel Rotman et al

05-17-2022

Pairwise Comparison Network for Remote Sensing Scene Classification
by Zhang Yue et al

05-18-2022

Speckle Image Restoration without Clean Data
by Tsung-Ming Tai et al

05-18-2022

Bayesian Convolutional Neural Networks for Limited Data Hyperspectral Remote Sensing Image Classification
by Mohammad Joshaghani et al

05-19-2022

BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving
by Yunpeng Zhang et al

05-20-2022

Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions
by Rui Yang et al

05-17-2022

DynPL-SVO: A New Method Using Point and Line Features for Stereo Visual Odometry in Dynamic Scenes
by Xiaoguang Ma et al

05-17-2022

Self-Supervised Learning of Multi-Object Keypoints for Robotic Manipulation
by Jan Ole von Hartz et al

05-18-2022

3D Segmentation Guided Style-based Generative Adversarial Networks for PET Synthesis
by Yang Zhou et al

05-17-2022

Region-Aware Metric Learning for Open World Semantic Segmentation via Meta-Channel Aggregation
by Hexin Dong et al

05-18-2022

VRAG: Region Attention Graphs for Content-Based Video Retrieval
by Kennard Ng et al

05-17-2022

Learning Monocular Depth Estimation via Selective Distillation of Stereo Knowledge
by Kyeongseob Song et al

05-17-2022

Using artificial intelligence to detect chest X-rays with no significant findings in a primary health care setting in Oulu, Finland
by Tommi Keski-Filppula et al

05-19-2022

Unconventional Visual Sensors for Autonomous Vehicles
by You Li et al

05-17-2022

Application of Graph Based Features in Computer Aided Diagnosis for Histopathological Image Classification of Gastric Cancer
by Haiqing Zhang et al

05-19-2022

EXACT: How to Train Your Accuracy
by Ivan Karpukhin et al

05-18-2022

Anomaly detection using prediction error with Spatio-Temporal Convolutional LSTM
by Hanh Thi Minh Tran et al

05-18-2022

PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation with Photometrically Challenging Objects
by Pengyuan Wang et al

05-18-2022

Financial Time Series Data Augmentation with Generative Adversarial Networks and Extended Intertemporal Return Plots
by Justin Hellermann et al

05-18-2022

Empirical Advocacy of Bio-inspired Models for Robust Image Recognition
by Harshitha Machiraju et al

05-18-2022

Positional Information is All You Need: A Novel Pipeline for Self-Supervised SVDE from Videos
by Juan Luis Gonzalez Bello et al

05-18-2022

Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions
by Xinpeng Ding et al

05-19-2022

Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection
by Zhuoling Li et al

05-17-2022

CAS-Net: Conditional Atlas Generation and Brain Segmentation for Fetal MRI
by Liu Li et al

05-19-2022

On Demographic Bias in Fingerprint Recognition
by Akash Godbole et al

05-20-2022

Kernel Normalized Convolutional Networks
by Reza Nasirigerdeh et al

05-20-2022

Swapping Semantic Contents for Mixing Images
by Rémy Sun et al

05-19-2022

CORPS: Cost-free Rigorous Pseudo-labeling based on Similarity-ranking for Brain MRI Segmentation
by Can Taylan Sari et al

05-19-2022

A Sub-pixel Accurate Quantification of Joint Space Narrowing Progression in Rheumatoid Arthritis
by Yafei Ou et al

05-17-2022

Semi-Supervised Building Footprint Generation with Feature and Output Consistency Training
by Qingyu Li et al

05-20-2022

Towards the Generation of Synthetic Images of Palm Vein Patterns: A Review
by Edwin H. Salazar-Jurado et al

05-17-2022

A Linear Comb Filter for Event Flicker Removal
by Ziwei Wang et al

05-17-2022

Efficient Stereo Depth Estimation for Pseudo LiDAR: A Self-Supervised Approach Based on Multi-Input ResNet Encoder
by Sabir Hossain et al

05-19-2022

A Comparative Study of Feature Expansion Unit for 3D Point Cloud Upsampling
by Qiang Li et al

05-18-2022

Support-set based Multi-modal Representation Enhancement for Video Captioning
by Xiaoya Chen et al

05-17-2022

Unified Interactive Image Matting
by Stephen D. H. Yang et al

05-17-2022

GraphMapper: Efficient Visual Navigation by Scene Graph Generation
by Zachary Seymour et al

05-19-2022

On Trace of PGD-Like Adversarial Attacks
by Mo Zhou et al

05-19-2022

A graph-transformer for whole slide image classification
by Yi Zheng et al

05-18-2022

Trading Positional Complexity vs. Deepness in Coordinate Networks
by Jianqiao Zheng et al

05-20-2022

The developmental trajectory of object recognition robustness: children are like small adults but unlike big deep neural networks
by Lukas S. Huber et al

05-17-2022

Exploring the Adjugate Matrix Approach to Quaternion Pose Extraction
by Andrew J. Hanson et al

05-18-2022

A lightweight multi-scale context network for salient object detection in optical remote sensing images
by Yuhan Lin et al

05-19-2022

Plane Geometry Diagram Parsing
by Ming-Liang Zhang et al

05-17-2022

ColonFormer: An Efficient Transformer based Method for Colon Polyp Segmentation
by Nguyen Thanh Duc et al

05-17-2022

Semantically Accurate Super-Resolution Generative Adversarial Networks
by Tristan Frizza et al

05-20-2022

A Demographic Attribute Guided Approach to Age Estimation
by Zhicheng Cao et al

05-19-2022

Transferable Physical Attack against Object Detection with Separable Attention
by Yu Zhang et al

05-20-2022

Diverse super-resolution with pretrained deep hierarchical VAEs
by Jean Prost et al

05-17-2022

HoVer-Trans: Anatomy-aware HoVer-Transformer for ROI-free Breast Cancer Diagnosis in Ultrasound Images
by Yuhao Mo et al

05-20-2022

B-cos Networks: Alignment is All We Need for Interpretability
by Moritz Böhle et al

05-19-2022

Mip-NeRF RGB-D: Depth Assisted Fast Neural Radiance Fields
by Arnab Dey et al

05-19-2022

Enhancing the Transferability of Adversarial Examples via a Few Queries
by Xiangyuan Yang et al

05-19-2022

Light In The Black: An Evaluation of Data Augmentation Techniques for COVID-19 CTs Semantic Segmentation
by Bruno A. Krinski et al

05-18-2022

3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
by Minh Tran et al

05-20-2022

How to Guide Adaptive Depth Sampling?
by Ilya Tcenov et al

05-20-2022

Few-Shot Font Generation by Learning Fine-Grained Local Styles
by Licheng Tang et al

05-20-2022

E-Scooter Rider Detection and Classification in Dense Urban Environments
by Shane Gilroy et al

05-20-2022

Test-time Batch Normalization
by Tao Yang et al

05-19-2022

Cross-Enhancement Transformer for Action Segmentation
by Jiahui Wang et al

05-17-2022

RARITYNet: Rarity Guided Affective Emotion Learning Framework
by Monu Verma et al

05-17-2022

Label-Efficient Self-Supervised Federated Learning for Tackling Data Heterogeneity in Medical Imaging
by Rui Yan et al

05-20-2022

Visual Concepts Tokenization
by Tao Yang et al

05-19-2022

PYSKL: Towards Good Practices for Skeleton Action Recognition
by Haodong Duan et al

05-17-2022

Text Detection & Recognition in the Wild for Robot Localization
by Zobeir Raisi et al

05-19-2022

VNT-Net: Rotational Invariant Vector Neuron Transformers
by Hedi Zisling et al

05-17-2022

Towards Robust Low Light Image Enhancement
by Sara Aghajanzadeh et al

05-20-2022

Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging
by Yuanhao Cai et al

05-17-2022

Unsupervised Segmentation in Real-World Images via Spelke Object Inference
by Honglin Chen et al

05-20-2022

Self-supervised 3D anatomy segmentation using self-distilled masked image transformer (SMIT)
by Jue Jiang et al

05-19-2022

Learning Feature Fusion for Unsupervised Domain Adaptive Person Re-identification
by Jin Ding et al

05-20-2022

Constructive Interpretability with CoLabel: Corroborative Integration, Complementary Features, and Collaborative Learning
by Abhijit Suprem et al

05-20-2022

Efficient visual object representation using a biologically plausible spike-latency code and winner-take-all inhibition
by Melani Sanchez-Garcia et al

05-20-2022

UCC: Uncertainty guided Cross-head Co-training for Semi-Supervised Semantic Segmentation
by Jiashuo Fan et al

05-19-2022

UIF: An Objective Quality Assessment for Underwater Image Enhancement
by Yannan Zheng et al

05-20-2022

Unintended memorisation of unique features in neural networks
by John Hartley et al

05-20-2022

Analysis of Co-Laughter Gesture Relationship on RGB videos in Dyadic Conversation Context
by Hugo Bohy et al

05-17-2022

Detection Masking for Improved OCR on Noisy Documents
by Daniel Rotman et al

05-19-2022

Masked Image Modeling with Denoising Contrast
by Kun Yi et al

05-19-2022

A Peek at Peak Emotion Recognition
by Tzvi Michelson et al

05-20-2022

Learning to Count Anything: Reference-less Class-agnostic Counting with Weak Supervision
by Michael Hobley et al

05-20-2022

Structured Attention Composition for Temporal Action Localization
by Le Yang et al

05-20-2022

Advanced Feature Learning on Point Clouds using Multi-resolution Features and Learnable Pooling
by Kevin Tirta Wijaya et al

05-20-2022

Mask-guided Vision Transformer (MG-ViT) for Few-Shot Learning
by Yuzhong Chen et al

05-20-2022

People Tracking and Re-Identifying in Distributed Contexts: Extension of PoseTReID
by Ratha Siv et al

05-20-2022

Unsupervised Flow-Aligned Sequence-to-Sequence Learning for Video Restoration
by Jing Lin et al

05-17-2022

Privacy Preserving Image Registration
by Riccardo Taiello et al

05-19-2022

Hyperspectral Unmixing Based on Nonnegative Matrix Factorization: A Comprehensive Review
by Xin-Ru Feng et al

05-19-2022

Clustering as Attention: Unified Image Segmentation with Hierarchical Clustering
by Teppei Suzuki

05-17-2022

MulT: An End-to-End Multitask Learning Transformer
by Deblina Bhattacharjee et al

05-19-2022

Identifying outliers in astronomical images with unsupervised machine learning
by Yang Han et al

05-19-2022

Label-invariant Augmentation for Semi-Supervised Graph Classification
by Han Yue et al

05-20-2022

Emergence of Double-slit Interference by Representing Visual Space in Artificial Neural Networks
by Xiuxiu Bai et al

05-19-2022

PGDP5K: A Diagram Parsing Dataset for Plane Geometry Problems
by Yihan Hao et al

05-20-2022

MSTRIQ: No Reference Image Quality Assessment Based on Swin Transformer with Multi-Stage Fusion
by Jing Wang et al

05-20-2022

Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality
by Xiang Li et al

05-20-2022

Enriching StyleGAN with Illumination Physics
by Anand Bhattad et al

05-20-2022

InDistill: Transferring Knowledge From Pruned Intermediate Layers
by Ioannis Sarridis et al

05-20-2022

Reliability-based Mesh-to-Grid Image Reconstruction
by Ján Koloda et al

05-20-2022

Action parsing using context features
by Nagita Mehrseresht

05-20-2022

A Novel Underwater Image Enhancement and Improved Underwater Biological Detection Pipeline
by Zheng Liu et al

05-20-2022

Contrastive Learning with Cross-Modal Knowledge Mining for Multimodal Human Activity Recognition
by Razvan Brinzea et al

05-20-2022

Assessing Demographic Bias Transfer from Dataset to Model: A Case Study in Facial Expression Recognition
by Iris Dominguez-Catena et al

05-19-2022

Deep transfer learning for image classification: a survey
by Jo Plested et al

05-20-2022

Salient Skin Lesion Segmentation via Dilated Scale-Wise Feature Fusion Network
by Pourya Shamsolmoali et al

05-20-2022

Compression ensembles quantify aesthetic complexity and the evolution of visual art
by Andres Karjus et al

05-19-2022

Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
by Xiaosong Zhang et al

05-19-2022

Beyond Labels: Visual Representations for Bone Marrow Cell Morphology Recognition
by Shayan Fazeli et al

05-19-2022

Subcellular Protein Localisation in the Human Protein Atlas using Ensembles of Diverse Deep Architectures
by Syed Sameed Husain et al

05-19-2022

Real Time Multi-Object Detection for Helmet Safety
by Mrinal Mathur et al

05-19-2022

Human Gender Prediction Based on Deep Transfer Learning from Panoramic Radiograph Images
by I. Atas

05-19-2022

Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video
by Dipan Mandal et al

05-17-2022

User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars
by Cristian J. Vaca-Rubio et al

05-19-2022

Generation of Artificial CT Images using Patch-based Conditional Generative Adversarial Networks
by Marija Habijan et al

 
Craig Smith