2022.4.4 Vision papers

 

03-30-2022

Exploring Plain Vision Transformer Backbones for Object Detection
by Yanghao Li et al

03-29-2022

Contrasting the landscape of contrastive and non-contrastive learning
by Ashwini Pokle et al

03-31-2022

MyStyle: A Personalized Generative Prior
by Yotam Nitzan et al

03-31-2022

Visual Prompting: Modifying Pixel Space to Adapt Pre-trained Models
by Hyojin Bahng et al

03-29-2022

Dressing in the Wild by Watching Dance Videos
by Xin Dong et al

03-30-2022

VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
by Estelle Aflalo et al

03-31-2022

Bringing Old Films Back to Life
by Ziyu Wan et al

03-29-2022

EnvEdit: Environment Editing for Vision-and-Language Navigation
by Jialu Li et al

03-31-2022

R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis
by Huan Wang et al

03-30-2022

FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations
by Lingjie Mei et al

03-30-2022

CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs
by Jiteng Mu et al

03-30-2022

MeMOT: Multi-Object Tracking with Memory
by Jiarui Cai et al

04-01-2022

Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
by Andy Zeng et al

03-29-2022

Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation
by Xiao Fu et al

03-29-2022

ITTR: Unpaired Image-to-Image Translation with Transformers
by Wanfeng Zheng et al

03-31-2022

DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools
by Xingyu Lin et al

03-30-2022

TubeDETR: Spatio-Temporal Video Grounding with Transformers
by Antoine Yang et al

03-31-2022

Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis
by Karren Yang et al

03-31-2022

Continuous Scene Representations for Embodied AI
by Samir Yitzhak Gadre et al

03-30-2022

CaDeX: Learning Canonical Deformation Coordinate Space for Dynamic Surface Representation via Neural Homeomorphism
by Jiahui Lei et al

03-29-2022

Diffusion Models for Counterfactual Explanations
by Guillaume Jeanneret et al

03-29-2022

Fine-tuning Image Transformers using Learnable Memory
by Mark Sandler et al

03-30-2022

To Find Waldo You Need Contextual Cues: Debiasing Whos Waldo
by Yiran Luo et al

04-01-2022

Simplicial Embeddings in Self-Supervised Learning and Downstream Classification
by Samuel Lavoie et al

03-29-2022

Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries
by Jihwan Bang et al

03-30-2022

Balanced MSE for Imbalanced Visual Regression
by Jiawei Ren et al

03-30-2022

AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift
by Burak Yildiz et al

03-30-2022

DDNeRF: Depth Distribution Neural Radiance Fields
by David Dadon et al

03-31-2022

TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
by Yanbo Xu et al

03-31-2022

Generating High Fidelity Data from Low-density Regions using Diffusion Models
by Vikash Sehwag et al

03-29-2022

Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images
by Ayush Tewari et al

04-01-2022

Perception Prioritized Training of Diffusion Models
by Jooyoung Choi et al

03-29-2022

Iterative Deep Homography Estimation
by Si-Yuan Cao et al

03-29-2022

Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian
by Jihyun Lee et al

03-30-2022

Enhancing Cancer Prediction in Challenging Screen-Detected Incident Lung Nodules Using Time-Series Deep Learning
by Shahab Aslani et al

03-31-2022

Towards Driving-Oriented Metric for Lane Detection Models
by Takami Sato et al

03-29-2022

Parameter-efficient Fine-tuning for Vision Transformers
by Xuehai He et al

03-29-2022

DRaCoN -- Differentiable Rasterization Conditioned Neural Radiance Fields for Articulated Avatars
by Amit Raj et al

03-29-2022

MatteFormer: Transformer-Based Image Matting via Prior-Tokens
by GyuTae Park et al

03-31-2022

BEVFormer: Learning Birds-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
by Zhiqi Li et al

03-31-2022

Its All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
by Kanghyun Choi et al

03-29-2022

Image Retrieval from Contextual Descriptions
by Benno Krojer et al

03-30-2022

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
by Mengjun Cheng et al

03-30-2022

Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks
by Yang Shao et al

03-31-2022

A Closer Look at Rehearsal-Free Continual Learning
by James Seale Smith et al

03-29-2022

Integrative Few-Shot Learning for Classification and Segmentation
by Dahyun Kang et al

03-30-2022

Exploiting Explainable Metrics for Augmented SGD
by Mahdi S. Hosseini et al

03-29-2022

SepViT: Separable Vision Transformer
by Wei Li et al

03-30-2022

Online Motion Style Transfer for Interactive Character Control
by Yingtian Tang et al

03-29-2022

A Style-aware Discriminator for Controllable Image Translation
by Kunhee Kim et al

03-31-2022

SimVQA: Exploring Simulated Environments for Visual Question Answering
by Paola Cascante-Bonilla et al

03-29-2022

ME-CapsNet: A Multi-Enhanced Capsule Networks with Routing Mechanism
by Jerrin Bright et al

03-31-2022

Cross-modal Learning of Graph Representations using Radar Point Cloud for Long-Range Gesture Recognition
by Souvik Hazra et al

03-29-2022

BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information
by Nadine Rueegg et al

03-30-2022

Fast, Accurate and Memory-Efficient Partial Permutation Synchronization
by Shaohan Li et al

03-29-2022

Classification of Hyperspectral Images Using SVM with Shape-adaptive Reconstruction and Smoothed Total Variation
by Ruoning Li et al

03-31-2022

Mutual Scene Synthesis for Mixed Reality Telepresence
by Mohammad Keshavarzi et al

03-30-2022

ReSTR: Convolution-free Referring Image Segmentation Using Transformers
by Namyup Kim et al

03-29-2022

Deep Equilibrium Assisted Block Sparse Coding of Inter-dependent Signals: Application to Hyperspectral Imaging
by Alexandros Gkillas et al

03-30-2022

HDSDF: Hybrid Directional and Signed Distance Functions for Fast Inverse Rendering
by Tarun Yenamandra et al

03-31-2022

Human Instance Segmentation and Tracking via Data Association and Single-stage Detector
by Lu Cheng et al

04-01-2022

Autoencoder Attractors for Uncertainty Estimation
by Steve Dias Da Cruz et al

03-29-2022

Improved Counting and Localization from Density Maps for Object Detection in 2D and 3D Microscopy Imaging
by Shijie Li et al

03-29-2022

Treatment Learning Transformer for Noisy Image Classification
by Chao-Han Huck Yang et al

03-31-2022

Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond
by Yi Yu et al

03-29-2022

SHOP: A Deep Learning Based Pipeline for near Real-Time Detection of Small Handheld Objects Present in Blurry Video
by Abhinav Ganguly et al

03-29-2022

How Deep is Your Art: An Experimental Study on the Limits of Artistic Understanding in a Single-Task, Single-Modality Neural Network
by Mahan Agha Zahedi et al

03-29-2022

Deep Reinforcement Learning for Data-Driven Adaptive Scanning in Ptychography
by Marcel Schloz et al

03-31-2022

Multimodal Fusion Transformer for Remote Sensing Image Classification
by Swalpa Kumar Roy et al

03-30-2022

Federated Learning for the Classification of Tumor Infiltrating Lymphocytes
by Ujjwal Baid et al

03-31-2022

Model Predictive Control for Fluid Human-to-Robot Handovers
by Wei Yang et al

03-29-2022

AutoCoMet: Smart Neural Architecture Search via Co-Regulated Shaping Reinforcement
by Mayukh Das et al

03-31-2022

Ternary and Binary Quantization for Improved Classification
by Weizhi Lu et al

03-29-2022

CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters
by Paul Gavrikov et al

03-31-2022

Few-Shot Class-Incremental Learning by Sampling Multi-Phase Tasks
by Da-Wei Zhou et al

03-30-2022

Unseen Classes at a Later Time? No Problem
by Hari Chandana Kuchibhotla et al

03-29-2022

The Sound of Bounding-Boxes
by Takashi Oya et al

03-30-2022

An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection
by Sheng Xu

03-29-2022

A deep learning model for burn depth classification using ultrasound imaging
by Sangrock Lee et al

03-30-2022

Biclustering Algorithms Based on Metaheuristics: A Review
by Adan Jose-Garcia et al

03-29-2022

Equivariance Allows Handling Multiple Nuisance Variables When Analyzing Pooled Neuroimaging Datasets
by Vishnu Suresh Lokhande et al

03-29-2022

AutoPoly: Predicting a Polygonal Mesh Construction Sequence from a Silhouette Image
by I-Chao Shen et al

03-29-2022

Learning Structured Gaussians to Approximate Deep Ensembles
by Ivor J. A. Simpson et al

03-29-2022

Transformer Inertial Poser: Attention-based Real-time Human Motion Reconstruction from Sparse IMUs
by Yifeng Jiang et al

03-29-2022

Zero-Query Transfer Attacks on Context-Aware Object Detectors
by Zikui Cai et al

03-31-2022

FindIt: Generalized Localization with Natural Language Queries
by Weicheng Kuo et al

04-01-2022

Selecting task with optimal transport self-supervised learning for few-shot classification
by Renjie Xu et al

03-29-2022

Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage
by Zhuohang Li et al

03-30-2022

Learning Local Displacements for Point Cloud Completion
by Yida Wang et al

03-31-2022

Deep Hyperspectral Unmixing using Transformer Network
by Preetam Ghosh et al

03-31-2022

3D Equivariant Graph Implicit Functions
by Yunlu Chen et al

03-31-2022

Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
by Tong Zhang et al

03-31-2022

Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion
by Stepan Tulyakov et al

03-31-2022

Rethinking Portrait Matting with Privacy Preserving
by Sihan Ma et al

03-30-2022

COSMOS: Cross-Modality Unsupervised Domain Adaptation for 3D Medical Image Segmentation based on Target-aware Domain Translation and Iterative Self-Training
by Hyungseob Shin et al

03-29-2022

TransductGAN: a Transductive Adversarial Model for Novelty Detection
by Najiba Toron et al

03-31-2022

Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions
by Van Nguyen Nguyen et al

03-31-2022

Do Vision-Language Pretrained Models Learn Primitive Concepts?
by Tian Yun et al

03-30-2022

Knowledge-based Entity Prediction for Improved Machine Perception in Autonomous Systems
by Ruwan Wickramarachchi et al

04-01-2022

Connect, Not Collapse: Explaining Contrastive Learning for Unsupervised Domain Adaptation
by Kendrick Shen et al

03-31-2022

An End-to-end Supervised Domain Adaptation Framework for Cross-Domain Change Detection
by Jia Liu et al

03-30-2022

Forecasting from LiDAR via Future Object Detection
by Neehar Peri et al

03-29-2022

Agreement or Disagreement in Noise-tolerant Mutual Learning?
by Jiarun Liu et al

03-31-2022

Semi-Weakly Supervised Object Detection by Sampling Pseudo Ground-Truth Boxes
by Akhil Meethal et al

03-29-2022

Photographic Visualization of Weather Forecasts with Generative Adversarial Networks
by Christian Sigg et al

03-31-2022

Adaptive Mean-Residue Loss for Robust Facial Age Estimation
by Ziyuan Zhao et al

03-30-2022

PromptDet: Expand Your Detector Vocabulary with Uncurated Images
by Chengjian Feng et al

03-31-2022

Measuring hand use in the home after cervical spinal cord injury using egocentric video
by Andrea Bandini et al

03-31-2022

A Unified Framework for Domain Adaptive Pose Estimation
by Donghyun Kim et al

03-30-2022

Learning Program Representations for Food Images and Cooking Recipes
by Dim P. Papadopoulos et al

03-30-2022

Recommendation of Compatible Outfits Conditioned on Style
by Debopriyo Banerjee et al

03-30-2022

FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing
by Rishubh Singh et al

03-29-2022

Kernel Modulation: A Parameter-Efficient Method for Training Convolutional Neural Networks
by Yuhuang Hu et al

03-29-2022

Vision Transformers in Medical Computer Vision -- A Contemplative Retrospection
by Arshi Parvaiz et al

03-29-2022

Abstract Flow for Temporal Semantic Segmentation on the Permutohedral Lattice
by Peer Schütt et al

03-29-2022

Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis
by Yunlong Zhang et al

03-30-2022

On learning adaptive acquisition policies for undersampled multi-coil MRI reconstruction
by Tim Bakker et al

03-30-2022

AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
by Riku Togashi et al

04-01-2022

Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization
by Eunji Kim et al

03-29-2022

Quantifying Societal Bias Amplification in Image Captioning
by Yusuke Hirota et al

03-31-2022

Rethinking Video Salient Object Ranking
by Jiaying Lin et al

03-29-2022

Efficient Reflectance Capture with a Deep Gated Mixture-of-Experts
by Xiaohe Ma et al

03-29-2022

VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics
by Haresh Karnan et al

04-01-2022

ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition
by Jun Kimata et al

04-01-2022

Online panoptic 3D reconstruction as a Linear Assignment Problem
by Leevi Raivio et al

03-29-2022

Target and Task specific Source-Free Domain Adaptive Image Segmentation
by Vibashan VS et al

03-31-2022

ImpDet: Exploring Implicit Fields for 3D Object Detection
by Xuelin Qian et al

03-31-2022

MPS-NeRF: Generalizable 3D Human Rendering from Multiview Images
by Xiangjun Gao et al

03-31-2022

A Dataset of Images of Public Streetlights with Operational Monitoring using Computer Vision Techniques
by Ioannis Mavromatis et al

03-29-2022

Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection
by Jingqun Tang et al

03-29-2022

SIOD: Single Instance Annotated Per Category Per Image for Object Detection
by Hanjun Li et al

03-30-2022

Task Adaptive Parameter Sharing for Multi-Task Learning
by Matthew Wallingford et al

04-01-2022

MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration
by Chenzhong Gao et al

03-30-2022

Knowledge-Spreader: Learning Facial Action Unit Dynamics with Extremely Limited Labels
by Xiaotian Li et al

03-29-2022

MAT: Mask-Aware Transformer for Large Hole Image Inpainting
by Wenbo Li et al

03-31-2022

End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps
by Ke Guo et al

03-29-2022

Using Active Speaker Faces for Diarization in TV shows
by Rahul Sharma et al

03-29-2022

Deeply Interleaved Two-Stream Encoder for Referring Video Segmentation
by Guang Feng et al

03-30-2022

Investigating Top-kk White-Box and Transferable Black-box Attack
by Chaoning Zhang et al

03-29-2022

MAP-Gen: An Automated 3D-Box Annotation Flow with Multimodal Attention Point Generator
by Chang Liu et al

03-31-2022

Contributions to interframe coding
by Marcos Faundez-Zanuy et al

03-31-2022

Automatic Classification of Alzheimers Disease using brain MRI data and deep Convolutional Neural Networks
by Zahraa Sh. Aaraji et al

04-01-2022

Quantized GAN for Complex Music Generation from Dance Videos
by Ye Zhu et al

03-31-2022

A Survey of Robust 3D Object Detection Methods in Point Clouds
by Walter Zimmer et al

03-30-2022

Personalized Image Aesthetics Assessment with Rich Attributes
by Yuzhe Yang et al

03-31-2022

Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning
by Semih Orhan et al

03-29-2022

Monitored Distillation for Positive Congruent Depth Completion
by Tian Yu Liu et al

03-31-2022

GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature
by Biyang Liu et al

03-30-2022

Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
by Feng Cheng et al

03-30-2022

Casual 6-DoF: free-viewpoint panorama using a handheld 360 camera
by Rongsen Chen et al

03-31-2022

Dynamic Multimodal Fusion
by Zihui Xue et al

03-31-2022

CADG: A Model Based on Cross Attention for Domain Generalization
by Cheng Dai et al

03-30-2022

Sensor Data Validation and Driving Safety in Autonomous Driving Systems
by Jindi Zhang

03-29-2022

Meta-Sampler: Almost-Universal yet Task-Oriented Sampling for Point Clouds
by Ta-Ying Cheng et al

03-30-2022

Multi-Robot Active Mapping via Neural Bipartite Graph Matching
by Kai Ye et al

03-31-2022

Real-Time and Robust 3D Object Detection Within Road-Side LiDARs Using Domain Adaptation
by Walter Zimmer et al

03-29-2022

High-resolution Face Swapping via Latent Semantics Disentanglement
by Yangyang Xu et al

03-29-2022

StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis
by Zhiheng Li et al

03-29-2022

Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection
by Vibashan VS et al

03-30-2022

LAKe-Net: Topology-Aware Point Cloud Completion by Localizing Aligned Keypoints
by Junshu Tang et al

03-29-2022

PSMNet: Position-aware Stereo Merging Network for Room Layout Estimation
by Haiyan Wang et al

03-30-2022

Learning the Effect of Registration Hyperparameters with HyperMorph
by Andrew Hoopes et al

04-01-2022

Proper Reuse of Image Classification Features Improves Object Detection
by Cristina Vasconcelos et al

03-31-2022

Stereo Unstructured Magnification: Multiple Homography Image for View Synthesis
by Qi Zhang et al

03-31-2022

LASER: LAtent SpacE Rendering for 2D Visual Localization
by Zhixiang Min et al

03-29-2022

Proactive Image Manipulation Detection
by Vishal Asnani et al

03-30-2022

Tampered VAE for Improved Satellite Image Time Series Classification
by Xin Cai et al

04-01-2022

Epipolar Focus Spectrum: A Novel Light Field Representation and Application in Dense-view Reconstruction
by Yaning Li et al

03-29-2022

Fine-Grained Object Classification via Self-Supervised Pose Alignment
by Xuhui Yang et al

03-30-2022

Collaborative Transformers for Grounded Situation Recognition
by Junhyeong Cho et al

03-30-2022

End to End Lip Synchronization with a Temporal AutoEncoder
by Yoav Shalev et al

03-30-2022

Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection
by Jinyuan Liu et al

03-29-2022

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision
by Kehong Gong et al

04-01-2022

Autoencoder for Synthetic to Real Generalization: From Simple to More Complex Scenes
by Steve Dias Da Cruz et al

03-30-2022

CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation
by Ziqi Zhang et al

03-30-2022

SpatioTemporal Focus for Skeleton-based Action Recognition
by Liyu Wu et al

04-01-2022

Face identification by means of a neural net classifier
by Virginia Espinosa-Duro et al

03-29-2022

StyleFool: Fooling Video Classification Systems via Style Transfer
by Yuxin Cao et al

04-01-2022

Comparison of convolutional neural networks for cloudy optical images reconstruction from single or multitemporal joint SAR and optical images
by Rémi Cresson et al

03-29-2022

CHEX: CHannel EXploration for CNN Model Compression
by Zejiang Hou et al

03-29-2022

Neural Face Video Compression using Multiple Views
by Anna Volokitin et al

04-01-2022

Learning to Deblur using Light Field Generated and Real Defocus Images
by Lingyan Ruan et al

03-30-2022

Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data
by Corentin Sautier et al

03-29-2022

Text-Driven Video Acceleration: A Weakly-Supervised Reinforcement Learning Method
by Washington Ramos et al

03-31-2022

Speaker Extraction with Co-Speech Gestures Cue
by Zexu Pan et al

03-30-2022

AdaMixer: A Fast-Converging Query-Based Object Detector
by Ziteng Gao et al

03-29-2022

Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production
by Ben Saunders et al

03-31-2022

Dynamic Supervisor for Cross-dataset Object Detection
by Ze Chen et al

03-29-2022

A Computational Architecture for Machine Consciousness and Artificial Superintelligence: Updating Working Memory Iteratively
by Jared Edward Reser

03-31-2022

CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow
by Xiuchao Sui et al

03-31-2022

Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds
by Zhao Jin et al

03-30-2022

ConceptEvo: Interpreting Concept Evolution in Deep Learning Training
by Haekyu Park et al

04-01-2022

On the Importance of Asymmetry for Siamese Representation Learning
by Xiao Wang et al

03-31-2022

Investigating Modality Bias in Audio Visual Video Parsing
by Piyush Singh Pasi et al

03-30-2022

Region of Interest focused MRI to Synthetic CT Translation using Regression and Classification Multi-task Network
by Sandeep Kaushik et al

03-29-2022

Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation
by Zhenguang Liu et al

03-29-2022

Nested Collaborative Learning for Long-Tailed Visual Recognition
by Jun Li et al

03-29-2022

End-to-End Transformer Based Model for Image Captioning
by Yiyu Wang et al

03-31-2022

GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
by Sijie Zhu et al

04-01-2022

Unimodal-Concentrated Loss: Fully Adaptive Label Distribution Learning for Ordinal Regression
by Qiang Li et al

03-30-2022

PIE-Net: Photometric Invariant Edge Guided Network for Intrinsic Image Decomposition
by Partha Das et al

03-29-2022

Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots
by Pranay Mathur et al

03-31-2022

Efficient Maximal Coding Rate Reduction by Variational Forms
by Christina Baek et al

03-30-2022

Acknowledging the Unknown for Multi-label Learning with Single Positive Labels
by Donghao Zhou et al

03-29-2022

Self-Supervised Image Representation Learning with Geometric Set Consistency
by Nenglun Chen et al

03-29-2022

Domain Invariant Siamese Attention Mask for Small Object Change Detection via Everyday Indoor Robot Navigation
by Koji Takeda et al

03-30-2022

Learning of Global Objective for Network Flow in Multi-Object Tracking
by Shuai Li et al

03-29-2022

Semantic Line Detection Using Mirror Attention and Comparative Ranking and Matching
by Dongkwon Jin et al

03-30-2022

SIT: A Bionic and Non-Linear Neuron for Spiking Neural Network
by Cheng Jin et al

03-29-2022

Angular Super-Resolution in Diffusion MRI with a 3D Recurrent Convolutional Autoencoder
by Matthew Lyon et al

03-29-2022

mc-BEiT: Multi-choice Discretization for Image BERT Pre-training
by Xiaotong Li et al

03-29-2022

Texture based Prototypical Network for Few-Shot Semantic Segmentation of Forest Cover: Generalizing for Different Geographical Regions
by Gokul P et al

03-29-2022

An EEG-Based Multi-Modal Emotion Database with Both Posed and Authentic Facial Actions for Emotion Analysis
by Xiaotian Li et al

03-30-2022

SeqTR: A Simple yet Universal Network for Visual Grounding
by Chaoyang Zhu et al

03-30-2022

Constrained Few-shot Class-incremental Learning
by Michael Hersche et al

04-01-2022

Unitail: Detecting, Reading, and Matching in Retail Scene
by Fangyi Chen et al

03-29-2022

Self-Supervised Leaf Segmentation under Complex Lighting Conditions
by Xufeng Lin et al

03-29-2022

FisherMatch: Semi-Supervised Rotation Regression via Entropy-based Filtering
by Yingda Yin et al

03-29-2022

Towards Learning Neural Representations from Shadows
by Kushagra Tiwary et al

03-30-2022

Learning Instance-Specific Adaptation for Cross-Domain Segmentation
by Yuliang Zou et al

03-30-2022

TR-MOT: Multi-Object Tracking by Reference
by Mingfei Chen et al

03-29-2022

NICGSlowDown: Evaluating the Efficiency Robustness of Neural Image Caption Generation Models
by Simin Chen et al

03-30-2022

RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds
by Tuan-Anh Vu et al

03-29-2022

Cross-Modality High-Frequency Transformer for MR Image Super-Resolution
by Chaowei Fang et al

03-29-2022

Hybrid Routing Transformer for Zero-Shot Learning
by De Cheng et al

03-30-2022

Recognition of polar lows in Sentinel-1 SAR images with deep learning
by Jakob Grahn et al

03-31-2022

BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection
by Junjie Huang et al

03-30-2022

Interactive Multi-scale Fusion of 2D and 3D Features for Multi-object Tracking
by Guangming Wang et al

03-30-2022

Controllable Augmentations for Video Representation Learning
by Rui Qian et al

03-30-2022

Foveation-based Deep Video Compression without Motion Search
by Meixu Chen et al

03-29-2022

Image Segmentation with Adaptive Spatial Priors from Joint Registration
by Haifeng Li et al

03-31-2022

Perceptual Quality Assessment of UGC Gaming Videos
by Xiangxu Yu et al

03-29-2022

Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform
by Mingjun Li et al

03-29-2022

Robust Single Image Dehazing Based on Consistent and Contrast-Assisted Reconstruction
by De Cheng et al

03-30-2022

End-to-end Document Recognition and Understanding with Dessurt
by Brian Davis et al

03-31-2022

Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization
by Junyu Gao et al

03-30-2022

Fair Contrastive Learning for Facial Attribute Classification
by Sungho Park et al

03-30-2022

Spatial-Temporal Parallel Transformer for Arm-Hand Dynamic Estimation
by Shuying Liu et al

04-01-2022

Generic Event Boundary Captioning: A Benchmark for Status Changes Understanding
by Yuxuan Wang et al

03-29-2022

Long-term Video Frame Interpolation via Feature Propagation
by Dawit Mureja Argaw et al

03-29-2022

UnShadowNet: Illumination Critic Guided Contrastive Learning For Shadow Removal
by Subhrajyoti Dasgupta et al

03-29-2022

Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning
by Zhishe Wang et al

03-29-2022

End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
by Congcong Li et al

03-29-2022

Learning to Detect Mobile Objects from LiDAR Scans Without Labels
by Yurong You et al

03-30-2022

Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain
by Lina Guo et al

03-31-2022

AEGNN: Asynchronous Event-based Graph Neural Networks
by Simon Schaefer et al

03-31-2022

Video-Text Representation Learning via Differentiable Weak Temporal Alignment
by Dohwan Ko et al

03-31-2022

Weakly Supervised Patch Label Inference Networks for Efficient Pavement Distress Detection and Recognition in the Wild
by Sheng Huang et al

03-31-2022

Reflection and Rotation Symmetry Detection via Equivariant Learning
by Ahyun Seo et al

03-29-2022

Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes
by Dongkwon Jin et al

04-01-2022

GrowliFlower: An image time series dataset for GROWth analysis of cauLIFLOWER
by Jana Kierdorf et al

03-29-2022

NL-FCOS: Improving FCOS through Non-Local Modules for Object Detection
by Lukas Pavez et al

03-31-2022

Deformable Video Transformer
by Jue Wang et al

04-01-2022

CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection
by Yanan Zhang et al

03-29-2022

OdontoAI: A human-in-the-loop labeled data set and an online platform to boost research on dental panoramic radiographs
by Bernardo Silva et al

03-31-2022

Logit Normalization for Long-tail Object Detection
by Liang Zhao et al

03-31-2022

TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization
by Sijie Zhu et al

03-31-2022

Tooth Instance Segmentation on Panoramic Dental Radiographs Using U-Nets and Morphological Processing
by Selahattin Serdar Helli et al

03-30-2022

Face Relighting with Geometrically Consistent Shadows
by Andrew Hou et al

03-30-2022

Self-Distillation from the Last Mini-Batch for Consistency Regularization
by Yiqing Shen et al

03-29-2022

Edge Detection and Deep Learning Based SETI Signal Classification Method
by Zhewei Chen et al

04-01-2022

RMS-FlowNet: Efficient and Robust Multi-Scale Scene Flow Estimation for Large-Scale Point Clouds
by Ramy Battrawy et al

03-30-2022

FlowFormer: A Transformer Architecture for Optical Flow
by Zhaoyang Huang et al

03-29-2022

Interactive Multi-Class Tiny-Object Detection
by Chunggi Lee et al

04-01-2022

DFNet: Enhance Aboslute Pose Regression with Direct Feature Matching
by Shuai Chen et al

03-29-2022

A Naturalistic Database of Thermal Emotional Facial Expressions and Effects of Induced Emotions on Memory
by Anna Esposito et al

03-29-2022

Learning-based Point Cloud Registration for 6D Object Pose Estimation in the Real World
by Zheng Dang et al

03-31-2022

A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its effect on Image Correspondence
by Rema Daher et al

03-31-2022

Self-distillation Augmented Masked Autoencoders for Histopathological Image Classification
by Yang Luo et al

03-29-2022

Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
by Jiabo Ye et al

03-29-2022

Face segmentation: A comparison between visible and thermal images
by Jiri Mekyska et al

03-29-2022

Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation
by Jogendra Nath Kundu et al

04-01-2022

Marginal Contrastive Correspondence for Guided Image Generation
by Fangneng Zhan et al

03-29-2022

Clean Implicit 3D Structure from Noisy 2D STEM Images
by Hannah Kniesel et al

03-29-2022

Contextual Information Based Anomaly Detection for a Multi-Scene UAV Aerial Videos
by Girisha S et al

03-30-2022

Understanding 3D Object Articulation in Internet Videos
by Shengyi Qian et al

03-30-2022

Large-Scale Pre-training for Person Re-identification with Noisy Labels
by Dengpan Fu et al

03-30-2022

CycDA: Unsupervised Cycle Domain Adaptation from Image to Video
by Wei Lin et al

03-30-2022

InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
by Soohyun Kim et al

04-01-2022

Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition
by Jingwei Yan et al

04-01-2022

Few-shot One-class Domain Adaptation Based on Frequency for Iris Presentation Attack Detection
by Yachun Li et al

03-29-2022

On Triangulation as a Form of Self-Supervision for 3D Human Pose Estimation
by Soumava Kumar Roy et al

03-29-2022

Few-shot Structured Radiology Report Generation Using Natural Language Prompts
by Matthias Keicher et al

04-01-2022

Autonomous crater detection on asteroids using a fully-convolutional neural network
by Francesco Latorre et al

03-30-2022

Surface Vision Transformers: Attention-Based Modelling applied to Cortical Analysis
by Simon Dahan et al

03-30-2022

L^3U-net: Low-Latency Lightweight U-net Based Image Segmentation Model for Parallel CNN Processors
by Osman Erman Okman et al

03-30-2022

PP-YOLOE: An evolved version of YOLO
by Shangliang Xu et al

03-31-2022

Point Scene Understanding via Disentangled Instance Mesh Reconstruction
by Jiaxiang Tang et al

03-29-2022

Balanced Multimodal Learning via On-the-fly Gradient Modulation
by Xiaokang Peng et al

03-30-2022

The impact of using voxel-level segmentation metrics on evaluating multifocal prostate cancer localisation
by Wen Yan et al

03-29-2022

Fine-Grained Visual Entailment
by Christopher Thomas et al

03-29-2022

OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction
by Lixin Yang et al

03-29-2022

Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation
by Yueming Jin et al

03-29-2022

Task-specific Inconsistency Alignment for Domain Adaptive Object Detection
by Liang Zhao et al

03-29-2022

Category Guided Attention Network for Brain Tumor Segmentation in MRI
by Jiangyun Li et al

03-29-2022

Exploring Frequency Adversarial Attacks for Face Forgery Detection
by Shuai Jia et al

04-01-2022

Fast and Automatic Object Registration for Human-Robot Collaboration in Industrial Manufacturing
by Manuela Geiß et al

03-29-2022

Semi-Supervised Image-to-Image Translation using Latent Space Mapping
by Pan Zhang et al

03-31-2022

Digitizing Historical Balance Sheet Data: A Practitioners Guide
by Sergio Correia et al

03-30-2022

STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
by Zheng Chang et al

03-29-2022

SAR-ShipNet: SAR-Ship Detection Neural Network via Bidirectional Coordinate Attention and Multi-resolution Feature Fusion
by Yuwen Deng et al

03-29-2022

Robust Structured Declarative Classifiers for 3D Point Clouds: Defending Adversarial Attacks with Implicit Gradients
by Kaidong Li et al

03-29-2022

ReplaceBlock: An improved regularization method based on background information
by Zhemin Zhang et al

03-29-2022

Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation
by Wonhui Park et al

03-29-2022

In-N-Out Generative Learning for Dense Unsupervised Video Segmentation
by Xiao Pan et al

03-31-2022

Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis
by Zhengyao Lv et al

03-31-2022

Multi-Granularity Alignment Domain Adaptation for Object Detection
by Wenzhang Zhou et al

03-29-2022

Identification and classification of exfoliated graphene flakes from microscopy images using a hierarchical deep convolutional neural network
by Soroush Mahjoubi et al

03-29-2022

AnyFace: Free-style Text-to-Face Synthesis and Manipulation
by Jianxin Sun et al

03-30-2022

Interpretable Vertebral Fracture Diagnosis
by Paul Engstler et al

03-29-2022

Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification
by Shi Pu et al

03-30-2022

Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination
by Yiqun Mei et al

03-30-2022

CardioID: Mitigating the Effects of Irregular Cardiac Signals for Biometric Identification
by Weizheng Wang et al

03-29-2022

OSOP: A Multi-Stage One Shot Object Pose Estimation Framework
by Ivan Shugurov et al

03-30-2022

Threshold Matters in WSSS: Manipulating the Activation for the Robust and Accurate Segmentation Model Against Thresholds
by Minhyun Lee et al

03-30-2022

Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction
by Tiezheng Ma et al

03-30-2022

Self-supervised 360∘∘ Room Layout Estimation
by Hao-Wen Ting et al

03-29-2022

Learning a Structured Latent Space for Unsupervised Point Cloud Completion
by Yingjie Cai et al

03-30-2022

Contribution of the Temperature of the Objects to the Problem of Thermal Imaging Focusing
by Virginia Espinosa-Duró et al

03-30-2022

Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
by Hanqing Wang et al

03-29-2022

ACR Loss: Adaptive Coordinate-based Regression Loss for Face Alignment
by Ali Pourramezan Fard et al

03-29-2022

Efficient Virtual View Selection for 3D Hand Pose Estimation
by Jian Cheng et al

03-30-2022

Preliminary experiments on thermal emissivity adjustment for face images
by Marcos Faundez-Zanuy et al

03-29-2022

Efficient Hybrid Network: Inducting Scattering Features
by Dmitry Minskiy et al

03-30-2022

PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection
by Gang Li et al

03-30-2022

Towards Multimodal Depth Estimation from Light Fields
by Titus Leistner et al

03-30-2022

PEGG-Net: Background Agnostic Pixel-Wise Efficient Grasp Generation Under Closed-Loop Conditions
by Zhiyang Liu et al

04-01-2022

Vision Transformer with Cross-attention by Temporal Shift for Efficient Action Recognition
by Ryota Hashiguchi et al

04-01-2022

FrequencyLowCut Pooling -- Plug & Play against Catastrophic Overfitting
by Julia Grabinski et al

03-30-2022

Fast Light-Weight Near-Field Photometric Stereo
by Daniel Lichy et al

03-30-2022

OPD: Single-view 3D Openable Part Detection
by Hanxiao Jiang et al

03-30-2022

On the Road to Online Adaptation for Semantic Image Segmentation
by Riccardo Volpi et al

03-29-2022

Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels
by Jiwon Kim et al

04-01-2022

DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow
by Zihua Zheng et al

03-30-2022

Graph-based Active Learning for Semi-supervised Classification of SAR Data
by Kevin Miller et al

03-29-2022

NNLander-VeriF: A Neural Network Formal Verification Framework for Vision-Based Autonomous Aircraft Landing
by Ulices Santa Cruz et al

03-30-2022

Rabbit, toad, and the Moon: Can machine categorize them into one class?
by Daigo Shoji

03-30-2022

Automatic Facial Skin Feature Detection for Everyone
by Qian Zheng et al

03-30-2022

An Iterative Co-Training Transductive Framework for Zero Shot Learning
by Bo Liu et al

03-30-2022

Omni-DETR: Omni-Supervised Object Detection with Transformers
by Pei Wang et al

03-30-2022

Pay Attention to Hidden States for Video Deblurring: Ping-Pong Recurrent Neural Networks and Selective Non-Local Attention
by JoonKyu Park et al

03-29-2022

VPTR: Efficient Transformers for Video Prediction
by Xi Ye et al

03-30-2022

Global Tracking via Ensemble of Local Trackers
by Zikun Zhou et al

03-30-2022

An Efficient Anchor-free Universal Lesion Detection in CT-scans
by Manu Sheoran et al

03-29-2022

A Multi-Stage Duplex Fusion ConvNet for Aerial Scene Classification
by Jingjun Yi et al

03-29-2022

Neural Inertial Localization
by Sachini Herath et al

03-30-2022

Ball 3D localization from a single calibrated image
by Gabriel Van Zandycke et al

 
Craig Smith