2022.4.4 Vision papers

03-30-2022	Exploring Plain Vision Transformer Backbones for Object Detection by Yanghao Li et al
03-29-2022	Contrasting the landscape of contrastive and non-contrastive learning by Ashwini Pokle et al
03-31-2022	MyStyle: A Personalized Generative Prior by Yotam Nitzan et al
03-31-2022	Visual Prompting: Modifying Pixel Space to Adapt Pre-trained Models by Hyojin Bahng et al
03-29-2022	Dressing in the Wild by Watching Dance Videos by Xin Dong et al
03-30-2022	VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers by Estelle Aflalo et al
03-31-2022	Bringing Old Films Back to Life by Ziyu Wan et al
03-29-2022	EnvEdit: Environment Editing for Vision-and-Language Navigation by Jialu Li et al
03-31-2022	R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis by Huan Wang et al
03-30-2022	FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations by Lingjie Mei et al
03-30-2022	CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs by Jiteng Mu et al
03-30-2022	MeMOT: Multi-Object Tracking with Memory by Jiarui Cai et al
04-01-2022	Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language by Andy Zeng et al
03-29-2022	Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation by Xiao Fu et al
03-29-2022	ITTR: Unpaired Image-to-Image Translation with Transformers by Wanfeng Zheng et al
03-31-2022	DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools by Xingyu Lin et al
03-30-2022	TubeDETR: Spatio-Temporal Video Grounding with Transformers by Antoine Yang et al
03-31-2022	Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis by Karren Yang et al
03-31-2022	Continuous Scene Representations for Embodied AI by Samir Yitzhak Gadre et al
03-30-2022	CaDeX: Learning Canonical Deformation Coordinate Space for Dynamic Surface Representation via Neural Homeomorphism by Jiahui Lei et al
03-29-2022	Diffusion Models for Counterfactual Explanations by Guillaume Jeanneret et al
03-29-2022	Fine-tuning Image Transformers using Learnable Memory by Mark Sandler et al
03-30-2022	To Find Waldo You Need Contextual Cues: Debiasing Whos Waldo by Yiran Luo et al
04-01-2022	Simplicial Embeddings in Self-Supervised Learning and Downstream Classification by Samuel Lavoie et al
03-29-2022	Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries by Jihwan Bang et al
03-30-2022	Balanced MSE for Imbalanced Visual Regression by Jiawei Ren et al
03-30-2022	AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift by Burak Yildiz et al
03-30-2022	DDNeRF: Depth Distribution Neural Radiance Fields by David Dadon et al
03-31-2022	TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing by Yanbo Xu et al
03-31-2022	Generating High Fidelity Data from Low-density Regions using Diffusion Models by Vikash Sehwag et al
03-29-2022	Disentangled3D: Learning a 3D Generative Model with Disentangled Geometry and Appearance from Monocular Images by Ayush Tewari et al
04-01-2022	Perception Prioritized Training of Diffusion Models by Jooyoung Choi et al
03-29-2022	Iterative Deep Homography Estimation by Si-Yuan Cao et al
03-29-2022	Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian by Jihyun Lee et al
03-30-2022	Enhancing Cancer Prediction in Challenging Screen-Detected Incident Lung Nodules Using Time-Series Deep Learning by Shahab Aslani et al
03-31-2022	Towards Driving-Oriented Metric for Lane Detection Models by Takami Sato et al
03-29-2022	Parameter-efficient Fine-tuning for Vision Transformers by Xuehai He et al
03-29-2022	DRaCoN -- Differentiable Rasterization Conditioned Neural Radiance Fields for Articulated Avatars by Amit Raj et al
03-29-2022	MatteFormer: Transformer-Based Image Matting via Prior-Tokens by GyuTae Park et al
03-31-2022	BEVFormer: Learning Birds-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers by Zhiqi Li et al
03-31-2022	Its All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher by Kanghyun Choi et al
03-29-2022	Image Retrieval from Contextual Descriptions by Benno Krojer et al
03-30-2022	ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval by Mengjun Cheng et al
03-30-2022	Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks by Yang Shao et al
03-31-2022	A Closer Look at Rehearsal-Free Continual Learning by James Seale Smith et al
03-29-2022	Integrative Few-Shot Learning for Classification and Segmentation by Dahyun Kang et al
03-30-2022	Exploiting Explainable Metrics for Augmented SGD by Mahdi S. Hosseini et al
03-29-2022	SepViT: Separable Vision Transformer by Wei Li et al
03-30-2022	Online Motion Style Transfer for Interactive Character Control by Yingtian Tang et al
03-29-2022	A Style-aware Discriminator for Controllable Image Translation by Kunhee Kim et al
03-31-2022	SimVQA: Exploring Simulated Environments for Visual Question Answering by Paola Cascante-Bonilla et al
03-29-2022	ME-CapsNet: A Multi-Enhanced Capsule Networks with Routing Mechanism by Jerrin Bright et al
03-31-2022	Cross-modal Learning of Graph Representations using Radar Point Cloud for Long-Range Gesture Recognition by Souvik Hazra et al
03-29-2022	BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information by Nadine Rueegg et al
03-30-2022	Fast, Accurate and Memory-Efficient Partial Permutation Synchronization by Shaohan Li et al
03-29-2022	Classification of Hyperspectral Images Using SVM with Shape-adaptive Reconstruction and Smoothed Total Variation by Ruoning Li et al
03-31-2022	Mutual Scene Synthesis for Mixed Reality Telepresence by Mohammad Keshavarzi et al
03-30-2022	ReSTR: Convolution-free Referring Image Segmentation Using Transformers by Namyup Kim et al
03-29-2022	Deep Equilibrium Assisted Block Sparse Coding of Inter-dependent Signals: Application to Hyperspectral Imaging by Alexandros Gkillas et al
03-30-2022	HDSDF: Hybrid Directional and Signed Distance Functions for Fast Inverse Rendering by Tarun Yenamandra et al
03-31-2022	Human Instance Segmentation and Tracking via Data Association and Single-stage Detector by Lu Cheng et al
04-01-2022	Autoencoder Attractors for Uncertainty Estimation by Steve Dias Da Cruz et al
03-29-2022	Improved Counting and Localization from Density Maps for Object Detection in 2D and 3D Microscopy Imaging by Shijie Li et al
03-29-2022	Treatment Learning Transformer for Noisy Image Classification by Chao-Han Huck Yang et al
03-31-2022	Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond by Yi Yu et al
03-29-2022	SHOP: A Deep Learning Based Pipeline for near Real-Time Detection of Small Handheld Objects Present in Blurry Video by Abhinav Ganguly et al
03-29-2022	How Deep is Your Art: An Experimental Study on the Limits of Artistic Understanding in a Single-Task, Single-Modality Neural Network by Mahan Agha Zahedi et al
03-29-2022	Deep Reinforcement Learning for Data-Driven Adaptive Scanning in Ptychography by Marcel Schloz et al
03-31-2022	Multimodal Fusion Transformer for Remote Sensing Image Classification by Swalpa Kumar Roy et al
03-30-2022	Federated Learning for the Classification of Tumor Infiltrating Lymphocytes by Ujjwal Baid et al
03-31-2022	Model Predictive Control for Fluid Human-to-Robot Handovers by Wei Yang et al
03-29-2022	AutoCoMet: Smart Neural Architecture Search via Co-Regulated Shaping Reinforcement by Mayukh Das et al
03-31-2022	Ternary and Binary Quantization for Improved Classification by Weizhi Lu et al
03-29-2022	CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters by Paul Gavrikov et al
03-31-2022	Few-Shot Class-Incremental Learning by Sampling Multi-Phase Tasks by Da-Wei Zhou et al
03-30-2022	Unseen Classes at a Later Time? No Problem by Hari Chandana Kuchibhotla et al
03-29-2022	The Sound of Bounding-Boxes by Takashi Oya et al
03-30-2022	An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection by Sheng Xu
03-29-2022	A deep learning model for burn depth classification using ultrasound imaging by Sangrock Lee et al
03-30-2022	Biclustering Algorithms Based on Metaheuristics: A Review by Adan Jose-Garcia et al
03-29-2022	Equivariance Allows Handling Multiple Nuisance Variables When Analyzing Pooled Neuroimaging Datasets by Vishnu Suresh Lokhande et al
03-29-2022	AutoPoly: Predicting a Polygonal Mesh Construction Sequence from a Silhouette Image by I-Chao Shen et al
03-29-2022	Learning Structured Gaussians to Approximate Deep Ensembles by Ivor J. A. Simpson et al
03-29-2022	Transformer Inertial Poser: Attention-based Real-time Human Motion Reconstruction from Sparse IMUs by Yifeng Jiang et al
03-29-2022	Zero-Query Transfer Attacks on Context-Aware Object Detectors by Zikui Cai et al
03-31-2022	FindIt: Generalized Localization with Natural Language Queries by Weicheng Kuo et al
04-01-2022	Selecting task with optimal transport self-supervised learning for few-shot classification by Renjie Xu et al
03-29-2022	Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage by Zhuohang Li et al
03-30-2022	Learning Local Displacements for Point Cloud Completion by Yida Wang et al
03-31-2022	Deep Hyperspectral Unmixing using Transformer Network by Preetam Ghosh et al
03-31-2022	3D Equivariant Graph Implicit Functions by Yunlu Chen et al
03-31-2022	Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy by Tong Zhang et al
03-31-2022	Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion by Stepan Tulyakov et al
03-31-2022	Rethinking Portrait Matting with Privacy Preserving by Sihan Ma et al
03-30-2022	COSMOS: Cross-Modality Unsupervised Domain Adaptation for 3D Medical Image Segmentation based on Target-aware Domain Translation and Iterative Self-Training by Hyungseob Shin et al
03-29-2022	TransductGAN: a Transductive Adversarial Model for Novelty Detection by Najiba Toron et al
03-31-2022	Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions by Van Nguyen Nguyen et al
03-31-2022	Do Vision-Language Pretrained Models Learn Primitive Concepts? by Tian Yun et al
03-30-2022	Knowledge-based Entity Prediction for Improved Machine Perception in Autonomous Systems by Ruwan Wickramarachchi et al
04-01-2022	Connect, Not Collapse: Explaining Contrastive Learning for Unsupervised Domain Adaptation by Kendrick Shen et al
03-31-2022	An End-to-end Supervised Domain Adaptation Framework for Cross-Domain Change Detection by Jia Liu et al
03-30-2022	Forecasting from LiDAR via Future Object Detection by Neehar Peri et al
03-29-2022	Agreement or Disagreement in Noise-tolerant Mutual Learning? by Jiarun Liu et al
03-31-2022	Semi-Weakly Supervised Object Detection by Sampling Pseudo Ground-Truth Boxes by Akhil Meethal et al
03-29-2022	Photographic Visualization of Weather Forecasts with Generative Adversarial Networks by Christian Sigg et al
03-31-2022	Adaptive Mean-Residue Loss for Robust Facial Age Estimation by Ziyuan Zhao et al
03-30-2022	PromptDet: Expand Your Detector Vocabulary with Uncurated Images by Chengjian Feng et al
03-31-2022	Measuring hand use in the home after cervical spinal cord injury using egocentric video by Andrea Bandini et al
03-31-2022	A Unified Framework for Domain Adaptive Pose Estimation by Donghyun Kim et al
03-30-2022	Learning Program Representations for Food Images and Cooking Recipes by Dim P. Papadopoulos et al
03-30-2022	Recommendation of Compatible Outfits Conditioned on Style by Debopriyo Banerjee et al
03-30-2022	FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing by Rishubh Singh et al
03-29-2022	Kernel Modulation: A Parameter-Efficient Method for Training Convolutional Neural Networks by Yuhuang Hu et al
03-29-2022	Vision Transformers in Medical Computer Vision -- A Contemplative Retrospection by Arshi Parvaiz et al
03-29-2022	Abstract Flow for Temporal Semantic Segmentation on the Permutohedral Lattice by Peer Schütt et al
03-29-2022	Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis by Yunlong Zhang et al
03-30-2022	On learning adaptive acquisition policies for undersampled multi-coil MRI reconstruction by Tim Bakker et al
03-30-2022	AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval by Riku Togashi et al
04-01-2022	Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization by Eunji Kim et al
03-29-2022	Quantifying Societal Bias Amplification in Image Captioning by Yusuke Hirota et al
03-31-2022	Rethinking Video Salient Object Ranking by Jiaying Lin et al
03-29-2022	Efficient Reflectance Capture with a Deep Gated Mixture-of-Experts by Xiaohe Ma et al
03-29-2022	VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics by Haresh Karnan et al
04-01-2022	ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition by Jun Kimata et al
04-01-2022	Online panoptic 3D reconstruction as a Linear Assignment Problem by Leevi Raivio et al
03-29-2022	Target and Task specific Source-Free Domain Adaptive Image Segmentation by Vibashan VS et al
03-31-2022	ImpDet: Exploring Implicit Fields for 3D Object Detection by Xuelin Qian et al
03-31-2022	MPS-NeRF: Generalizable 3D Human Rendering from Multiview Images by Xiangjun Gao et al
03-31-2022	A Dataset of Images of Public Streetlights with Operational Monitoring using Computer Vision Techniques by Ioannis Mavromatis et al
03-29-2022	Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection by Jingqun Tang et al
03-29-2022	SIOD: Single Instance Annotated Per Category Per Image for Object Detection by Hanjun Li et al
03-30-2022	Task Adaptive Parameter Sharing for Multi-Task Learning by Matthew Wallingford et al
04-01-2022	MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration by Chenzhong Gao et al
03-30-2022	Knowledge-Spreader: Learning Facial Action Unit Dynamics with Extremely Limited Labels by Xiaotian Li et al
03-29-2022	MAT: Mask-Aware Transformer for Large Hole Image Inpainting by Wenbo Li et al
03-31-2022	End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps by Ke Guo et al
03-29-2022	Using Active Speaker Faces for Diarization in TV shows by Rahul Sharma et al
03-29-2022	Deeply Interleaved Two-Stream Encoder for Referring Video Segmentation by Guang Feng et al
03-30-2022	Investigating Top-kk White-Box and Transferable Black-box Attack by Chaoning Zhang et al
03-29-2022	MAP-Gen: An Automated 3D-Box Annotation Flow with Multimodal Attention Point Generator by Chang Liu et al
03-31-2022	Contributions to interframe coding by Marcos Faundez-Zanuy et al
03-31-2022	Automatic Classification of Alzheimers Disease using brain MRI data and deep Convolutional Neural Networks by Zahraa Sh. Aaraji et al
04-01-2022	Quantized GAN for Complex Music Generation from Dance Videos by Ye Zhu et al
03-31-2022	A Survey of Robust 3D Object Detection Methods in Point Clouds by Walter Zimmer et al
03-30-2022	Personalized Image Aesthetics Assessment with Rich Attributes by Yuzhe Yang et al
03-31-2022	Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning by Semih Orhan et al
03-29-2022	Monitored Distillation for Positive Congruent Depth Completion by Tian Yu Liu et al
03-31-2022	GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature by Biyang Liu et al
03-30-2022	Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models by Feng Cheng et al
03-30-2022	Casual 6-DoF: free-viewpoint panorama using a handheld 360 camera by Rongsen Chen et al
03-31-2022	Dynamic Multimodal Fusion by Zihui Xue et al
03-31-2022	CADG: A Model Based on Cross Attention for Domain Generalization by Cheng Dai et al

03-30-2022	Sensor Data Validation and Driving Safety in Autonomous Driving Systems by Jindi Zhang
03-29-2022	Meta-Sampler: Almost-Universal yet Task-Oriented Sampling for Point Clouds by Ta-Ying Cheng et al
03-30-2022	Multi-Robot Active Mapping via Neural Bipartite Graph Matching by Kai Ye et al
03-31-2022	Real-Time and Robust 3D Object Detection Within Road-Side LiDARs Using Domain Adaptation by Walter Zimmer et al
03-29-2022	High-resolution Face Swapping via Latent Semantics Disentanglement by Yangyang Xu et al
03-29-2022	StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis by Zhiheng Li et al
03-29-2022	Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection by Vibashan VS et al
03-30-2022	LAKe-Net: Topology-Aware Point Cloud Completion by Localizing Aligned Keypoints by Junshu Tang et al
03-29-2022	PSMNet: Position-aware Stereo Merging Network for Room Layout Estimation by Haiyan Wang et al
03-30-2022	Learning the Effect of Registration Hyperparameters with HyperMorph by Andrew Hoopes et al
04-01-2022	Proper Reuse of Image Classification Features Improves Object Detection by Cristina Vasconcelos et al
03-31-2022	Stereo Unstructured Magnification: Multiple Homography Image for View Synthesis by Qi Zhang et al
03-31-2022	LASER: LAtent SpacE Rendering for 2D Visual Localization by Zhixiang Min et al
03-29-2022	Proactive Image Manipulation Detection by Vishal Asnani et al
03-30-2022	Tampered VAE for Improved Satellite Image Time Series Classification by Xin Cai et al
04-01-2022	Epipolar Focus Spectrum: A Novel Light Field Representation and Application in Dense-view Reconstruction by Yaning Li et al
03-29-2022	Fine-Grained Object Classification via Self-Supervised Pose Alignment by Xuhui Yang et al
03-30-2022	Collaborative Transformers for Grounded Situation Recognition by Junhyeong Cho et al
03-30-2022	End to End Lip Synchronization with a Temporal AutoEncoder by Yoav Shalev et al
03-30-2022	Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection by Jinyuan Liu et al
03-29-2022	PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision by Kehong Gong et al
04-01-2022	Autoencoder for Synthetic to Real Generalization: From Simple to More Complex Scenes by Steve Dias Da Cruz et al
03-30-2022	CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation by Ziqi Zhang et al
03-30-2022	SpatioTemporal Focus for Skeleton-based Action Recognition by Liyu Wu et al
04-01-2022	Face identification by means of a neural net classifier by Virginia Espinosa-Duro et al
03-29-2022	StyleFool: Fooling Video Classification Systems via Style Transfer by Yuxin Cao et al
04-01-2022	Comparison of convolutional neural networks for cloudy optical images reconstruction from single or multitemporal joint SAR and optical images by Rémi Cresson et al
03-29-2022	CHEX: CHannel EXploration for CNN Model Compression by Zejiang Hou et al
03-29-2022	Neural Face Video Compression using Multiple Views by Anna Volokitin et al
04-01-2022	Learning to Deblur using Light Field Generated and Real Defocus Images by Lingyan Ruan et al
03-30-2022	Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data by Corentin Sautier et al
03-29-2022	Text-Driven Video Acceleration: A Weakly-Supervised Reinforcement Learning Method by Washington Ramos et al
03-31-2022	Speaker Extraction with Co-Speech Gestures Cue by Zexu Pan et al
03-30-2022	AdaMixer: A Fast-Converging Query-Based Object Detector by Ziteng Gao et al
03-29-2022	Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production by Ben Saunders et al
03-31-2022	Dynamic Supervisor for Cross-dataset Object Detection by Ze Chen et al
03-29-2022	A Computational Architecture for Machine Consciousness and Artificial Superintelligence: Updating Working Memory Iteratively by Jared Edward Reser
03-31-2022	CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow by Xiuchao Sui et al
03-31-2022	Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds by Zhao Jin et al
03-30-2022	ConceptEvo: Interpreting Concept Evolution in Deep Learning Training by Haekyu Park et al
04-01-2022	On the Importance of Asymmetry for Siamese Representation Learning by Xiao Wang et al
03-31-2022	Investigating Modality Bias in Audio Visual Video Parsing by Piyush Singh Pasi et al
03-30-2022	Region of Interest focused MRI to Synthetic CT Translation using Regression and Classification Multi-task Network by Sandeep Kaushik et al
03-29-2022	Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation by Zhenguang Liu et al
03-29-2022	Nested Collaborative Learning for Long-Tailed Visual Recognition by Jun Li et al
03-29-2022	End-to-End Transformer Based Model for Image Captioning by Yiyu Wang et al
03-31-2022	GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing by Sijie Zhu et al
04-01-2022	Unimodal-Concentrated Loss: Fully Adaptive Label Distribution Learning for Ordinal Regression by Qiang Li et al
03-30-2022	PIE-Net: Photometric Invariant Edge Guided Network for Intrinsic Image Decomposition by Partha Das et al
03-29-2022	Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots by Pranay Mathur et al
03-31-2022	Efficient Maximal Coding Rate Reduction by Variational Forms by Christina Baek et al
03-30-2022	Acknowledging the Unknown for Multi-label Learning with Single Positive Labels by Donghao Zhou et al
03-29-2022	Self-Supervised Image Representation Learning with Geometric Set Consistency by Nenglun Chen et al
03-29-2022	Domain Invariant Siamese Attention Mask for Small Object Change Detection via Everyday Indoor Robot Navigation by Koji Takeda et al
03-30-2022	Learning of Global Objective for Network Flow in Multi-Object Tracking by Shuai Li et al
03-29-2022	Semantic Line Detection Using Mirror Attention and Comparative Ranking and Matching by Dongkwon Jin et al
03-30-2022	SIT: A Bionic and Non-Linear Neuron for Spiking Neural Network by Cheng Jin et al
03-29-2022	Angular Super-Resolution in Diffusion MRI with a 3D Recurrent Convolutional Autoencoder by Matthew Lyon et al
03-29-2022	mc-BEiT: Multi-choice Discretization for Image BERT Pre-training by Xiaotong Li et al
03-29-2022	Texture based Prototypical Network for Few-Shot Semantic Segmentation of Forest Cover: Generalizing for Different Geographical Regions by Gokul P et al
03-29-2022	An EEG-Based Multi-Modal Emotion Database with Both Posed and Authentic Facial Actions for Emotion Analysis by Xiaotian Li et al
03-30-2022	SeqTR: A Simple yet Universal Network for Visual Grounding by Chaoyang Zhu et al
03-30-2022	Constrained Few-shot Class-incremental Learning by Michael Hersche et al
04-01-2022	Unitail: Detecting, Reading, and Matching in Retail Scene by Fangyi Chen et al
03-29-2022	Self-Supervised Leaf Segmentation under Complex Lighting Conditions by Xufeng Lin et al
03-29-2022	FisherMatch: Semi-Supervised Rotation Regression via Entropy-based Filtering by Yingda Yin et al
03-29-2022	Towards Learning Neural Representations from Shadows by Kushagra Tiwary et al
03-30-2022	Learning Instance-Specific Adaptation for Cross-Domain Segmentation by Yuliang Zou et al
03-30-2022	TR-MOT: Multi-Object Tracking by Reference by Mingfei Chen et al
03-29-2022	NICGSlowDown: Evaluating the Efficiency Robustness of Neural Image Caption Generation Models by Simin Chen et al
03-30-2022	RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds by Tuan-Anh Vu et al
03-29-2022	Cross-Modality High-Frequency Transformer for MR Image Super-Resolution by Chaowei Fang et al
03-29-2022	Hybrid Routing Transformer for Zero-Shot Learning by De Cheng et al
03-30-2022	Recognition of polar lows in Sentinel-1 SAR images with deep learning by Jakob Grahn et al
03-31-2022	BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection by Junjie Huang et al
03-30-2022	Interactive Multi-scale Fusion of 2D and 3D Features for Multi-object Tracking by Guangming Wang et al
03-30-2022	Controllable Augmentations for Video Representation Learning by Rui Qian et al
03-30-2022	Foveation-based Deep Video Compression without Motion Search by Meixu Chen et al
03-29-2022	Image Segmentation with Adaptive Spatial Priors from Joint Registration by Haifeng Li et al
03-31-2022	Perceptual Quality Assessment of UGC Gaming Videos by Xiangxu Yu et al
03-29-2022	Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform by Mingjun Li et al
03-29-2022	Robust Single Image Dehazing Based on Consistent and Contrast-Assisted Reconstruction by De Cheng et al
03-30-2022	End-to-end Document Recognition and Understanding with Dessurt by Brian Davis et al
03-31-2022	Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization by Junyu Gao et al
03-30-2022	Fair Contrastive Learning for Facial Attribute Classification by Sungho Park et al
03-30-2022	Spatial-Temporal Parallel Transformer for Arm-Hand Dynamic Estimation by Shuying Liu et al
04-01-2022	Generic Event Boundary Captioning: A Benchmark for Status Changes Understanding by Yuxuan Wang et al
03-29-2022	Long-term Video Frame Interpolation via Feature Propagation by Dawit Mureja Argaw et al
03-29-2022	UnShadowNet: Illumination Critic Guided Contrastive Learning For Shadow Removal by Subhrajyoti Dasgupta et al
03-29-2022	Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning by Zhishe Wang et al
03-29-2022	End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection by Congcong Li et al
03-29-2022	Learning to Detect Mobile Objects from LiDAR Scans Without Labels by Yurong You et al
03-30-2022	Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain by Lina Guo et al
03-31-2022	AEGNN: Asynchronous Event-based Graph Neural Networks by Simon Schaefer et al
03-31-2022	Video-Text Representation Learning via Differentiable Weak Temporal Alignment by Dohwan Ko et al
03-31-2022	Weakly Supervised Patch Label Inference Networks for Efficient Pavement Distress Detection and Recognition in the Wild by Sheng Huang et al
03-31-2022	Reflection and Rotation Symmetry Detection via Equivariant Learning by Ahyun Seo et al
03-29-2022	Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes by Dongkwon Jin et al
04-01-2022	GrowliFlower: An image time series dataset for GROWth analysis of cauLIFLOWER by Jana Kierdorf et al
03-29-2022	NL-FCOS: Improving FCOS through Non-Local Modules for Object Detection by Lukas Pavez et al
03-31-2022	Deformable Video Transformer by Jue Wang et al
04-01-2022	CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection by Yanan Zhang et al
03-29-2022	OdontoAI: A human-in-the-loop labeled data set and an online platform to boost research on dental panoramic radiographs by Bernardo Silva et al
03-31-2022	Logit Normalization for Long-tail Object Detection by Liang Zhao et al
03-31-2022	TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization by Sijie Zhu et al
03-31-2022	Tooth Instance Segmentation on Panoramic Dental Radiographs Using U-Nets and Morphological Processing by Selahattin Serdar Helli et al
03-30-2022	Face Relighting with Geometrically Consistent Shadows by Andrew Hou et al
03-30-2022	Self-Distillation from the Last Mini-Batch for Consistency Regularization by Yiqing Shen et al
03-29-2022	Edge Detection and Deep Learning Based SETI Signal Classification Method by Zhewei Chen et al
04-01-2022	RMS-FlowNet: Efficient and Robust Multi-Scale Scene Flow Estimation for Large-Scale Point Clouds by Ramy Battrawy et al
03-30-2022	FlowFormer: A Transformer Architecture for Optical Flow by Zhaoyang Huang et al
03-29-2022	Interactive Multi-Class Tiny-Object Detection by Chunggi Lee et al
04-01-2022	DFNet: Enhance Aboslute Pose Regression with Direct Feature Matching by Shuai Chen et al
03-29-2022	A Naturalistic Database of Thermal Emotional Facial Expressions and Effects of Induced Emotions on Memory by Anna Esposito et al
03-29-2022	Learning-based Point Cloud Registration for 6D Object Pose Estimation in the Real World by Zheng Dang et al
03-31-2022	A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its effect on Image Correspondence by Rema Daher et al
03-31-2022	Self-distillation Augmented Masked Autoencoders for Histopathological Image Classification by Yang Luo et al
03-29-2022	Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding by Jiabo Ye et al
03-29-2022	Face segmentation: A comparison between visible and thermal images by Jiri Mekyska et al
03-29-2022	Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation by Jogendra Nath Kundu et al
04-01-2022	Marginal Contrastive Correspondence for Guided Image Generation by Fangneng Zhan et al
03-29-2022	Clean Implicit 3D Structure from Noisy 2D STEM Images by Hannah Kniesel et al
03-29-2022	Contextual Information Based Anomaly Detection for a Multi-Scene UAV Aerial Videos by Girisha S et al
03-30-2022	Understanding 3D Object Articulation in Internet Videos by Shengyi Qian et al
03-30-2022	Large-Scale Pre-training for Person Re-identification with Noisy Labels by Dengpan Fu et al
03-30-2022	CycDA: Unsupervised Cycle Domain Adaptation from Image to Video by Wei Lin et al
03-30-2022	InstaFormer: Instance-Aware Image-to-Image Translation with Transformer by Soohyun Kim et al
04-01-2022	Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition by Jingwei Yan et al
04-01-2022	Few-shot One-class Domain Adaptation Based on Frequency for Iris Presentation Attack Detection by Yachun Li et al
03-29-2022	On Triangulation as a Form of Self-Supervision for 3D Human Pose Estimation by Soumava Kumar Roy et al
03-29-2022	Few-shot Structured Radiology Report Generation Using Natural Language Prompts by Matthias Keicher et al
04-01-2022	Autonomous crater detection on asteroids using a fully-convolutional neural network by Francesco Latorre et al
03-30-2022	Surface Vision Transformers: Attention-Based Modelling applied to Cortical Analysis by Simon Dahan et al
03-30-2022	L^3U-net: Low-Latency Lightweight U-net Based Image Segmentation Model for Parallel CNN Processors by Osman Erman Okman et al
03-30-2022	PP-YOLOE: An evolved version of YOLO by Shangliang Xu et al
03-31-2022	Point Scene Understanding via Disentangled Instance Mesh Reconstruction by Jiaxiang Tang et al
03-29-2022	Balanced Multimodal Learning via On-the-fly Gradient Modulation by Xiaokang Peng et al
03-30-2022	The impact of using voxel-level segmentation metrics on evaluating multifocal prostate cancer localisation by Wen Yan et al
03-29-2022	Fine-Grained Visual Entailment by Christopher Thomas et al
03-29-2022	OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction by Lixin Yang et al
03-29-2022	Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation by Yueming Jin et al
03-29-2022	Task-specific Inconsistency Alignment for Domain Adaptive Object Detection by Liang Zhao et al
03-29-2022	Category Guided Attention Network for Brain Tumor Segmentation in MRI by Jiangyun Li et al
03-29-2022	Exploring Frequency Adversarial Attacks for Face Forgery Detection by Shuai Jia et al
04-01-2022	Fast and Automatic Object Registration for Human-Robot Collaboration in Industrial Manufacturing by Manuela Geiß et al
03-29-2022	Semi-Supervised Image-to-Image Translation using Latent Space Mapping by Pan Zhang et al
03-31-2022	Digitizing Historical Balance Sheet Data: A Practitioners Guide by Sergio Correia et al
03-30-2022	STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction by Zheng Chang et al

03-29-2022	SAR-ShipNet: SAR-Ship Detection Neural Network via Bidirectional Coordinate Attention and Multi-resolution Feature Fusion by Yuwen Deng et al
03-29-2022	Robust Structured Declarative Classifiers for 3D Point Clouds: Defending Adversarial Attacks with Implicit Gradients by Kaidong Li et al
03-29-2022	ReplaceBlock: An improved regularization method based on background information by Zhemin Zhang et al
03-29-2022	Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation by Wonhui Park et al
03-29-2022	In-N-Out Generative Learning for Dense Unsupervised Video Segmentation by Xiao Pan et al
03-31-2022	Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis by Zhengyao Lv et al
03-31-2022	Multi-Granularity Alignment Domain Adaptation for Object Detection by Wenzhang Zhou et al
03-29-2022	Identification and classification of exfoliated graphene flakes from microscopy images using a hierarchical deep convolutional neural network by Soroush Mahjoubi et al
03-29-2022	AnyFace: Free-style Text-to-Face Synthesis and Manipulation by Jianxin Sun et al
03-30-2022	Interpretable Vertebral Fracture Diagnosis by Paul Engstler et al
03-29-2022	Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification by Shi Pu et al
03-30-2022	Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination by Yiqun Mei et al
03-30-2022	CardioID: Mitigating the Effects of Irregular Cardiac Signals for Biometric Identification by Weizheng Wang et al
03-29-2022	OSOP: A Multi-Stage One Shot Object Pose Estimation Framework by Ivan Shugurov et al
03-30-2022	Threshold Matters in WSSS: Manipulating the Activation for the Robust and Accurate Segmentation Model Against Thresholds by Minhyun Lee et al
03-30-2022	Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction by Tiezheng Ma et al
03-30-2022	Self-supervised 360∘∘ Room Layout Estimation by Hao-Wen Ting et al
03-29-2022	Learning a Structured Latent Space for Unsupervised Point Cloud Completion by Yingjie Cai et al
03-30-2022	Contribution of the Temperature of the Objects to the Problem of Thermal Imaging Focusing by Virginia Espinosa-Duró et al
03-30-2022	Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation by Hanqing Wang et al
03-29-2022	ACR Loss: Adaptive Coordinate-based Regression Loss for Face Alignment by Ali Pourramezan Fard et al
03-29-2022	Efficient Virtual View Selection for 3D Hand Pose Estimation by Jian Cheng et al
03-30-2022	Preliminary experiments on thermal emissivity adjustment for face images by Marcos Faundez-Zanuy et al
03-29-2022	Efficient Hybrid Network: Inducting Scattering Features by Dmitry Minskiy et al
03-30-2022	PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection by Gang Li et al
03-30-2022	Towards Multimodal Depth Estimation from Light Fields by Titus Leistner et al
03-30-2022	PEGG-Net: Background Agnostic Pixel-Wise Efficient Grasp Generation Under Closed-Loop Conditions by Zhiyang Liu et al
04-01-2022	Vision Transformer with Cross-attention by Temporal Shift for Efficient Action Recognition by Ryota Hashiguchi et al
04-01-2022	FrequencyLowCut Pooling -- Plug & Play against Catastrophic Overfitting by Julia Grabinski et al
03-30-2022	Fast Light-Weight Near-Field Photometric Stereo by Daniel Lichy et al
03-30-2022	OPD: Single-view 3D Openable Part Detection by Hanxiao Jiang et al
03-30-2022	On the Road to Online Adaptation for Semantic Image Segmentation by Riccardo Volpi et al
03-29-2022	Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels by Jiwon Kim et al
04-01-2022	DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow by Zihua Zheng et al
03-30-2022	Graph-based Active Learning for Semi-supervised Classification of SAR Data by Kevin Miller et al
03-29-2022	NNLander-VeriF: A Neural Network Formal Verification Framework for Vision-Based Autonomous Aircraft Landing by Ulices Santa Cruz et al
03-30-2022	Rabbit, toad, and the Moon: Can machine categorize them into one class? by Daigo Shoji
03-30-2022	Automatic Facial Skin Feature Detection for Everyone by Qian Zheng et al
03-30-2022	An Iterative Co-Training Transductive Framework for Zero Shot Learning by Bo Liu et al
03-30-2022	Omni-DETR: Omni-Supervised Object Detection with Transformers by Pei Wang et al
03-30-2022	Pay Attention to Hidden States for Video Deblurring: Ping-Pong Recurrent Neural Networks and Selective Non-Local Attention by JoonKyu Park et al
03-29-2022	VPTR: Efficient Transformers for Video Prediction by Xi Ye et al
03-30-2022	Global Tracking via Ensemble of Local Trackers by Zikun Zhou et al
03-30-2022	An Efficient Anchor-free Universal Lesion Detection in CT-scans by Manu Sheoran et al
03-29-2022	A Multi-Stage Duplex Fusion ConvNet for Aerial Scene Classification by Jingjun Yi et al
03-29-2022	Neural Inertial Localization by Sachini Herath et al
03-30-2022	Ball 3D localization from a single calibrated image by Gabriel Van Zandycke et al

Craig SmithApril 4, 2022