03-30-2022
|
Exploring Plain Vision Transformer Backbones for Object
Detection
by
Yanghao Li
et al
|
|
|
|
03-29-2022
|
Contrasting the landscape of contrastive and
non-contrastive learning
by
Ashwini Pokle
et al
|
|
|
|
03-31-2022
|
MyStyle: A Personalized Generative Prior
by
Yotam Nitzan
et al
|
|
|
|
03-31-2022
|
Visual Prompting: Modifying Pixel Space to Adapt
Pre-trained Models
by
Hyojin Bahng
et al
|
|
|
|
03-29-2022
|
Dressing in the Wild by Watching Dance Videos
by
Xin Dong
et al
|
|
|
|
03-30-2022
|
VL-InterpreT: An Interactive Visualization Tool for
Interpreting Vision-Language Transformers
by
Estelle Aflalo
et al
|
|
|
|
03-31-2022
|
Bringing Old Films Back to Life
by
Ziyu Wan
et al
|
|
|
|
03-29-2022
|
EnvEdit: Environment Editing for Vision-and-Language
Navigation
by
Jialu Li
et al
|
|
|
|
03-31-2022
|
R2L: Distilling Neural Radiance Field to Neural Light
Field for Efficient Novel View Synthesis
by
Huan Wang
et al
|
|
|
|
03-30-2022
|
FALCON: Fast Visual Concept Learning by Integrating
Images, Linguistic descriptions, and Conceptual
Relations
by
Lingjie Mei
et al
|
|
|
|
03-30-2022
|
CoordGAN: Self-Supervised Dense Correspondences Emerge
from GANs
by
Jiteng Mu
et al
|
|
|
|
03-30-2022
|
MeMOT: Multi-Object Tracking with Memory
by
Jiarui Cai
et al
|
|
|
|
04-01-2022
|
Socratic Models: Composing Zero-Shot Multimodal
Reasoning with Language
by
Andy Zeng
et al
|
|
|
|
03-29-2022
|
Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic
Urban Scene Segmentation
by
Xiao Fu
et al
|
|
|
|
03-29-2022
|
ITTR: Unpaired Image-to-Image Translation with
Transformers
by
Wanfeng Zheng
et al
|
|
|
|
03-31-2022
|
DiffSkill: Skill Abstraction from Differentiable
Physics for Deformable Object Manipulations with Tools
by
Xingyu Lin
et al
|
|
|
|
03-30-2022
|
TubeDETR: Spatio-Temporal Video Grounding with
Transformers
by
Antoine Yang
et al
|
|
|
|
03-31-2022
|
Audio-Visual Speech Codecs: Rethinking Audio-Visual
Speech Enhancement by Re-Synthesis
by
Karren Yang
et al
|
|
|
|
03-31-2022
|
Continuous Scene Representations for Embodied AI
by
Samir Yitzhak Gadre
et al
|
|
|
|
03-30-2022
|
CaDeX: Learning Canonical Deformation Coordinate Space
for Dynamic Surface Representation via Neural
Homeomorphism
by
Jiahui Lei
et al
|
|
|
|
03-29-2022
|
Diffusion Models for Counterfactual Explanations
by
Guillaume Jeanneret
et al
|
|
|
|
03-29-2022
|
Fine-tuning Image Transformers using Learnable Memory
by
Mark Sandler
et al
|
|
|
|
03-30-2022
|
To Find Waldo You Need Contextual Cues: Debiasing Who's
Waldo
by
Yiran Luo
et al
|
|
|
|
04-01-2022
|
Simplicial Embeddings in Self-Supervised Learning and
Downstream Classification
by
Samuel Lavoie
et al
|
|
|
|
03-29-2022
|
Online Continual Learning on a Contaminated Data Stream
with Blurry Task Boundaries
by
Jihwan Bang
et al
|
|
|
|
03-30-2022
|
Balanced MSE for Imbalanced Visual Regression
by
Jiawei Ren
et al
|
|
|
|
03-30-2022
|
AmsterTime: A Visual Place Recognition Benchmark
Dataset for Severe Domain Shift
by
Burak Yildiz
et al
|
|
|
|
03-30-2022
|
DDNeRF: Depth Distribution Neural Radiance Fields
by
David Dadon
et al
|
|
|
|
03-31-2022
|
TransEditor: Transformer-Based Dual-Space GAN for
Highly Controllable Facial Editing
by
Yanbo Xu
et al
|
|
|
|
03-31-2022
|
Generating High Fidelity Data from Low-density Regions
using Diffusion Models
by
Vikash Sehwag
et al
|
|
|
|
03-29-2022
|
Disentangled3D: Learning a 3D Generative Model with
Disentangled Geometry and Appearance from Monocular
Images
by
Ayush Tewari
et al
|
|
|
|
04-01-2022
|
Perception Prioritized Training of Diffusion Models
by
Jooyoung Choi
et al
|
|
|
|
03-29-2022
|
Iterative Deep Homography Estimation
by
Si-Yuan Cao
et al
|
|
|
|
03-29-2022
|
Pop-Out Motion: 3D-Aware Image Deformation via Learning
the Shape Laplacian
by
Jihyun Lee
et al
|
|
|
|
03-30-2022
|
Enhancing Cancer Prediction in Challenging
Screen-Detected Incident Lung Nodules Using Time-Series
Deep Learning
by
Shahab Aslani
et al
|
|
|
|
03-31-2022
|
Towards Driving-Oriented Metric for Lane Detection
Models
by
Takami Sato
et al
|
|
|
|
03-29-2022
|
Parameter-efficient Fine-tuning for Vision Transformers
by
Xuehai He
et al
|
|
|
|
03-29-2022
|
DRaCoN -- Differentiable Rasterization Conditioned
Neural Radiance Fields for Articulated Avatars
by
Amit Raj
et al
|
|
|
|
03-29-2022
|
MatteFormer: Transformer-Based Image Matting via
Prior-Tokens
by
GyuTae Park
et al
|
|
|
|
03-31-2022
|
BEVFormer: Learning Bird's-Eye-View Representation from
Multi-Camera Images via Spatiotemporal Transformers
by
Zhiqi Li
et al
|
|
|
|
03-31-2022
|
It's All In the Teacher: Zero-Shot Quantization Brought
Closer to the Teacher
by
Kanghyun Choi
et al
|
|
|
|
03-29-2022
|
Image Retrieval from Contextual Descriptions
by
Benno Krojer
et al
|
|
|
|
03-30-2022
|
ViSTA: Vision and Scene Text Aggregation for
Cross-Modal Retrieval
by
Mengjun Cheng
et al
|
|
|
|
03-30-2022
|
Mask Atari for Deep Reinforcement Learning as POMDP
Benchmarks
by
Yang Shao
et al
|
|
|
|
03-31-2022
|
A Closer Look at Rehearsal-Free Continual Learning
by
James Seale Smith
et al
|
|
|
|
03-29-2022
|
Integrative Few-Shot Learning for Classification and
Segmentation
by
Dahyun Kang
et al
|
|
|
|
03-30-2022
|
Exploiting Explainable Metrics for Augmented SGD
by
Mahdi S. Hosseini
et al
|
|
|
|
03-29-2022
|
SepViT: Separable Vision Transformer
by
Wei Li
et al
|
|
|
|
03-30-2022
|
Online Motion Style Transfer for Interactive Character
Control
by
Yingtian Tang
et al
|
|
|
|
03-29-2022
|
A Style-aware Discriminator for Controllable Image
Translation
by
Kunhee Kim
et al
|
|
|
|
03-31-2022
|
SimVQA: Exploring Simulated Environments for Visual
Question Answering
by
Paola Cascante-Bonilla
et al
|
|
|
|
03-29-2022
|
ME-CapsNet: A Multi-Enhanced Capsule Networks with
Routing Mechanism
by
Jerrin Bright
et al
|
|
|
|
03-31-2022
|
Cross-modal Learning of Graph Representations using
Radar Point Cloud for Long-Range Gesture Recognition
by
Souvik Hazra
et al
|
|
|
|
03-29-2022
|
BARC: Learning to Regress 3D Dog Shape from Images by
Exploiting Breed Information
by
Nadine Rueegg
et al
|
|
|
|
03-30-2022
|
Fast, Accurate and Memory-Efficient Partial Permutation
Synchronization
by
Shaohan Li
et al
|
|
|
|
03-29-2022
|
Classification of Hyperspectral Images Using SVM with
Shape-adaptive Reconstruction and Smoothed Total
Variation
by
Ruoning Li
et al
|
|
|
|
03-31-2022
|
Mutual Scene Synthesis for Mixed Reality Telepresence
by
Mohammad Keshavarzi
et al
|
|
|
|
03-30-2022
|
ReSTR: Convolution-free Referring Image Segmentation
Using Transformers
by
Namyup Kim
et al
|
|
|
|
03-29-2022
|
Deep Equilibrium Assisted Block Sparse Coding of
Inter-dependent Signals: Application to Hyperspectral
Imaging
by
Alexandros Gkillas
et al
|
|
|
|
03-30-2022
|
HDSDF: Hybrid Directional and Signed Distance Functions
for Fast Inverse Rendering
by
Tarun Yenamandra
et al
|
|
|
|
03-31-2022
|
Human Instance Segmentation and Tracking via Data
Association and Single-stage Detector
by
Lu Cheng
et al
|
|
|
|
04-01-2022
|
Autoencoder Attractors for Uncertainty Estimation
by
Steve Dias Da Cruz
et al
|
|
|
|
03-29-2022
|
Improved Counting and Localization from Density Maps
for Object Detection in 2D and 3D Microscopy Imaging
by
Shijie Li
et al
|
|
|
|
03-29-2022
|
Treatment Learning Transformer for Noisy Image
Classification
by
Chao-Han Huck Yang
et al
|
|
|
|
03-31-2022
|
Towards Robust Rain Removal Against Adversarial
Attacks: A Comprehensive Benchmark Analysis and Beyond
by
Yi Yu
et al
|
|
|
|
03-29-2022
|
SHOP: A Deep Learning Based Pipeline for near Real-Time
Detection of Small Handheld Objects Present in Blurry
Video
by
Abhinav Ganguly
et al
|
|
|
|
03-29-2022
|
How Deep is Your Art: An Experimental Study on the
Limits of Artistic Understanding in a Single-Task,
Single-Modality Neural Network
by
Mahan Agha Zahedi
et al
|
|
|
|
03-29-2022
|
Deep Reinforcement Learning for Data-Driven Adaptive
Scanning in Ptychography
by
Marcel Schloz
et al
|
|
|
|
03-31-2022
|
Multimodal Fusion Transformer for Remote Sensing Image
Classification
by
Swalpa Kumar Roy
et al
|
|
|
|
03-30-2022
|
Federated Learning for the Classification of Tumor
Infiltrating Lymphocytes
by
Ujjwal Baid
et al
|
|
|
|
03-31-2022
|
Model Predictive Control for Fluid Human-to-Robot
Handovers
by
Wei Yang
et al
|
|
|
|
03-29-2022
|
AutoCoMet: Smart Neural Architecture Search via
Co-Regulated Shaping Reinforcement
by
Mayukh Das
et al
|
|
|
|
03-31-2022
|
Ternary and Binary Quantization for Improved
Classification
by
Weizhi Lu
et al
|
|
|
|
03-29-2022
|
CNN Filter DB: An Empirical Investigation of Trained
Convolutional Filters
by
Paul Gavrikov
et al
|
|
|
|
03-31-2022
|
Few-Shot Class-Incremental Learning by Sampling
Multi-Phase Tasks
by
Da-Wei Zhou
et al
|
|
|
|
03-30-2022
|
Unseen Classes at a Later Time? No Problem
by
Hari Chandana Kuchibhotla
et al
|
|
|
|
03-29-2022
|
The Sound of Bounding-Boxes
by
Takashi Oya
et al
|
|
|
|
03-30-2022
|
An Improved Lightweight YOLOv5 Model Based on Attention
Mechanism for Face Mask Detection
by
Sheng Xu
|
|
|
|
03-29-2022
|
A deep learning model for burn depth classification
using ultrasound imaging
by
Sangrock Lee
et al
|
|
|
|
03-30-2022
|
Biclustering Algorithms Based on Metaheuristics: A
Review
by
Adan Jose-Garcia
et al
|
|
|
|
03-29-2022
|
Equivariance Allows Handling Multiple Nuisance
Variables When Analyzing Pooled Neuroimaging Datasets
by
Vishnu Suresh Lokhande
et al
|
|
|
|
03-29-2022
|
AutoPoly: Predicting a Polygonal Mesh Construction
Sequence from a Silhouette Image
by
I-Chao Shen
et al
|
|
|
|
03-29-2022
|
Learning Structured Gaussians to Approximate Deep
Ensembles
by
Ivor J. A. Simpson
et al
|
|
|
|
03-29-2022
|
Transformer Inertial Poser: Attention-based Real-time
Human Motion Reconstruction from Sparse IMUs
by
Yifeng Jiang
et al
|
|
|
|
03-29-2022
|
Zero-Query Transfer Attacks on Context-Aware Object
Detectors
by
Zikui Cai
et al
|
|
|
|
03-31-2022
|
FindIt: Generalized Localization with Natural Language
Queries
by
Weicheng Kuo
et al
|
|
|
|
04-01-2022
|
Selecting task with optimal transport self-supervised
learning for few-shot classification
by
Renjie Xu
et al
|
|
|
|
03-29-2022
|
Auditing Privacy Defenses in Federated Learning via
Generative Gradient Leakage
by
Zhuohang Li
et al
|
|
|
|
03-30-2022
|
Learning Local Displacements for Point Cloud Completion
by
Yida Wang
et al
|
|
|
|
03-31-2022
|
Deep Hyperspectral Unmixing using Transformer Network
by
Preetam Ghosh
et al
|
|
|
|
03-31-2022
|
3D Equivariant Graph Implicit Functions
by
Yunlu Chen
et al
|
|
|
|
03-31-2022
|
Leverage Your Local and Global Representations: A New
Self-Supervised Learning Strategy
by
Tong Zhang
et al
|
|
|
|
03-31-2022
|
Time Lens++: Event-based Frame Interpolation with
Parametric Non-linear Flow and Multi-scale Fusion
by
Stepan Tulyakov
et al
|
|
|
|
03-31-2022
|
Rethinking Portrait Matting with Privacy Preserving
by
Sihan Ma
et al
|
|
|
|
03-30-2022
|
COSMOS: Cross-Modality Unsupervised Domain Adaptation
for 3D Medical Image Segmentation based on Target-aware
Domain Translation and Iterative Self-Training
by
Hyungseob Shin
et al
|
|
|
|
03-29-2022
|
TransductGAN: a Transductive Adversarial Model for
Novelty Detection
by
Najiba Toron
et al
|
|
|
|
03-31-2022
|
Templates for 3D Object Pose Estimation Revisited:
Generalization to New Objects and Robustness to
Occlusions
by
Van Nguyen Nguyen
et al
|
|
|
|
03-31-2022
|
Do Vision-Language Pretrained Models Learn Primitive
Concepts?
by
Tian Yun
et al
|
|
|
|
03-30-2022
|
Knowledge-based Entity Prediction for Improved Machine
Perception in Autonomous Systems
by
Ruwan Wickramarachchi
et al
|
|
|
|
04-01-2022
|
Connect, Not Collapse: Explaining Contrastive Learning
for Unsupervised Domain Adaptation
by
Kendrick Shen
et al
|
|
|
|
03-31-2022
|
An End-to-end Supervised Domain Adaptation Framework
for Cross-Domain Change Detection
by
Jia Liu
et al
|
|
|
|
03-30-2022
|
Forecasting from LiDAR via Future Object Detection
by
Neehar Peri
et al
|
|
|
|
03-29-2022
|
Agreement or Disagreement in Noise-tolerant Mutual
Learning?
by
Jiarun Liu
et al
|
|
|
|
03-31-2022
|
Semi-Weakly Supervised Object Detection by Sampling
Pseudo Ground-Truth Boxes
by
Akhil Meethal
et al
|
|
|
|
03-29-2022
|
Photographic Visualization of Weather Forecasts with
Generative Adversarial Networks
by
Christian Sigg
et al
|
|
|
|
03-31-2022
|
Adaptive Mean-Residue Loss for Robust Facial Age
Estimation
by
Ziyuan Zhao
et al
|
|
|
|
03-30-2022
|
PromptDet: Expand Your Detector Vocabulary with
Uncurated Images
by
Chengjian Feng
et al
|
|
|
|
03-31-2022
|
Measuring hand use in the home after cervical spinal
cord injury using egocentric video
by
Andrea Bandini
et al
|
|
|
|
03-31-2022
|
A Unified Framework for Domain Adaptive Pose Estimation
by
Donghyun Kim
et al
|
|
|
|
03-30-2022
|
Learning Program Representations for Food Images and
Cooking Recipes
by
Dim P. Papadopoulos
et al
|
|
|
|
03-30-2022
|
Recommendation of Compatible Outfits Conditioned on
Style
by
Debopriyo Banerjee
et al
|
|
|
|
03-30-2022
|
FLOAT: Factorized Learning of Object Attributes for
Improved Multi-object Multi-part Scene Parsing
by
Rishubh Singh
et al
|
|
|
|
03-29-2022
|
Kernel Modulation: A Parameter-Efficient Method for
Training Convolutional Neural Networks
by
Yuhuang Hu
et al
|
|
|
|
03-29-2022
|
Vision Transformers in Medical Computer Vision -- A
Contemplative Retrospection
by
Arshi Parvaiz
et al
|
|
|
|
03-29-2022
|
Abstract Flow for Temporal Semantic Segmentation on the
Permutohedral Lattice
by
Peer Schütt
et al
|
|
|
|
03-29-2022
|
Harmonizing Pathological and Normal Pixels for
Pseudo-healthy Synthesis
by
Yunlong Zhang
et al
|
|
|
|
03-30-2022
|
On learning adaptive acquisition policies for
undersampled multi-coil MRI reconstruction
by
Tim Bakker
et al
|
|
|
|
03-30-2022
|
AxIoU: An Axiomatically Justified Measure for Video
Moment Retrieval
by
Riku Togashi
et al
|
|
|
|
04-01-2022
|
Bridging the Gap between Classification and
Localization for Weakly Supervised Object Localization
by
Eunji Kim
et al
|
|
|
|
03-29-2022
|
Quantifying Societal Bias Amplification in Image
Captioning
by
Yusuke Hirota
et al
|
|
|
|
03-31-2022
|
Rethinking Video Salient Object Ranking
by
Jiaying Lin
et al
|
|
|
|
03-29-2022
|
Efficient Reflectance Capture with a Deep Gated
Mixture-of-Experts
by
Xiaohe Ma
et al
|
|
|
|
03-29-2022
|
VI-IKD: High-Speed Accurate Off-Road Navigation using
Learned Visual-Inertial Inverse Kinodynamics
by
Haresh Karnan
et al
|
|
|
|
04-01-2022
|
ObjectMix: Data Augmentation by Copy-Pasting Objects in
Videos for Action Recognition
by
Jun Kimata
et al
|
|
|
|
04-01-2022
|
Online panoptic 3D reconstruction as a Linear
Assignment Problem
by
Leevi Raivio
et al
|
|
|
|
03-29-2022
|
Target and Task specific Source-Free Domain Adaptive
Image Segmentation
by
Vibashan VS
et al
|
|
|
|
03-31-2022
|
ImpDet: Exploring Implicit Fields for 3D Object
Detection
by
Xuelin Qian
et al
|
|
|
|
03-31-2022
|
MPS-NeRF: Generalizable 3D Human Rendering from
Multiview Images
by
Xiangjun Gao
et al
|
|
|
|
03-31-2022
|
A Dataset of Images of Public Streetlights with
Operational Monitoring using Computer Vision Techniques
by
Ioannis Mavromatis
et al
|
|
|
|
03-29-2022
|
Few Could Be Better Than All: Feature Sampling and
Grouping for Scene Text Detection
by
Jingqun Tang
et al
|
|
|
|
03-29-2022
|
SIOD: Single Instance Annotated Per Category Per Image
for Object Detection
by
Hanjun Li
et al
|
|
|
|
03-30-2022
|
Task Adaptive Parameter Sharing for Multi-Task Learning
by
Matthew Wallingford
et al
|
|
|
|
04-01-2022
|
MS-HLMO: Multi-scale Histogram of Local Main
Orientation for Remote Sensing Image Registration
by
Chenzhong Gao
et al
|
|
|
|
03-30-2022
|
Knowledge-Spreader: Learning Facial Action Unit
Dynamics with Extremely Limited Labels
by
Xiaotian Li
et al
|
|
|
|
03-29-2022
|
MAT: Mask-Aware Transformer for Large Hole Image
Inpainting
by
Wenbo Li
et al
|
|
|
|
03-31-2022
|
End-to-End Trajectory Distribution Prediction Based on
Occupancy Grid Maps
by
Ke Guo
et al
|
|
|
|
03-29-2022
|
Using Active Speaker Faces for Diarization in TV shows
by
Rahul Sharma
et al
|
|
|
|
03-29-2022
|
Deeply Interleaved Two-Stream Encoder for Referring
Video Segmentation
by
Guang Feng
et al
|
|
|
|
03-30-2022
|
Investigating Top-k White-Box and Transferable
Black-box Attack
by
Chaoning Zhang
et al
|
|
|
|
03-29-2022
|
MAP-Gen: An Automated 3D-Box Annotation Flow with
Multimodal Attention Point Generator
by
Chang Liu
et al
|
|
|
|
03-31-2022
|
Contributions to interframe coding
by
Marcos Faundez-Zanuy
et al
|
|
|
|
03-31-2022
|
Automatic Classification of Alzheimer's Disease using
brain MRI data and deep Convolutional Neural Networks
by
Zahraa Sh. Aaraji
et al
|
|
|
|
04-01-2022
|
Quantized GAN for Complex Music Generation from Dance
Videos
by
Ye Zhu
et al
|
|
|
|
03-31-2022
|
A Survey of Robust 3D Object Detection Methods in Point
Clouds
by
Walter Zimmer
et al
|
|
|
|
03-30-2022
|
Personalized Image Aesthetics Assessment with Rich
Attributes
by
Yuzhe Yang
et al
|
|
|
|
03-31-2022
|
Semantic Pose Verification for Outdoor Visual
Localization with Self-supervised Contrastive Learning
by
Semih Orhan
et al
|
|
|
|
03-29-2022
|
Monitored Distillation for Positive Congruent Depth
Completion
by
Tian Yu Liu
et al
|
|
|
|
03-31-2022
|
GraftNet: Towards Domain Generalized Stereo Matching
with a Broad-Spectrum and Task-Oriented Feature
by
Biyang Liu
et al
|
|
|
|
03-30-2022
|
Stochastic Backpropagation: A Memory Efficient Strategy
for Training Video Models
by
Feng Cheng
et al
|
|
|
|
03-30-2022
|
Casual 6-DoF: free-viewpoint panorama using a handheld
360 camera
by
Rongsen Chen
et al
|
|
|
|
03-31-2022
|
Dynamic Multimodal Fusion
by
Zihui Xue
et al
|
|
|
|
03-31-2022
|
CADG: A Model Based on Cross Attention for Domain
Generalization
by
Cheng Dai
et al
|
|
|
|