Spaces:
Running
Running
commit files to HF hub
Browse files- papers.csv +21 -21
papers.csv
CHANGED
@@ -85,7 +85,7 @@ Homography Guided Temporal Fusion for Road Line and Marking Segmentation,"Wang,
|
|
85 |
Zero-Shot Semantic Segmentation with Decoupled One-Shot Network,"Han, Cong*; Zhong, Yujie; Han, Kai; Dengjie, Li; Ma, Lin",poster,,,,,,,,,
|
86 |
TCOVIS: Temporally consistent online video instance segmentation,"Li, Junlong; Yu, Bingyao; Rao, Yongming; Zhou, Jie; Lu, Jiwen*",poster,,,,,,,,,
|
87 |
FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation,"Chen, Liyi*; Lei, Chenyang; Li, Ruihuang; LI, Shuai; Zhang, Zhaoxiang; Zhang, Lei",poster,,,,,,,,,
|
88 |
-
Stochastic Segmentation with Conditional Categorical Diffusion Models,"Zbinden, Lukas*; Doorenbos, Lars; Pissas, Theodoros; Huber, Adrian Thomas; Sznitman, Raphael; Márquez Neila, Pablo",poster,2303.08888,https://arxiv.org/abs/2303.08888,,https://huggingface.co/papers/2303.08888,,,,6,
|
89 |
Segmenting Everything In Context,"Wang, Xinlong*; Zhang, Xiaosong; Cao, Yue; Wang, Wen; Shen, Chunhua; Huang, Tiejun",poster,,,,,,,,,
|
90 |
Open-vocabulary Panoptic Segmentation with Embedding Modulation,"CHEN, Xi*; Li, Shuang; Lim, Ser-Nam; Torralba, Antonio; Zhao, Hengshuang",poster,2303.11324,https://arxiv.org/abs/2303.11324,,https://huggingface.co/papers/2303.11324,,,,5,0
|
91 |
Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation,"Liu, Yuyuan*; Ding, Choubo; Tian, Yu; Pang, Guansong; Belagiannis, Vasileios; Reid, Ian; Carneiro, Gustavo",poster,2211.14512,https://arxiv.org/abs/2211.14512,https://github.com/yyliu01/RPL,https://huggingface.co/papers/2211.14512,,,,7,0
|
@@ -98,7 +98,7 @@ Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Obje
|
|
98 |
Semi-Supervised Semantic Segmentation under Label Noise via Diverse Learning Groups,"Li, Peixia*; Purkait, Pulak; Ajanthan, Thalaiyasingam; Abdolshah, Majid; Garg, Ravi; Husain, Hisham; Xu, Chenchen; Gould, Stephen; Ouyang, Wanli; van den Hengel, Anton",poster,,,,,,,,,
|
99 |
SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets,"Simons, Cody M*; Raychaudhuri, Dripta S.; AHMED, SK MIRAJ; You, Suya; Karydis, Konstantinos; Roy-Chowdhury, Amit K. ",poster,2308.11880,https://arxiv.org/abs/2308.11880,https://github.com/csimo005/SUMMIT,https://huggingface.co/papers/2308.11880,,,,6,0
|
100 |
Class-incremental Continual Learning for Instance Segmentation with Image-level Weak Supervision,"Hsieh, Yu-Hsing*; Chen, Guan-Sheng; Cai, Shun-Xian; Wei, Ting-Yun; Yang, Huei-Fang; Chen, Chu-Song",poster,,,,,,,,,
|
101 |
-
Coarse-to-Fine Amodal Segmentation with Shape Prior,"Gao, Jianxiong; Qian, Xuelin*; Fu, Yanwei; Wang, Yikai; Xiao, Tianjun; Zhang, Zheng; He, Tong",poster,2308.16825,https://arxiv.org/abs/2308.16825,,https://huggingface.co/papers/2308.16825,,,,7,
|
102 |
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation,"Fan, Ke; Lei, Jingshi; Qian, Xuelin*; Yu, Miaopeng; Zhang, Zheng; He, Tong; Xiao, Tianjun; Fu, Yanwei",poster,,,,,,,,,
|
103 |
DVIS: Decoupled Video Instance Segmentation Framework,"Zhang, Tao*; tian, xingye; Wu, Yu; Ji, Shunping; Wang, Xuebo; Zhang, Yuan; Wan, Pengfei ",poster,2306.03413,https://arxiv.org/abs/2306.03413,,https://huggingface.co/papers/2306.03413,,,,7,0
|
104 |
3D Segmentation of Humans in Point Clouds with Synthetic Data,"Takmaz, Ayca*; Schult, Jonas; Kaftan, Irem; Akcay, Cafer Mertcan; Leibe, Bastian; Sumner, Robert W; Engelmann, Francis; Tang, Siyu",poster,2212.00786,https://arxiv.org/abs/2212.00786,,https://huggingface.co/papers/2212.00786,,,,8,1
|
@@ -157,7 +157,7 @@ Spatial-Aware Token for Weakly Supervised Object Localization,"pingyu, wu; Zhai,
|
|
157 |
Towards Improved Input Masking for Convolutional Neural Networks,"Balasubramanian, Sriram*; Feizi, Soheil",poster,2211.14646,https://arxiv.org/abs/2211.14646,https://github.com/SriramB-98/layer_masking,https://huggingface.co/papers/2211.14646,,,,2,1
|
158 |
PDiscoNet: Semantically consistent part discovery for fine-grained recognition,"van der Klis, Robert D; Alaniz, Stephan; Mancini, Massimiliano; Dantas, Cassio F.; Ienco, Dino; Akata, Zeynep; Marcos, Diego*",poster,,,,,,,,,
|
159 |
Corrupting Neuron Explanations of Deep Visual Features,"Srivastava, Divyansh*; Oikarinen, Tuomas; Weng, Lily",poster,,,,,,,,,
|
160 |
-
ICICLE: Interpretable Class Incremental Continual Learning,"Rymarczyk, Dawid Damian*; van de Weijer, Joost; Zieli?ski, Bartosz; Twardowski, Bartlomiej",poster,2303.07811,https://arxiv.org/abs/2303.07811,,https://huggingface.co/papers/2303.07811,,,,4,
|
161 |
ProbVLM: Probabilistic Adapter for Frozen Vison-Language Models,"Upadhyay, Uddeshya*; Karthik, Shyamgopal; Mancini, Massimiliano; Akata, Zeynep",poster,2307.00398,https://arxiv.org/abs/2307.00398,,https://huggingface.co/papers/2307.00398,,,,4,0
|
162 |
Out-of-Distribution Detection for Monocular Depth Estimation,"Hornauer, Julia*; Holzbock, Adrian; Belagiannis, Vasileios",poster,2308.06072,https://arxiv.org/abs/2308.06072,,https://huggingface.co/papers/2308.06072,,,,3,0
|
163 |
Using Explanations to Guide Models,"Rao, Sukrut*; Böhle, Moritz; Parchami-Araghi, Amin; Schiele, Bernt",poster,2303.11932,https://arxiv.org/abs/2303.11932,,https://huggingface.co/papers/2303.11932,,,,4,1
|
@@ -212,7 +212,7 @@ HairNeRF: Geometry-Aware Hair Swapped Image Synthesis,"Chang, Seunggyu*; Kim, Gi
|
|
212 |
SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training,"Lin, Yuanze; Wei, Chen; Wang, Huiyu; Yuille, Alan; Xie, Cihang*",poster,2211.11446,https://arxiv.org/abs/2211.11446,,https://huggingface.co/papers/2211.11446,,,,5,0
|
213 |
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model,"Jin, Peng*; Li, Hao; Cheng, Zesen; Li, Kehan; Ji, Xiangyang; Liu, Chang; Yuan, Li; Chen, Jie",poster,2303.09867,https://arxiv.org/abs/2303.09867,https://github.com/jpthu17/DiffusionRet,https://huggingface.co/papers/2303.09867,,,,8,0
|
214 |
Explore and Tell: Embodied Visual Captioning in 3D Environments,"Hu, Anwen*; Chen, Shizhe; Zhang, Liang; Jin, Qin",poster,2308.10447,https://arxiv.org/abs/2308.10447,,https://huggingface.co/papers/2308.10447,,,,4,1
|
215 |
-
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability,"Li, Xuanlin*; Fang, Yunhao; Liu, Minghua; Ling, Zhan; Tu, Zhuowen; Su, Hao",poster,2307.03135,https://arxiv.org/abs/2307.03135,https://github.com/xuanlinli17/large_vlm_distillation_ood,https://huggingface.co/papers/2307.03135,,,,6,
|
216 |
Learning Trajectory-Word Alignments for Video-Language Tasks,"YANG, XU; Li, Zhangzikang*; Xu, Haiyang; Zhang, Hanwang; Ye, Qinghao; Li, Chenliang; Yan, Ming; Zhang, Yu; Huang, Fei; Huang, Songfang",poster,2301.01953,https://arxiv.org/abs/2301.01953,,https://huggingface.co/papers/2301.01953,,,,10,0
|
217 |
Variational Causal Inference Network for Explanatory Visual Question Answering,"Xue, Dizhan*; Qian, Shengsheng; Xu, Changsheng",poster,,,,,,,,,
|
218 |
TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation,"Ye-Bin, Moon*; Kim, Jisoo; Kim, Hongyeob; son, kilho; Oh, Tae-Hyun",poster,2307.14611,https://arxiv.org/abs/2307.14611,,https://huggingface.co/papers/2307.14611,,,,5,0
|
@@ -335,7 +335,7 @@ Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight T
|
|
335 |
ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes,"Yeshwanth, Chandan*; Liu, Yueh-Cheng; Niessner, Matthias; Dai, Angela",oral,2308.11417,https://arxiv.org/abs/2308.11417,,https://huggingface.co/papers/2308.11417,,,,4,0
|
336 |
Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach,"Lu, Jiachen; Peng, Renyuan; Cai, Xinyue; Xu, Hang; Li, Hongyang; Wen, Feng; Zhang, Wei; Zhang, Li*",oral,,,,,,,,,
|
337 |
Doppelgangers: Learning to Disambiguate Images of Similar Structures,"Cai, Ruojin*; Tung, Joseph; Wang, Qianqian; Averbuch-Elor, Hadar; Hariharan, Bharath; Snavely, Noah",oral,,,,,,,,,
|
338 |
-
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries,"Mai, Jinjie*; Hamdi, Abdullah J; Giancola, Silvio; Zhao, Chen; Ghanem, Bernard",oral,2212.06969,https://arxiv.org/abs/2212.06969,https://github.com/Wayne-Mai/EgoLoc,https://huggingface.co/papers/2212.06969,,,,5,
|
339 |
ClothPose: A Real-world Benchmark for Visual Analysis of Garment Pose via An Indirect Recording Solution,"Xu, Wenqiang*; Du, Wenxin; Xue, Han; Li, Yutong; Ye, Ruolin; Wang, Yan-Feng; Lu, Cewu",oral,,,,,,,,,
|
340 |
EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity,"Jiang, Zijie*; Okutomi, Masatoshi",oral,,,,,,,,,
|
341 |
ENVIDR: Implicit Differentiable Renderer with Neural Environment Lighting,"Liang, Ruofan*; Chen, Huiting; Li, Chunlin; Chen, Fan; Panneer, Selvakumar; Vijaykumar, Nandita",oral,2303.13022,https://arxiv.org/abs/2303.13022,,https://huggingface.co/papers/2303.13022,,,,6,0
|
@@ -347,7 +347,7 @@ Robust Evaluation of Diffusion-Based Adversarial Purification,"Lee, Minjong*; Ki
|
|
347 |
Advancing Example Exploitation Can Alleviate Critical Challenges in Adversarial Training,"Ge, Yao*; Li, Yun; Han, Keji; Zhu, Junyi; Long, Xianzhong",oral,,,,,,,,,
|
348 |
The Victim and The Beneficiary: Exploiting a Poisoned Model to Train a Clean Model on Poisoned Data,"Zhu, Zixuan*; Wang, Rui; Zou, Cong; Jing, Lihua",oral,,,,,,,,,
|
349 |
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models,"Sur, Indranil*; Sikka, Karan; Walmer, Matthew; Koneripalli, Kaushik; Roy, Anirban; Lin, Xiao; Divakaran, Ajay; Jha, Susmit",oral,2308.03906,https://arxiv.org/abs/2308.03906,https://github.com/SRI-CSL/TIJO,https://huggingface.co/papers/2308.03906,,,,8,1
|
350 |
-
SAGA: Spectral Adversarial Geometric Attack on 3D Meshes,"Stolik, Tomer*; Lang, Itai; Avidan, Shai",poster,2211.13775,https://arxiv.org/abs/2211.13775,https://github.com/StolikTomer/SAGA,https://huggingface.co/papers/2211.13775,,,,3,
|
351 |
Benchmarking and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial Examples,"Ji, Qiufan*; Wang, Lin ; Hu, Shengshan; Sun, Lichao; Shi, Cong; Chen, Yingying",poster,2307.16361,https://arxiv.org/abs/2307.16361,https://github.com/qiufan319/benchmark_pc_attack.git,https://huggingface.co/papers/2307.16361,,,,6,0
|
352 |
ACTIVE: Towards Highly Transferable 3D Physical Camouflage for Universal and Robust Vehicle Evasion,"Suryanto, Naufal*; Kim, Yongsu; Larasati, Harashta Tatimma; Kang, Hyoeun; Le, Thi-Thu-Huong; Hong, Yoonyoung; Yang, Hunmin; Oh, Se-Yoon; Kim, Howon",poster,2308.07009,https://arxiv.org/abs/2308.07009,,https://huggingface.co/papers/2308.07009,,,,9,1
|
353 |
Frequency-aware GAN for Adversarial Manipulation Generation,"Zhu, Peifei*; Osada, Genki; Kataoka, Hirokatsu; Takahashi, Tsubasa",poster,,,,,,,,,
|
@@ -575,7 +575,7 @@ FeatEnHancer: Enhancing Hierarchical Features for Object Detection and Beyond Un
|
|
575 |
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds,"Ma, Tao*; Yang, Xuemeng; Zhou, Hongbin; Li, Xin; Shi, Botian; Liu, Junjie; Yang, Yuchen; Liu, Zhizheng; He, Liang; Li, Hongsheng; Li, Yikang; Qiao, Yu",poster,2306.06023,https://arxiv.org/abs/2306.06023,,https://huggingface.co/papers/2306.06023,,,,12,0
|
576 |
DETRs with Collaborative Hybrid Assignments Training,"Zong, Zhuofan*; Song, Guanglu; Liu, Yu",poster,2211.12860,https://arxiv.org/abs/2211.12860,https://github.com/Sense-X/Co-DETR,https://huggingface.co/papers/2211.12860,,,,3,0
|
577 |
Open Vocabulary Object Detection With an Open Corpus,"Wang, Jiong*; zhang, huiming; Hong, Haiwen; Jin, Xuan; He, Yuan; xue, hui; Zhao, Zhou",poster,,,,,,,,,
|
578 |
-
SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining,"Suri, Saksham*; Rambhatla, Sai Saketh ; Chellappa, Rama; Shrivastava, Abhinav",poster,2201.04620,https://arxiv.org/abs/2201.04620,,https://huggingface.co/papers/2201.04620,,,,4,
|
579 |
Unsupervised Anomaly Detection with Diffusion Probabilistic Model,"Zhang, Xinyi*; Li, Naiqi; Li, Jiawei; Dai, Tao; Jiang, Yong; Xia, Shu-Tao",poster,,,,,,,,,
|
580 |
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation,"Wang, Haiyang*; Tang, Hao; Shi, Shaoshuai; Li, Aoxue; Li, Zhenguo; Schiele, Bernt; Wang, Liwei",poster,,,,,,,,,
|
581 |
Focus the Discrepancy: Intra- and Inter-Correlation Learning for Image Anomaly Detection,"Yao, Xincheng*; Li, Ruoqi; Qian, Zefeng; Luo, Yan; Zhang, Chongyang",poster,,,,,,,,,
|
@@ -626,7 +626,7 @@ End-to-End Diffusion Latent Optimization Improves Classifier Guidance,"Wallace,
|
|
626 |
Deep Geometrized Cartoon Line Inbetweening,"Siyao, Li*; Gu, Tianpei; Xiao, Weiye; Ding, Henghui; Liu, Ziwei; Loy, Chen Change",poster,,,,,,,,,
|
627 |
UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation,"Fu, Jianglin; Li, Shikai; Jiang, Yuming; Lin, Kwan-Yee; Wu, Wayne*; Liu, Ziwei",poster,,,,,,,,,
|
628 |
Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond ,"Zhao, Yang*; Hou, Tingbo; Su, Yu-Chuan; Jia, Xuhui; Li, Yandong; Grundmann, Matthias",poster,,,,,,,,,
|
629 |
-
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning,"Han, Ligong*; Li, Yinxiao; Zhang, Han; Milanfar, Peyman; Metaxas, Dimitris N.; Yang, Feng",poster,2303.11305,https://arxiv.org/abs/2303.11305,,https://huggingface.co/papers/2303.11305,,,,6,
|
630 |
MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices,"Sargsyan, Andranik*; Navasardyan, Shant; Xu, Xingqian; Shi, Humphrey",poster,,,,,,,,,
|
631 |
Structure and Content-Guided Video Synthesis with Diffusion Models,"Esser, Patrick*; Chiu, Johnathan; Atighehchian, Parmida PA; Granskog, Jonathan; Germanidis, Anastasis",poster,2302.03011,https://arxiv.org/abs/2302.03011,,https://huggingface.co/papers/2302.03011,,,,5,0
|
632 |
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation,"Jiang, Yuxin; Jiang, Liming*; Yang, Shuai; Loy, Chen Change",poster,2308.12968,https://arxiv.org/abs/2308.12968,,https://huggingface.co/papers/2308.12968,,,,4,1
|
@@ -706,7 +706,7 @@ Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representa
|
|
706 |
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval,"Li, Pandeng*; Xie, Chen-Wei; Zhao, Liming; Xie, Hongtao; Ge, Jiannan; Zheng, Yun; Zhao, Deli; Zhang, Yongdong",oral,,,,,,,,,
|
707 |
Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning,"He, Junwen*; Wang, Yifan; Wang, Lijun; Lu, Huchuan; Luo, Bin; He, Jun-Yan; Lan, Jin-Peng; Geng, Yifeng; Xie, Xuansong",oral,2307.14786,https://arxiv.org/abs/2307.14786,,https://huggingface.co/papers/2307.14786,,,,9,1
|
708 |
LogicSeg: Parsing Visual Semantics with Neural Logic Learning and Reasoning,"Li, Liulei; Wang, Wenguan*; Yang, Yi",oral,,,,,,,,,
|
709 |
-
ASIC: Aligning Sparse in-the-wild Image Collections,"Gupta, Kamal*; Jampani, Varun; Shrivastava, Abhinav; Makadia, Ameesh; Snavely, Noah; Esteves, Carlos; Kar, Abhishek",oral,2303.16201,https://arxiv.org/abs/2303.16201,,https://huggingface.co/papers/2303.16201,,,,7,
|
710 |
CLIPascene: Scene Sketching with Different Types and Levels of Abstraction,"Vinker, Yael*; Alaluf, Yuval; Cohen-Or, Danny; Shamir, Ariel",oral,2211.17256,https://arxiv.org/abs/2211.17256,,https://huggingface.co/papers/2211.17256,,,,4,0
|
711 |
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation,"PNVR, Koutilya*; Singh, Bharat; Ghosh, Pallabi; Jacobs, David; Siddiquie, Behjat",oral,,,,,,,,,
|
712 |
TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models,"Cao, Tianshi*; Kreis, Karsten; Fidler, Sanja; Sharp, Nicholas; Yin, Kangxue",oral,,,,,,,,,
|
@@ -725,7 +725,7 @@ View Consistent Purification for Accurate Cross-View Localization,"Wang, Shan*;
|
|
725 |
Semi-supervised Semantics-guided Adversarial Training for Robust Trajectory Prediction,"Jiao, Ruochen*; Liu, Xiangguo; SATO, TAKAMI; Chen, Alfred; Qi, Zhu",poster,,,,,,,,,
|
726 |
NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping,"DENG, Junyuan; Wu, Qi; Chen, Xieyuanli*; Xia, Songpengcheng; Sun, Zhen; Liu, Guoqing; Yu, Wenxian; Pei, Ling",poster,,,,,,,,,
|
727 |
MapPrior: A Generative Approach for Birds-Eye View Perception,"Zhu, Xiyue*; Zyrianov, Vlas; Liu, Zhijian; Wang, Shenlong",poster,,,,,,,,,
|
728 |
-
Hidden Biases of End-to-End Driving Models,"Jaeger, Bernhard*; Chitta, Kashyap; Geiger, Andreas",poster,2306.07957,https://arxiv.org/abs/2306.07957,,https://huggingface.co/papers/2306.07957,,,,3,
|
729 |
Search for or Navigate to? Dual Adaptive Thinking for Object Navigation,"Dang, Ronghao*; Wang, Liuyi; He, Zongtao; Su, Shuai; Tang, Jiagui; Liu, Chengju; Chen, Qijun",poster,2208.00553,https://arxiv.org/abs/2208.00553,,https://huggingface.co/papers/2208.00553,,,,6,0
|
730 |
Segmenting Known Objects and Unseen Unknowns without Prior Knowledge,"Gasperini, Stefano*; Marcos-Ramiro, Alvaro; Schmidt, Michael; Navab, Nassir; Busam, Benjamin ; Tombari, Federico",poster,2209.05407,https://arxiv.org/abs/2209.05407,,https://huggingface.co/papers/2209.05407,,,,6,1
|
731 |
BiFF: Bi-level Future Fusion with Polyline-based Coordinate for Interactive Trajectory Prediction,"ZHU, Yiyao*; LUAN, Di; Shen, Shaojie",poster,2306.14161,https://arxiv.org/abs/2306.14161,,https://huggingface.co/papers/2306.14161,,,,3,0
|
@@ -814,7 +814,7 @@ DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields,"Zhang, Ju
|
|
814 |
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection,"Zhang, Renrui*; Qiu, Han; Wang, Tai; Guo, Ziyu; Cui, Ziteng; Gao, Peng; Qiao, Yu; Li, Hongsheng",poster,2203.13310,https://arxiv.org/abs/2203.13310,https://github.com/ZrrSkywalker/MonoDETR,https://huggingface.co/papers/2203.13310,,,,9,0
|
815 |
ReLeaPS : Reinforcement Learning-based Illumination Planning for Generalized Photometric Stereo,"Chan, Jun Hoong*; Yu, Bohan; Guo, Heng; Ren, Jieji; Lu, Zongqing; Shi, Boxin",poster,,,,,,,,,
|
816 |
Convex Decomposition of Indoor Scenes,"Vavilala, Vaibhav S*; Forsyth, David",poster,2307.04246,https://arxiv.org/abs/2307.04246,,https://huggingface.co/papers/2307.04246,,,,2,0
|
817 |
-
NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes,"Irshad, Muhammad Zubair*; Zakharov, Sergey; Liu, Katherine; Guizilini, Vitor; Kollar, Thomas; Gaidon, Adrien; Ambru?, Rare? A; Kira, Zsolt",poster,2308.12967,https://arxiv.org/abs/2308.12967,https://github.com/zubair-irshad/NeO-360,https://huggingface.co/papers/2308.12967,,,8,
|
818 |
UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields,"Yang, Yuanbo*; Yang, Yifei; Guo, Hanlei; Xiong, Rong; Wang, Yue; Liao, Yiyi",poster,2303.14167,https://arxiv.org/abs/2303.14167,,https://huggingface.co/papers/2303.14167,,,,6,0
|
819 |
Efficient Converted Spiking Neural Network for 3D and 2D classification,"Lan, Yuxiang; Zhang, Yachao; Ma, Xu; Qu, Yanyun*; FU, YUN",poster,,,,,,,,,
|
820 |
Distribution-Aligned Diffusion for Human Mesh Recovery,"Foo, Lin Geng*; Gong, Jia; Rahmani, Hossein; Liu, Jun",poster,2308.13369,https://arxiv.org/abs/2308.13369,,https://huggingface.co/papers/2308.13369,,,,4,0
|
@@ -893,7 +893,7 @@ PVT++: A Simple End-to-End Latency-Aware Visual Tracking Framework,"Li, Bowen*;
|
|
893 |
EigenTrajectory: Low-Rank Descriptors for Multi-Modal Trajectory Forecasting,"Bae, Inhwan*; Oh, Jean; Jeon, Hae-Gon",poster,2307.09306,https://arxiv.org/abs/2307.09306,https://github.com/inhwanbae/EigenTrajectory,https://huggingface.co/papers/2307.09306,,,,3,1
|
894 |
RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation,"Wan, Zhexiong; Mao, Yuxin; Zhang, Jing; Dai, Yuchao*",poster,,,,,,,,,
|
895 |
Multi-Scale Bidirectional Recurrent Network with Hybrid Correlation for Point Cloud Based Scene Flow Estimation,"CHENG, WENCAN; Ko, Jong Hwan*",poster,,,,,,,,,
|
896 |
-
ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking,"Cheng, Cheng-Che*; Qiu, Min-Xuan; Chiang, Chen-Kuo; Lai, Shang-Hong",poster,2308.13229,https://arxiv.org/abs/2308.13229,,https://huggingface.co/papers/2308.13229,,,,4,
|
897 |
"TAPIR: Tracking Any Point, Initialized per-frame, Refined temporally","Doersch, Carl*; Yang, Yi; Vecerik, Mel; Gokay, Dilara; Gupta, Ankush; Aytar, Yusuf; Carreira, Joao; Zisserman, Andrew",poster,,,,,,,,,
|
898 |
IHNet: Iterative Hierarchical Network Guided by High-Resolution Estimated Information for Scene Flow,"Wang, Yun*; Chi, Cheng; Lin, Min; Yang, Xin",poster,,,,,,,,,
|
899 |
Can Language Models Transfer to Social Gesture Motion Generation?,"Ng, Evonne*; Subramanian, Sanjay; Klein, Dan; Kanazawa, Angjoo; Darrell, Trevor; Ginosar, Shiry",poster,,,,,,,,,
|
@@ -954,7 +954,7 @@ Generalized Lightness Adaptation with Channel Selective Normalization,"Yao, Ming
|
|
954 |
Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction,"Qu, Delin*; Lao, Yizhen; Wang, Zhigang; Wang, Dong; Zhao, Bin; Li, Xuelong",poster,2303.18125,https://arxiv.org/abs/2303.18125,https://github.com/DelinQu/qrsc,https://huggingface.co/papers/2303.18125,,,,6,0
|
955 |
FCCNs: Fully Complex-valued Convolutional Networks using Complex-valued Color Model and Loss Function,"Yadav, Saurabh*; Jerripothula, Koteswar Rao",poster,,,,,,,,,
|
956 |
Event Camera Data Pre-training,"Yang, Yan*; Pan, Liyuan; liu, Liu",poster,2301.01928,https://arxiv.org/abs/2301.01928,,https://huggingface.co/papers/2301.01928,,,,3,0
|
957 |
-
Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models,"Lee, Suhyeon*; Chung, Hyungjin; Park, Min Young; Park, Jonghyeok; Ryu, Wi-Sun; Ye, Jong Chul",poster,2303.08440,https://arxiv.org/abs/2303.08440,,https://huggingface.co/papers/2303.08440,,,,6,
|
958 |
Multiscale Structure Guided Diffusion for Image Deblurring,"Ren, Mengwei*; Delbracio, Mauricio; Talebi, Hossein ; Gerig, Guido; Milanfar, Peyman",poster,2212.01789,https://arxiv.org/abs/2212.01789,,https://huggingface.co/papers/2212.01789,,,,5,0
|
959 |
Generalizing Event-Based Motion Deblurring in Real-World Scenarios,"Zhang, Xiang; Yu, Lei*; Yang, Wen; Liu, Jianzhuang; Xia, Gui-Song",poster,2308.05932,https://arxiv.org/abs/2308.05932,,https://huggingface.co/papers/2308.05932,,,,5,0
|
960 |
On the Robustness of Normalizing Flows for Inverse Problems in Imaging,"Hong, Seongmin; PARK, INBUM; Chun, Se Young*",poster,2212.04319,https://arxiv.org/abs/2212.04319,,https://huggingface.co/papers/2212.04319,,,,3,0
|
@@ -975,7 +975,7 @@ Perpetual Humanoid Control for Real-time Simulated Avatars,"Luo, Zhengyi*; Cao,
|
|
975 |
Grounding 3D Object Affordance from 2D Interactions in Images,"Yang, Yuhang; Zhai, Wei; Luo, Hongchen; Cao, Yang*; Luo, Jiebo; Zha, Zheng-Jun",poster,2303.10437,https://arxiv.org/abs/2303.10437,https://github.com/yyvhang/IAGNet,https://huggingface.co/papers/2303.10437,,,,6,0
|
976 |
Navigating to Objects Specified by Images,"Krantz, Jacob*; Gervet, Theophile; Yadav, Karmesh; Wang, Austin S; Paxton, Chris; Mottaghi, Roozbeh; Batra, Dhruv; Malik, Jitendra; Lee, Stefan; Chaplot, Devendra Singh",poster,2304.01192,https://arxiv.org/abs/2304.01192,,https://huggingface.co/papers/2304.01192,,,,10,1
|
977 |
PEANUT: Predicting and Navigating to Unseen Targets,"Zhai, Albert J*; Wang, Shenlong",poster,2212.02497,https://arxiv.org/abs/2212.02497,,https://huggingface.co/papers/2212.02497,,,,2,0
|
978 |
-
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents,"Kim, Byeonghwi; kim, jinyeon; Kim, yuyeong; Min, Cheolhong; Choi, Jonghyun*",poster,2308.07241,https://arxiv.org/abs/2308.07241,,https://huggingface.co/papers/2308.07241,,,,5,
|
979 |
Learning Foresightful Dense Visual Affordance for Deformable Object Manipulation,"Wu, Ruihai; Ning, Chuanruo; Dong, Hao*",poster,2303.11057,https://arxiv.org/abs/2303.11057,,https://huggingface.co/papers/2303.11057,,,,3,0
|
980 |
Exploiting Proximity-Aware Tasks for Embodied Social Navigation,"Cancelli, Enrico; Campari, Tommaso; Serafini, Luciano; Chang, Angel X; Ballan, Lamberto*",poster,2212.00767,https://arxiv.org/abs/2212.00767,,https://huggingface.co/papers/2212.00767,,,,5,0
|
981 |
Object-Aware Cognitive BirdÂs-Eye-View Grids for Vision-Language Navigation,"Liu, Rui; Wang, Xiaohan; Wang, Wenguan; Yang, Yi*",poster,,,,,,,,,
|
@@ -1002,7 +1002,7 @@ Modality Unifying Network for Visible Infrared Person Re-Identification,"Yu, Hao
|
|
1002 |
DeepChange: A Long-Term Person Re-Identification Benchmark with Clothes Change,"Xu, Peng*; Zhu, Xiatian",poster,,,,,,,,,
|
1003 |
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval,"Luo, Ziyang*; Zhao, Pu; Xu, Can; Geng, Xiubo; Shen, Tao; Tao, Chongyang; Ma, Jing; Lin, Qingwei; Jiang, Daxin",poster,,,,,,,,,
|
1004 |
Dual Pseudo-Labels Interactive Self-Training for Semi-Supervised Visible-Infrared Person Re-Identification,"Shi, Jiangming; Zhang, Yachao; Yin, Xiangbo; Xie, Yuan; Zhang, Zhizhong; Fan, Jianping; shi, zhongchao; Qu, Yanyun*",poster,,,,,,,,,
|
1005 |
-
$BT^2$: Backward-compatible Training with Basis Transformation,"Zhou, Yifei*; Li, Zilu; Shrivastava, Abhinav; Zhao, Hengshuang; Torralba, Antonio; Tian, Tai-Peng; Lim, Ser-Nam",poster,2211.03989,https://arxiv.org/abs/2211.03989,,https://huggingface.co/papers/2211.03989,,,,7,
|
1006 |
Prototypical Mixup and Retrieval-based Refinement for Label Noise-resistant Image Retrieval,"Yang, Xinlong*; Wang, Haixin; Sun, Jinan; Zhang, Shikun; Chen, Chong; Hua, Xian-Sheng; Luo, Xiao",poster,,,,,,,,,
|
1007 |
Learning Spatial-context-aware Global Visual Feature Representation for Instance Image Retrieval,"Zhang, Zhongyan*; Wang, Lei; Zhou, Luping; Koniusz, Piotr",poster,,,,,,,,,
|
1008 |
Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval,"zhu, yunquan*; Gao, Xinkai; Ke, Bo; Qiao, Ruizhi; Sun, Xing",poster,,,,,,,,,
|
@@ -1218,7 +1218,7 @@ Markov Game Video Augmentation for Action Segmentation,"Aziere, Nicolas*; Todoro
|
|
1218 |
COOL-CHIC: Coordinate-based Low Complexity Hierarchical Image Codec,"Ladune, Théo*; Philippe, Pierrick; Henry, Felix E; clare, gordon; Leguay, Thomas",poster,,,,,,,,,
|
1219 |
ReGen: A good Generative zero-shot video classifier should be Rewarded,"Bulat, Adrian*; Sanchez, Enrique; Martinez, Brais; Tzimiropoulos, Georgios",poster,,,,,,,,,
|
1220 |
Task Agnostic Restoration of Natural Video Dynamics,"Ali, Muhammad Kashif; Kim, Dongjin; Kim, Tae Hyun*",poster,2206.03753,https://arxiv.org/abs/2206.03753,https://github.com/MKashifAli/TARONVD,https://huggingface.co/papers/2206.03753,,,,3,0
|
1221 |
-
Normalizing Flows for Human Pose Anomaly Detection,"Hirschorn, Or*; Avidan, Shai",poster,2211.10946,https://arxiv.org/abs/2211.10946,,https://huggingface.co/papers/2211.10946,,,,2,
|
1222 |
Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection,"Zhao, Zixuan; Wang, Dongqi; Zhao, Xu*",poster,,,,,,,,,
|
1223 |
Event-Guided Procedure Planning from Instructional Videos with Text Supervision,"Wang, An-Lan; Lin, Kun-Yu; Du, Jia-Run; Meng, Jingke; ZHENG, WEI-SHI*",poster,2308.08885,https://arxiv.org/abs/2308.08885,,https://huggingface.co/papers/2308.08885,,,,5,0
|
1224 |
SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval,"Yoon, Sunjae*; Koo, GwanHyeong; Kim, DaHyun; Yoo, Chang D.",poster,,,,,,,,,
|
@@ -1302,7 +1302,7 @@ Attention Discriminant Sampling for Point Clouds,"Hong, Cheng-Yao*; Chou, Yu-Yin
|
|
1302 |
SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation,"Koo, Juil*; Yoo, Seungwoo; Nguyen, Hieu Minh; Sung, Minhyuk",poster,2303.12236,https://arxiv.org/abs/2303.12236,,https://huggingface.co/papers/2303.12236,,,,4,0
|
1303 |
MAPConNet: Self-supervised 3D Pose Transfer with Mesh and Point Contrastive Learning,"Sun, Jiaze*; Chen, Zhixiang; Kim, Tae-Kyun (T-K)",poster,2304.13819,https://arxiv.org/abs/2304.13819,,https://huggingface.co/papers/2304.13819,,,,3,0
|
1304 |
Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition,"Yi, Xuanyu*; Deng, Jiajun; Sun, Qianru; Hua, Xian-Sheng; Lim, Joo-Hwee; Zhang, Hanwang",poster,2308.09694,https://arxiv.org/abs/2308.09694,,https://huggingface.co/papers/2308.09694,,,,6,0
|
1305 |
-
EPiC: Ensemble of Partial Point Clouds for Robust Classification,"Levi, Meir Yossef*; Gilboa, Guy",poster,2303.11419,https://arxiv.org/abs/2303.11419,https://github.com/yossilevii100/EPiC,https://huggingface.co/papers/2303.11419,,,,2,
|
1306 |
Leveraging Intrinsic Properties for Non-Rigid Garment Alignment,"Lin, Siyou; ZHOU, Boyao; Zheng, Zerong; Zhang, Hongwen; Liu, Yebin*",poster,2308.09519,https://arxiv.org/abs/2308.09519,,https://huggingface.co/papers/2308.09519,,,,5,0
|
1307 |
Spatially and Spectrally Consistent Deep Functional Maps,"Sun, Mingze; Mao, Shiwei; Jiang, Puhua; Ovsjanikov, Maks; Huang, Ruqi*",poster,2308.08871,https://arxiv.org/abs/2308.08871,https://github.com/rqhuang88/Spatiallyand-Spectrally-Consistent-Deep-Functional-Maps,https://huggingface.co/papers/2308.08871,,,,5,0
|
1308 |
SVDFormer: Complementing Point Cloud via Self-view Augmentation and Self-structure Dual-generator,"Zhu, Zhe*; Chen, Honghua; He, Xing; Wang, Weiming; Qin, Jing; Wei, Mingqiang",poster,2307.08492,https://arxiv.org/abs/2307.08492,https://github.com/czvvd/SVDFormer,https://huggingface.co/papers/2307.08492,,,,6,0
|
@@ -1342,7 +1342,7 @@ MixSynthFormer: A Transformer Encoder-like Structure with Mixed Synthetic Self-a
|
|
1342 |
Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction,"Leng, Zhiying*; Wu, shuncheng; Saleh, Mahdi; Montanaro, Antonio; Yu, Hao; Wang, Yin; Navab, Nassir; Liang, Xiaohui; Tombari, Federico",poster,,,,,,,,,
|
1343 |
Human from Blur: Human Pose Tracking from Blurry Images,"Zhao, Yiming*; Rozumnyi, Denys; Song, Jie; Hilliges, Otmar; Pollefeys, Marc; Oswald, Martin R.",poster,2303.17209,https://arxiv.org/abs/2303.17209,,https://huggingface.co/papers/2303.17209,,,,6,0
|
1344 |
AG3D: Learning to Generate 3D Avatars from 2D Image Collections,"Dong, Zijian*; Chen, Xu; Yang, Jinlong; Black, Michael J.; Hilliges, Otmar; Geiger, Andreas",poster,2305.02312,https://arxiv.org/abs/2305.02312,,https://huggingface.co/papers/2305.02312,,,,6,0
|
1345 |
-
InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion,"Xu, Sirui*; Li, Zhengyuan; Wang, Yu-Xiong; Gui, Liangyan",poster,2308.16905,https://arxiv.org/abs/2308.16905,https://github.com/Sirui-Xu/InterDiff,https://huggingface.co/papers/2308.16905,,,,4,
|
1346 |
SEFD: Learning to Distill Complex Pose and Occlusion,"Yang, ChangHee*; Kong, Kyeongbo; Min, Sung-Jun; Wee, Dongyoon; Jang, Ho-Deok; Cha, Geonho; Kang, Suk-Ju",poster,,,,,,,,,
|
1347 |
3D Human Mesh Recovery with Sequentially Global Rotation Estimation,"Wang, Dongkai; Zhang, Shiliang*",poster,,,,,,,,,
|
1348 |
Co-Evolution of Pose and Mesh for 3D Human Body Estimation from Video,"You, Yingxuan*; Liu, Hong; Wang, Ti; Li, Wenhao; Ding, Runwei; Li, Xia",poster,2308.10305,https://arxiv.org/abs/2308.10305,https://github.com/kasvii/PMCE,https://huggingface.co/papers/2308.10305,,,,6,0
|
@@ -1548,14 +1548,14 @@ TripLe: Revisiting Pretrained Model Reuse and Progressive Learning for Efficient
|
|
1548 |
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers,"Chen, Mengzhao*; Shao, Wenqi; Xu, Peng; Lin, Mingbao; Zhang, Kaipeng; Chao, Fei; Ji, Rongrong; Qiao, Yu; Luo, Ping",poster,2305.17997,https://arxiv.org/abs/2305.17997,https://github.com/OpenGVLab/DiffRate,https://huggingface.co/papers/2305.17997,,,,9,0
|
1549 |
Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection,"Yang, Longrong; Zhou, Xianpan; Li, Xuewei; Qiao, Liang; Li, Zheyang; Yang, Ziwei; Wang, Gaoang; Li, Xi*",poster,2308.14286,https://arxiv.org/abs/2308.14286,https://github.com/TinyTigerPan/BCKD,https://huggingface.co/papers/2308.14286,,,,8,0
|
1550 |
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels,"Yang, Zhendong*; Zeng, Ailing; Li, Zhe; Zhang, Tianke; Yuan, Chun; Li, Yu",poster,2303.13005,https://arxiv.org/abs/2303.13005,https://github.com/yzd-v/cls_KD,https://huggingface.co/papers/2303.13005,,,,6,1
|
1551 |
-
Efficient 3D Semantic Segmentation with Superpoint Transformer,"ROBERT, Damien*; Raguet, Hugo; Landrieu, Loic",poster,2306.08045,https://arxiv.org/abs/2306.08045,,https://huggingface.co/papers/2306.08045,,,,3,
|
1552 |
Dataset Quantization,"Zhou, Daquan; Wang, Kai*; Gu, Jianyang; Peng, Xiangyu; Lian, Dongze; Zhang, Yifan; You, Yang; Feng, Jiashi",poster,2308.10524,https://arxiv.org/abs/2308.10524,,https://huggingface.co/papers/2308.10524,,,,8,0
|
1553 |
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy,"Jie, Shibo*; Wang, Haoqing; Deng, Zhi-Hong",poster,2307.16867,https://arxiv.org/abs/2307.16867,https://github.com/JieShibo/PETL-ViT,https://huggingface.co/papers/2307.16867,,,,3,0
|
1554 |
RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers,"Li, Zhikai*; Xiao, Junrui; Yang, Lianwei; Gu, Qingyi",poster,,,,,,,,,
|
1555 |
Semantically Structured Image Compression via Irregular Group-Based Decoupling,"Feng, Ruoyu*; Gao, Yixin; Jin, Xin; Feng, Runsen; Chen, Zhibo",poster,2305.02586,https://arxiv.org/abs/2305.02586,,https://huggingface.co/papers/2305.02586,,,,5,0
|
1556 |
SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage,"Park, Song; Chun, Sanghyuk*; Heo, Byeongho; Kim, Wonjae; Yun, Sangdoo",poster,2303.11114,https://arxiv.org/abs/2303.11114,https://github.com/naver-ai/seit,https://huggingface.co/papers/2303.11114,,,,5,1
|
1557 |
SMMix: Self-Motivated Image Mixing for Vision Transformers,"Chen, Mengzhao*; Lin, Mingbao; Lin, Zhihang; Zhang, Yuxin; Chao, Fei; Ji, Rongrong",poster,2212.12977,https://arxiv.org/abs/2212.12977,https://github.com/ChenMnZ/SMMix,https://huggingface.co/papers/2212.12977,,,,6,0
|
1558 |
-
Multi-Label Knowledge Distillation,"Yang, Penghui*; Xie, Ming-Kun; Zong, Chen-Chen; Feng, Lei; Niu, Gang; Sugiyama, Masashi; Huang, Sheng-Jun",poster,2308.06453,https://arxiv.org/abs/2308.06453,https://github.com/penghui-yang/L2D,https://huggingface.co/papers/2308.06453,,,,7,
|
1559 |
UGC: Unified GAN Compression for Efficient Image-to-Image Translation ,"Ren, Yuxi*; Wu, Jie; Zhang, Peng; Zhang, Manlin; Xiao, Xuefeng; He, Qian; Wang, Rui; Zheng, Min ; Pan, Xin",poster,,,,,,,,,
|
1560 |
MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos,"Parger, Mathias*; Tang, Chengcheng; Neff, Thomas; Twigg, Christopher D; Keskin, Cem; Wang, Robert; Steinberger, Markus",poster,2210.09887,https://arxiv.org/abs/2210.09887,,https://huggingface.co/papers/2210.09887,,,,7,0
|
1561 |
Lightweight Multi-Scale Attention for On-Device Semantic Segmentation,"Cai, Han*; Li, Junyan; Hu, Muyan; Gan, Chuang; Han, Song",poster,,,,,,,,,
|
@@ -1781,7 +1781,7 @@ Novel Scenes & Classes: Towards Adaptive Open-set Object Detection,"Li, Wuyang*;
|
|
1781 |
Improving Unsupervised Visual Program Inference with Code Rewriting Families,"Ganeshan, Aditya*; Jones, R. Kenny; Ritchie, Daniel",oral,,,,,,,,,
|
1782 |
Denoising Diffusion Autoencoders are Unified Self-supervised Learners,"Xiang, Weilai; Yang, Hongyu*; Huang, Di; Wang, Yunhong",oral,2303.09769,https://arxiv.org/abs/2303.09769,,https://huggingface.co/papers/2303.09769,,,,4,0
|
1783 |
Self-Ordering Point Clouds,"Yang, Pengwan*; Snoek, Cees; Asano, Yuki M",oral,2304.00961,https://arxiv.org/abs/2304.00961,,https://huggingface.co/papers/2304.00961,,,,3,0
|
1784 |
-
MOST: Multiple Object localization with Self-supervised Transformers for object discovery,"Rambhatla, Sai Saketh *; Misra, Ishan; Chellappa, Rama; Shrivastava, Abhinav",oral,2304.05387,https://arxiv.org/abs/2304.05387,,https://huggingface.co/papers/2304.05387,,,,4,
|
1785 |
Self-supervised Learning for 3D Human-Object Spatial Relations from Unbounded Synthesized Images,"Han, Sookwan*; Joo, Hanbyul",oral,,,,,,,,,
|
1786 |
Identity-Seeking Self-Supervised Representation Learning for Generalizable Person Re-identification,"Dou, Zhaopeng*; Wang, Zhongdao; Li, Ya-Li; Wang, Shengjin",oral,2308.08887,https://arxiv.org/abs/2308.08887,https://github.com/dcp15/ISR_ICCV2023_Oral,https://huggingface.co/papers/2308.08887,,,,4,0
|
1787 |
Anatomical Invariance Modeling and Semantic Alignment for Self-supervised Learning in 3D Medical Image Analysis,"Jiang, Yankai*; Sun, Mingze; Guo, Heng; Bai, Xiaoyu; Yan, Ke; Lu, Le; Xu, Minfeng",oral,2302.05615,https://arxiv.org/abs/2302.05615,https://github.com/alibaba-damo-academy/alice,https://huggingface.co/papers/2302.05615,,,,7,0
|
|
|
85 |
Zero-Shot Semantic Segmentation with Decoupled One-Shot Network,"Han, Cong*; Zhong, Yujie; Han, Kai; Dengjie, Li; Ma, Lin",poster,,,,,,,,,
|
86 |
TCOVIS: Temporally consistent online video instance segmentation,"Li, Junlong; Yu, Bingyao; Rao, Yongming; Zhou, Jie; Lu, Jiwen*",poster,,,,,,,,,
|
87 |
FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation,"Chen, Liyi*; Lei, Chenyang; Li, Ruihuang; LI, Shuai; Zhang, Zhaoxiang; Zhang, Lei",poster,,,,,,,,,
|
88 |
+
Stochastic Segmentation with Conditional Categorical Diffusion Models,"Zbinden, Lukas*; Doorenbos, Lars; Pissas, Theodoros; Huber, Adrian Thomas; Sznitman, Raphael; Márquez Neila, Pablo",poster,2303.08888,https://arxiv.org/abs/2303.08888,,https://huggingface.co/papers/2303.08888,,,,6,1
|
89 |
Segmenting Everything In Context,"Wang, Xinlong*; Zhang, Xiaosong; Cao, Yue; Wang, Wen; Shen, Chunhua; Huang, Tiejun",poster,,,,,,,,,
|
90 |
Open-vocabulary Panoptic Segmentation with Embedding Modulation,"CHEN, Xi*; Li, Shuang; Lim, Ser-Nam; Torralba, Antonio; Zhao, Hengshuang",poster,2303.11324,https://arxiv.org/abs/2303.11324,,https://huggingface.co/papers/2303.11324,,,,5,0
|
91 |
Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation,"Liu, Yuyuan*; Ding, Choubo; Tian, Yu; Pang, Guansong; Belagiannis, Vasileios; Reid, Ian; Carneiro, Gustavo",poster,2211.14512,https://arxiv.org/abs/2211.14512,https://github.com/yyliu01/RPL,https://huggingface.co/papers/2211.14512,,,,7,0
|
|
|
98 |
Semi-Supervised Semantic Segmentation under Label Noise via Diverse Learning Groups,"Li, Peixia*; Purkait, Pulak; Ajanthan, Thalaiyasingam; Abdolshah, Majid; Garg, Ravi; Husain, Hisham; Xu, Chenchen; Gould, Stephen; Ouyang, Wanli; van den Hengel, Anton",poster,,,,,,,,,
|
99 |
SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal Targets,"Simons, Cody M*; Raychaudhuri, Dripta S.; AHMED, SK MIRAJ; You, Suya; Karydis, Konstantinos; Roy-Chowdhury, Amit K. ",poster,2308.11880,https://arxiv.org/abs/2308.11880,https://github.com/csimo005/SUMMIT,https://huggingface.co/papers/2308.11880,,,,6,0
|
100 |
Class-incremental Continual Learning for Instance Segmentation with Image-level Weak Supervision,"Hsieh, Yu-Hsing*; Chen, Guan-Sheng; Cai, Shun-Xian; Wei, Ting-Yun; Yang, Huei-Fang; Chen, Chu-Song",poster,,,,,,,,,
|
101 |
+
Coarse-to-Fine Amodal Segmentation with Shape Prior,"Gao, Jianxiong; Qian, Xuelin*; Fu, Yanwei; Wang, Yikai; Xiao, Tianjun; Zhang, Zheng; He, Tong",poster,2308.16825,https://arxiv.org/abs/2308.16825,,https://huggingface.co/papers/2308.16825,,,,7,1
|
102 |
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation,"Fan, Ke; Lei, Jingshi; Qian, Xuelin*; Yu, Miaopeng; Zhang, Zheng; He, Tong; Xiao, Tianjun; Fu, Yanwei",poster,,,,,,,,,
|
103 |
DVIS: Decoupled Video Instance Segmentation Framework,"Zhang, Tao*; tian, xingye; Wu, Yu; Ji, Shunping; Wang, Xuebo; Zhang, Yuan; Wan, Pengfei ",poster,2306.03413,https://arxiv.org/abs/2306.03413,,https://huggingface.co/papers/2306.03413,,,,7,0
|
104 |
3D Segmentation of Humans in Point Clouds with Synthetic Data,"Takmaz, Ayca*; Schult, Jonas; Kaftan, Irem; Akcay, Cafer Mertcan; Leibe, Bastian; Sumner, Robert W; Engelmann, Francis; Tang, Siyu",poster,2212.00786,https://arxiv.org/abs/2212.00786,,https://huggingface.co/papers/2212.00786,,,,8,1
|
|
|
157 |
Towards Improved Input Masking for Convolutional Neural Networks,"Balasubramanian, Sriram*; Feizi, Soheil",poster,2211.14646,https://arxiv.org/abs/2211.14646,https://github.com/SriramB-98/layer_masking,https://huggingface.co/papers/2211.14646,,,,2,1
|
158 |
PDiscoNet: Semantically consistent part discovery for fine-grained recognition,"van der Klis, Robert D; Alaniz, Stephan; Mancini, Massimiliano; Dantas, Cassio F.; Ienco, Dino; Akata, Zeynep; Marcos, Diego*",poster,,,,,,,,,
|
159 |
Corrupting Neuron Explanations of Deep Visual Features,"Srivastava, Divyansh*; Oikarinen, Tuomas; Weng, Lily",poster,,,,,,,,,
|
160 |
+
ICICLE: Interpretable Class Incremental Continual Learning,"Rymarczyk, Dawid Damian*; van de Weijer, Joost; Zieli?ski, Bartosz; Twardowski, Bartlomiej",poster,2303.07811,https://arxiv.org/abs/2303.07811,,https://huggingface.co/papers/2303.07811,,,,4,1
|
161 |
ProbVLM: Probabilistic Adapter for Frozen Vison-Language Models,"Upadhyay, Uddeshya*; Karthik, Shyamgopal; Mancini, Massimiliano; Akata, Zeynep",poster,2307.00398,https://arxiv.org/abs/2307.00398,,https://huggingface.co/papers/2307.00398,,,,4,0
|
162 |
Out-of-Distribution Detection for Monocular Depth Estimation,"Hornauer, Julia*; Holzbock, Adrian; Belagiannis, Vasileios",poster,2308.06072,https://arxiv.org/abs/2308.06072,,https://huggingface.co/papers/2308.06072,,,,3,0
|
163 |
Using Explanations to Guide Models,"Rao, Sukrut*; Böhle, Moritz; Parchami-Araghi, Amin; Schiele, Bernt",poster,2303.11932,https://arxiv.org/abs/2303.11932,,https://huggingface.co/papers/2303.11932,,,,4,1
|
|
|
212 |
SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training,"Lin, Yuanze; Wei, Chen; Wang, Huiyu; Yuille, Alan; Xie, Cihang*",poster,2211.11446,https://arxiv.org/abs/2211.11446,,https://huggingface.co/papers/2211.11446,,,,5,0
|
213 |
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model,"Jin, Peng*; Li, Hao; Cheng, Zesen; Li, Kehan; Ji, Xiangyang; Liu, Chang; Yuan, Li; Chen, Jie",poster,2303.09867,https://arxiv.org/abs/2303.09867,https://github.com/jpthu17/DiffusionRet,https://huggingface.co/papers/2303.09867,,,,8,0
|
214 |
Explore and Tell: Embodied Visual Captioning in 3D Environments,"Hu, Anwen*; Chen, Shizhe; Zhang, Liang; Jin, Qin",poster,2308.10447,https://arxiv.org/abs/2308.10447,,https://huggingface.co/papers/2308.10447,,,,4,1
|
215 |
+
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability,"Li, Xuanlin*; Fang, Yunhao; Liu, Minghua; Ling, Zhan; Tu, Zhuowen; Su, Hao",poster,2307.03135,https://arxiv.org/abs/2307.03135,https://github.com/xuanlinli17/large_vlm_distillation_ood,https://huggingface.co/papers/2307.03135,,,,6,1
|
216 |
Learning Trajectory-Word Alignments for Video-Language Tasks,"YANG, XU; Li, Zhangzikang*; Xu, Haiyang; Zhang, Hanwang; Ye, Qinghao; Li, Chenliang; Yan, Ming; Zhang, Yu; Huang, Fei; Huang, Songfang",poster,2301.01953,https://arxiv.org/abs/2301.01953,,https://huggingface.co/papers/2301.01953,,,,10,0
|
217 |
Variational Causal Inference Network for Explanatory Visual Question Answering,"Xue, Dizhan*; Qian, Shengsheng; Xu, Changsheng",poster,,,,,,,,,
|
218 |
TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation,"Ye-Bin, Moon*; Kim, Jisoo; Kim, Hongyeob; son, kilho; Oh, Tae-Hyun",poster,2307.14611,https://arxiv.org/abs/2307.14611,,https://huggingface.co/papers/2307.14611,,,,5,0
|
|
|
335 |
ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes,"Yeshwanth, Chandan*; Liu, Yueh-Cheng; Niessner, Matthias; Dai, Angela",oral,2308.11417,https://arxiv.org/abs/2308.11417,,https://huggingface.co/papers/2308.11417,,,,4,0
|
336 |
Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach,"Lu, Jiachen; Peng, Renyuan; Cai, Xinyue; Xu, Hang; Li, Hongyang; Wen, Feng; Zhang, Wei; Zhang, Li*",oral,,,,,,,,,
|
337 |
Doppelgangers: Learning to Disambiguate Images of Similar Structures,"Cai, Ruojin*; Tung, Joseph; Wang, Qianqian; Averbuch-Elor, Hadar; Hariharan, Bharath; Snavely, Noah",oral,,,,,,,,,
|
338 |
+
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries,"Mai, Jinjie*; Hamdi, Abdullah J; Giancola, Silvio; Zhao, Chen; Ghanem, Bernard",oral,2212.06969,https://arxiv.org/abs/2212.06969,https://github.com/Wayne-Mai/EgoLoc,https://huggingface.co/papers/2212.06969,,,,5,1
|
339 |
ClothPose: A Real-world Benchmark for Visual Analysis of Garment Pose via An Indirect Recording Solution,"Xu, Wenqiang*; Du, Wenxin; Xue, Han; Li, Yutong; Ye, Ruolin; Wang, Yan-Feng; Lu, Cewu",oral,,,,,,,,,
|
340 |
EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity,"Jiang, Zijie*; Okutomi, Masatoshi",oral,,,,,,,,,
|
341 |
ENVIDR: Implicit Differentiable Renderer with Neural Environment Lighting,"Liang, Ruofan*; Chen, Huiting; Li, Chunlin; Chen, Fan; Panneer, Selvakumar; Vijaykumar, Nandita",oral,2303.13022,https://arxiv.org/abs/2303.13022,,https://huggingface.co/papers/2303.13022,,,,6,0
|
|
|
347 |
Advancing Example Exploitation Can Alleviate Critical Challenges in Adversarial Training,"Ge, Yao*; Li, Yun; Han, Keji; Zhu, Junyi; Long, Xianzhong",oral,,,,,,,,,
|
348 |
The Victim and The Beneficiary: Exploiting a Poisoned Model to Train a Clean Model on Poisoned Data,"Zhu, Zixuan*; Wang, Rui; Zou, Cong; Jing, Lihua",oral,,,,,,,,,
|
349 |
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models,"Sur, Indranil*; Sikka, Karan; Walmer, Matthew; Koneripalli, Kaushik; Roy, Anirban; Lin, Xiao; Divakaran, Ajay; Jha, Susmit",oral,2308.03906,https://arxiv.org/abs/2308.03906,https://github.com/SRI-CSL/TIJO,https://huggingface.co/papers/2308.03906,,,,8,1
|
350 |
+
SAGA: Spectral Adversarial Geometric Attack on 3D Meshes,"Stolik, Tomer*; Lang, Itai; Avidan, Shai",poster,2211.13775,https://arxiv.org/abs/2211.13775,https://github.com/StolikTomer/SAGA,https://huggingface.co/papers/2211.13775,,,,3,1
|
351 |
Benchmarking and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial Examples,"Ji, Qiufan*; Wang, Lin ; Hu, Shengshan; Sun, Lichao; Shi, Cong; Chen, Yingying",poster,2307.16361,https://arxiv.org/abs/2307.16361,https://github.com/qiufan319/benchmark_pc_attack.git,https://huggingface.co/papers/2307.16361,,,,6,0
|
352 |
ACTIVE: Towards Highly Transferable 3D Physical Camouflage for Universal and Robust Vehicle Evasion,"Suryanto, Naufal*; Kim, Yongsu; Larasati, Harashta Tatimma; Kang, Hyoeun; Le, Thi-Thu-Huong; Hong, Yoonyoung; Yang, Hunmin; Oh, Se-Yoon; Kim, Howon",poster,2308.07009,https://arxiv.org/abs/2308.07009,,https://huggingface.co/papers/2308.07009,,,,9,1
|
353 |
Frequency-aware GAN for Adversarial Manipulation Generation,"Zhu, Peifei*; Osada, Genki; Kataoka, Hirokatsu; Takahashi, Tsubasa",poster,,,,,,,,,
|
|
|
575 |
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds,"Ma, Tao*; Yang, Xuemeng; Zhou, Hongbin; Li, Xin; Shi, Botian; Liu, Junjie; Yang, Yuchen; Liu, Zhizheng; He, Liang; Li, Hongsheng; Li, Yikang; Qiao, Yu",poster,2306.06023,https://arxiv.org/abs/2306.06023,,https://huggingface.co/papers/2306.06023,,,,12,0
|
576 |
DETRs with Collaborative Hybrid Assignments Training,"Zong, Zhuofan*; Song, Guanglu; Liu, Yu",poster,2211.12860,https://arxiv.org/abs/2211.12860,https://github.com/Sense-X/Co-DETR,https://huggingface.co/papers/2211.12860,,,,3,0
|
577 |
Open Vocabulary Object Detection With an Open Corpus,"Wang, Jiong*; zhang, huiming; Hong, Haiwen; Jin, Xuan; He, Yuan; xue, hui; Zhao, Zhou",poster,,,,,,,,,
|
578 |
+
SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining,"Suri, Saksham*; Rambhatla, Sai Saketh ; Chellappa, Rama; Shrivastava, Abhinav",poster,2201.04620,https://arxiv.org/abs/2201.04620,,https://huggingface.co/papers/2201.04620,,,,4,2
|
579 |
Unsupervised Anomaly Detection with Diffusion Probabilistic Model,"Zhang, Xinyi*; Li, Naiqi; Li, Jiawei; Dai, Tao; Jiang, Yong; Xia, Shu-Tao",poster,,,,,,,,,
|
580 |
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation,"Wang, Haiyang*; Tang, Hao; Shi, Shaoshuai; Li, Aoxue; Li, Zhenguo; Schiele, Bernt; Wang, Liwei",poster,,,,,,,,,
|
581 |
Focus the Discrepancy: Intra- and Inter-Correlation Learning for Image Anomaly Detection,"Yao, Xincheng*; Li, Ruoqi; Qian, Zefeng; Luo, Yan; Zhang, Chongyang",poster,,,,,,,,,
|
|
|
626 |
Deep Geometrized Cartoon Line Inbetweening,"Siyao, Li*; Gu, Tianpei; Xiao, Weiye; Ding, Henghui; Liu, Ziwei; Loy, Chen Change",poster,,,,,,,,,
|
627 |
UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation,"Fu, Jianglin; Li, Shikai; Jiang, Yuming; Lin, Kwan-Yee; Wu, Wayne*; Liu, Ziwei",poster,,,,,,,,,
|
628 |
Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond ,"Zhao, Yang*; Hou, Tingbo; Su, Yu-Chuan; Jia, Xuhui; Li, Yandong; Grundmann, Matthias",poster,,,,,,,,,
|
629 |
+
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning,"Han, Ligong*; Li, Yinxiao; Zhang, Han; Milanfar, Peyman; Metaxas, Dimitris N.; Yang, Feng",poster,2303.11305,https://arxiv.org/abs/2303.11305,,https://huggingface.co/papers/2303.11305,,,,6,1
|
630 |
MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices,"Sargsyan, Andranik*; Navasardyan, Shant; Xu, Xingqian; Shi, Humphrey",poster,,,,,,,,,
|
631 |
Structure and Content-Guided Video Synthesis with Diffusion Models,"Esser, Patrick*; Chiu, Johnathan; Atighehchian, Parmida PA; Granskog, Jonathan; Germanidis, Anastasis",poster,2302.03011,https://arxiv.org/abs/2302.03011,,https://huggingface.co/papers/2302.03011,,,,5,0
|
632 |
Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation,"Jiang, Yuxin; Jiang, Liming*; Yang, Shuai; Loy, Chen Change",poster,2308.12968,https://arxiv.org/abs/2308.12968,,https://huggingface.co/papers/2308.12968,,,,4,1
|
|
|
706 |
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval,"Li, Pandeng*; Xie, Chen-Wei; Zhao, Liming; Xie, Hongtao; Ge, Jiannan; Zheng, Yun; Zhao, Deli; Zhang, Yongdong",oral,,,,,,,,,
|
707 |
Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning,"He, Junwen*; Wang, Yifan; Wang, Lijun; Lu, Huchuan; Luo, Bin; He, Jun-Yan; Lan, Jin-Peng; Geng, Yifeng; Xie, Xuansong",oral,2307.14786,https://arxiv.org/abs/2307.14786,,https://huggingface.co/papers/2307.14786,,,,9,1
|
708 |
LogicSeg: Parsing Visual Semantics with Neural Logic Learning and Reasoning,"Li, Liulei; Wang, Wenguan*; Yang, Yi",oral,,,,,,,,,
|
709 |
+
ASIC: Aligning Sparse in-the-wild Image Collections,"Gupta, Kamal*; Jampani, Varun; Shrivastava, Abhinav; Makadia, Ameesh; Snavely, Noah; Esteves, Carlos; Kar, Abhishek",oral,2303.16201,https://arxiv.org/abs/2303.16201,,https://huggingface.co/papers/2303.16201,,,,7,1
|
710 |
CLIPascene: Scene Sketching with Different Types and Levels of Abstraction,"Vinker, Yael*; Alaluf, Yuval; Cohen-Or, Danny; Shamir, Ariel",oral,2211.17256,https://arxiv.org/abs/2211.17256,,https://huggingface.co/papers/2211.17256,,,,4,0
|
711 |
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation,"PNVR, Koutilya*; Singh, Bharat; Ghosh, Pallabi; Jacobs, David; Siddiquie, Behjat",oral,,,,,,,,,
|
712 |
TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models,"Cao, Tianshi*; Kreis, Karsten; Fidler, Sanja; Sharp, Nicholas; Yin, Kangxue",oral,,,,,,,,,
|
|
|
725 |
Semi-supervised Semantics-guided Adversarial Training for Robust Trajectory Prediction,"Jiao, Ruochen*; Liu, Xiangguo; SATO, TAKAMI; Chen, Alfred; Qi, Zhu",poster,,,,,,,,,
|
726 |
NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping,"DENG, Junyuan; Wu, Qi; Chen, Xieyuanli*; Xia, Songpengcheng; Sun, Zhen; Liu, Guoqing; Yu, Wenxian; Pei, Ling",poster,,,,,,,,,
|
727 |
MapPrior: A Generative Approach for Birds-Eye View Perception,"Zhu, Xiyue*; Zyrianov, Vlas; Liu, Zhijian; Wang, Shenlong",poster,,,,,,,,,
|
728 |
+
Hidden Biases of End-to-End Driving Models,"Jaeger, Bernhard*; Chitta, Kashyap; Geiger, Andreas",poster,2306.07957,https://arxiv.org/abs/2306.07957,,https://huggingface.co/papers/2306.07957,,,,3,2
|
729 |
Search for or Navigate to? Dual Adaptive Thinking for Object Navigation,"Dang, Ronghao*; Wang, Liuyi; He, Zongtao; Su, Shuai; Tang, Jiagui; Liu, Chengju; Chen, Qijun",poster,2208.00553,https://arxiv.org/abs/2208.00553,,https://huggingface.co/papers/2208.00553,,,,6,0
|
730 |
Segmenting Known Objects and Unseen Unknowns without Prior Knowledge,"Gasperini, Stefano*; Marcos-Ramiro, Alvaro; Schmidt, Michael; Navab, Nassir; Busam, Benjamin ; Tombari, Federico",poster,2209.05407,https://arxiv.org/abs/2209.05407,,https://huggingface.co/papers/2209.05407,,,,6,1
|
731 |
BiFF: Bi-level Future Fusion with Polyline-based Coordinate for Interactive Trajectory Prediction,"ZHU, Yiyao*; LUAN, Di; Shen, Shaojie",poster,2306.14161,https://arxiv.org/abs/2306.14161,,https://huggingface.co/papers/2306.14161,,,,3,0
|
|
|
814 |
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection,"Zhang, Renrui*; Qiu, Han; Wang, Tai; Guo, Ziyu; Cui, Ziteng; Gao, Peng; Qiao, Yu; Li, Hongsheng",poster,2203.13310,https://arxiv.org/abs/2203.13310,https://github.com/ZrrSkywalker/MonoDETR,https://huggingface.co/papers/2203.13310,,,,9,0
|
815 |
ReLeaPS : Reinforcement Learning-based Illumination Planning for Generalized Photometric Stereo,"Chan, Jun Hoong*; Yu, Bohan; Guo, Heng; Ren, Jieji; Lu, Zongqing; Shi, Boxin",poster,,,,,,,,,
|
816 |
Convex Decomposition of Indoor Scenes,"Vavilala, Vaibhav S*; Forsyth, David",poster,2307.04246,https://arxiv.org/abs/2307.04246,,https://huggingface.co/papers/2307.04246,,,,2,0
|
817 |
+
NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes,"Irshad, Muhammad Zubair*; Zakharov, Sergey; Liu, Katherine; Guizilini, Vitor; Kollar, Thomas; Gaidon, Adrien; Ambru?, Rare? A; Kira, Zsolt",poster,2308.12967,https://arxiv.org/abs/2308.12967,https://github.com/zubair-irshad/NeO-360,https://huggingface.co/papers/2308.12967,,,8,8,1
|
818 |
UrbanGIRAFFE: Representing Urban Scenes as Compositional Generative Neural Feature Fields,"Yang, Yuanbo*; Yang, Yifei; Guo, Hanlei; Xiong, Rong; Wang, Yue; Liao, Yiyi",poster,2303.14167,https://arxiv.org/abs/2303.14167,,https://huggingface.co/papers/2303.14167,,,,6,0
|
819 |
Efficient Converted Spiking Neural Network for 3D and 2D classification,"Lan, Yuxiang; Zhang, Yachao; Ma, Xu; Qu, Yanyun*; FU, YUN",poster,,,,,,,,,
|
820 |
Distribution-Aligned Diffusion for Human Mesh Recovery,"Foo, Lin Geng*; Gong, Jia; Rahmani, Hossein; Liu, Jun",poster,2308.13369,https://arxiv.org/abs/2308.13369,,https://huggingface.co/papers/2308.13369,,,,4,0
|
|
|
893 |
EigenTrajectory: Low-Rank Descriptors for Multi-Modal Trajectory Forecasting,"Bae, Inhwan*; Oh, Jean; Jeon, Hae-Gon",poster,2307.09306,https://arxiv.org/abs/2307.09306,https://github.com/inhwanbae/EigenTrajectory,https://huggingface.co/papers/2307.09306,,,,3,1
|
894 |
RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation,"Wan, Zhexiong; Mao, Yuxin; Zhang, Jing; Dai, Yuchao*",poster,,,,,,,,,
|
895 |
Multi-Scale Bidirectional Recurrent Network with Hybrid Correlation for Point Cloud Based Scene Flow Estimation,"CHENG, WENCAN; Ko, Jong Hwan*",poster,,,,,,,,,
|
896 |
+
ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking,"Cheng, Cheng-Che*; Qiu, Min-Xuan; Chiang, Chen-Kuo; Lai, Shang-Hong",poster,2308.13229,https://arxiv.org/abs/2308.13229,,https://huggingface.co/papers/2308.13229,,,,4,1
|
897 |
"TAPIR: Tracking Any Point, Initialized per-frame, Refined temporally","Doersch, Carl*; Yang, Yi; Vecerik, Mel; Gokay, Dilara; Gupta, Ankush; Aytar, Yusuf; Carreira, Joao; Zisserman, Andrew",poster,,,,,,,,,
|
898 |
IHNet: Iterative Hierarchical Network Guided by High-Resolution Estimated Information for Scene Flow,"Wang, Yun*; Chi, Cheng; Lin, Min; Yang, Xin",poster,,,,,,,,,
|
899 |
Can Language Models Transfer to Social Gesture Motion Generation?,"Ng, Evonne*; Subramanian, Sanjay; Klein, Dan; Kanazawa, Angjoo; Darrell, Trevor; Ginosar, Shiry",poster,,,,,,,,,
|
|
|
954 |
Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction,"Qu, Delin*; Lao, Yizhen; Wang, Zhigang; Wang, Dong; Zhao, Bin; Li, Xuelong",poster,2303.18125,https://arxiv.org/abs/2303.18125,https://github.com/DelinQu/qrsc,https://huggingface.co/papers/2303.18125,,,,6,0
|
955 |
FCCNs: Fully Complex-valued Convolutional Networks using Complex-valued Color Model and Loss Function,"Yadav, Saurabh*; Jerripothula, Koteswar Rao",poster,,,,,,,,,
|
956 |
Event Camera Data Pre-training,"Yang, Yan*; Pan, Liyuan; liu, Liu",poster,2301.01928,https://arxiv.org/abs/2301.01928,,https://huggingface.co/papers/2301.01928,,,,3,0
|
957 |
+
Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models,"Lee, Suhyeon*; Chung, Hyungjin; Park, Min Young; Park, Jonghyeok; Ryu, Wi-Sun; Ye, Jong Chul",poster,2303.08440,https://arxiv.org/abs/2303.08440,,https://huggingface.co/papers/2303.08440,,,,6,1
|
958 |
Multiscale Structure Guided Diffusion for Image Deblurring,"Ren, Mengwei*; Delbracio, Mauricio; Talebi, Hossein ; Gerig, Guido; Milanfar, Peyman",poster,2212.01789,https://arxiv.org/abs/2212.01789,,https://huggingface.co/papers/2212.01789,,,,5,0
|
959 |
Generalizing Event-Based Motion Deblurring in Real-World Scenarios,"Zhang, Xiang; Yu, Lei*; Yang, Wen; Liu, Jianzhuang; Xia, Gui-Song",poster,2308.05932,https://arxiv.org/abs/2308.05932,,https://huggingface.co/papers/2308.05932,,,,5,0
|
960 |
On the Robustness of Normalizing Flows for Inverse Problems in Imaging,"Hong, Seongmin; PARK, INBUM; Chun, Se Young*",poster,2212.04319,https://arxiv.org/abs/2212.04319,,https://huggingface.co/papers/2212.04319,,,,3,0
|
|
|
975 |
Grounding 3D Object Affordance from 2D Interactions in Images,"Yang, Yuhang; Zhai, Wei; Luo, Hongchen; Cao, Yang*; Luo, Jiebo; Zha, Zheng-Jun",poster,2303.10437,https://arxiv.org/abs/2303.10437,https://github.com/yyvhang/IAGNet,https://huggingface.co/papers/2303.10437,,,,6,0
|
976 |
Navigating to Objects Specified by Images,"Krantz, Jacob*; Gervet, Theophile; Yadav, Karmesh; Wang, Austin S; Paxton, Chris; Mottaghi, Roozbeh; Batra, Dhruv; Malik, Jitendra; Lee, Stefan; Chaplot, Devendra Singh",poster,2304.01192,https://arxiv.org/abs/2304.01192,,https://huggingface.co/papers/2304.01192,,,,10,1
|
977 |
PEANUT: Predicting and Navigating to Unseen Targets,"Zhai, Albert J*; Wang, Shenlong",poster,2212.02497,https://arxiv.org/abs/2212.02497,,https://huggingface.co/papers/2212.02497,,,,2,0
|
978 |
+
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents,"Kim, Byeonghwi; kim, jinyeon; Kim, yuyeong; Min, Cheolhong; Choi, Jonghyun*",poster,2308.07241,https://arxiv.org/abs/2308.07241,,https://huggingface.co/papers/2308.07241,,,,5,1
|
979 |
Learning Foresightful Dense Visual Affordance for Deformable Object Manipulation,"Wu, Ruihai; Ning, Chuanruo; Dong, Hao*",poster,2303.11057,https://arxiv.org/abs/2303.11057,,https://huggingface.co/papers/2303.11057,,,,3,0
|
980 |
Exploiting Proximity-Aware Tasks for Embodied Social Navigation,"Cancelli, Enrico; Campari, Tommaso; Serafini, Luciano; Chang, Angel X; Ballan, Lamberto*",poster,2212.00767,https://arxiv.org/abs/2212.00767,,https://huggingface.co/papers/2212.00767,,,,5,0
|
981 |
Object-Aware Cognitive BirdÂs-Eye-View Grids for Vision-Language Navigation,"Liu, Rui; Wang, Xiaohan; Wang, Wenguan; Yang, Yi*",poster,,,,,,,,,
|
|
|
1002 |
DeepChange: A Long-Term Person Re-Identification Benchmark with Clothes Change,"Xu, Peng*; Zhu, Xiatian",poster,,,,,,,,,
|
1003 |
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval,"Luo, Ziyang*; Zhao, Pu; Xu, Can; Geng, Xiubo; Shen, Tao; Tao, Chongyang; Ma, Jing; Lin, Qingwei; Jiang, Daxin",poster,,,,,,,,,
|
1004 |
Dual Pseudo-Labels Interactive Self-Training for Semi-Supervised Visible-Infrared Person Re-Identification,"Shi, Jiangming; Zhang, Yachao; Yin, Xiangbo; Xie, Yuan; Zhang, Zhizhong; Fan, Jianping; shi, zhongchao; Qu, Yanyun*",poster,,,,,,,,,
|
1005 |
+
$BT^2$: Backward-compatible Training with Basis Transformation,"Zhou, Yifei*; Li, Zilu; Shrivastava, Abhinav; Zhao, Hengshuang; Torralba, Antonio; Tian, Tai-Peng; Lim, Ser-Nam",poster,2211.03989,https://arxiv.org/abs/2211.03989,,https://huggingface.co/papers/2211.03989,,,,7,1
|
1006 |
Prototypical Mixup and Retrieval-based Refinement for Label Noise-resistant Image Retrieval,"Yang, Xinlong*; Wang, Haixin; Sun, Jinan; Zhang, Shikun; Chen, Chong; Hua, Xian-Sheng; Luo, Xiao",poster,,,,,,,,,
|
1007 |
Learning Spatial-context-aware Global Visual Feature Representation for Instance Image Retrieval,"Zhang, Zhongyan*; Wang, Lei; Zhou, Luping; Koniusz, Piotr",poster,,,,,,,,,
|
1008 |
Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval,"zhu, yunquan*; Gao, Xinkai; Ke, Bo; Qiao, Ruizhi; Sun, Xing",poster,,,,,,,,,
|
|
|
1218 |
COOL-CHIC: Coordinate-based Low Complexity Hierarchical Image Codec,"Ladune, Théo*; Philippe, Pierrick; Henry, Felix E; clare, gordon; Leguay, Thomas",poster,,,,,,,,,
|
1219 |
ReGen: A good Generative zero-shot video classifier should be Rewarded,"Bulat, Adrian*; Sanchez, Enrique; Martinez, Brais; Tzimiropoulos, Georgios",poster,,,,,,,,,
|
1220 |
Task Agnostic Restoration of Natural Video Dynamics,"Ali, Muhammad Kashif; Kim, Dongjin; Kim, Tae Hyun*",poster,2206.03753,https://arxiv.org/abs/2206.03753,https://github.com/MKashifAli/TARONVD,https://huggingface.co/papers/2206.03753,,,,3,0
|
1221 |
+
Normalizing Flows for Human Pose Anomaly Detection,"Hirschorn, Or*; Avidan, Shai",poster,2211.10946,https://arxiv.org/abs/2211.10946,,https://huggingface.co/papers/2211.10946,,,,2,1
|
1222 |
Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection,"Zhao, Zixuan; Wang, Dongqi; Zhao, Xu*",poster,,,,,,,,,
|
1223 |
Event-Guided Procedure Planning from Instructional Videos with Text Supervision,"Wang, An-Lan; Lin, Kun-Yu; Du, Jia-Run; Meng, Jingke; ZHENG, WEI-SHI*",poster,2308.08885,https://arxiv.org/abs/2308.08885,,https://huggingface.co/papers/2308.08885,,,,5,0
|
1224 |
SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval,"Yoon, Sunjae*; Koo, GwanHyeong; Kim, DaHyun; Yoo, Chang D.",poster,,,,,,,,,
|
|
|
1302 |
SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation,"Koo, Juil*; Yoo, Seungwoo; Nguyen, Hieu Minh; Sung, Minhyuk",poster,2303.12236,https://arxiv.org/abs/2303.12236,,https://huggingface.co/papers/2303.12236,,,,4,0
|
1303 |
MAPConNet: Self-supervised 3D Pose Transfer with Mesh and Point Contrastive Learning,"Sun, Jiaze*; Chen, Zhixiang; Kim, Tae-Kyun (T-K)",poster,2304.13819,https://arxiv.org/abs/2304.13819,,https://huggingface.co/papers/2304.13819,,,,3,0
|
1304 |
Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition,"Yi, Xuanyu*; Deng, Jiajun; Sun, Qianru; Hua, Xian-Sheng; Lim, Joo-Hwee; Zhang, Hanwang",poster,2308.09694,https://arxiv.org/abs/2308.09694,,https://huggingface.co/papers/2308.09694,,,,6,0
|
1305 |
+
EPiC: Ensemble of Partial Point Clouds for Robust Classification,"Levi, Meir Yossef*; Gilboa, Guy",poster,2303.11419,https://arxiv.org/abs/2303.11419,https://github.com/yossilevii100/EPiC,https://huggingface.co/papers/2303.11419,,,,2,1
|
1306 |
Leveraging Intrinsic Properties for Non-Rigid Garment Alignment,"Lin, Siyou; ZHOU, Boyao; Zheng, Zerong; Zhang, Hongwen; Liu, Yebin*",poster,2308.09519,https://arxiv.org/abs/2308.09519,,https://huggingface.co/papers/2308.09519,,,,5,0
|
1307 |
Spatially and Spectrally Consistent Deep Functional Maps,"Sun, Mingze; Mao, Shiwei; Jiang, Puhua; Ovsjanikov, Maks; Huang, Ruqi*",poster,2308.08871,https://arxiv.org/abs/2308.08871,https://github.com/rqhuang88/Spatiallyand-Spectrally-Consistent-Deep-Functional-Maps,https://huggingface.co/papers/2308.08871,,,,5,0
|
1308 |
SVDFormer: Complementing Point Cloud via Self-view Augmentation and Self-structure Dual-generator,"Zhu, Zhe*; Chen, Honghua; He, Xing; Wang, Weiming; Qin, Jing; Wei, Mingqiang",poster,2307.08492,https://arxiv.org/abs/2307.08492,https://github.com/czvvd/SVDFormer,https://huggingface.co/papers/2307.08492,,,,6,0
|
|
|
1342 |
Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction,"Leng, Zhiying*; Wu, shuncheng; Saleh, Mahdi; Montanaro, Antonio; Yu, Hao; Wang, Yin; Navab, Nassir; Liang, Xiaohui; Tombari, Federico",poster,,,,,,,,,
|
1343 |
Human from Blur: Human Pose Tracking from Blurry Images,"Zhao, Yiming*; Rozumnyi, Denys; Song, Jie; Hilliges, Otmar; Pollefeys, Marc; Oswald, Martin R.",poster,2303.17209,https://arxiv.org/abs/2303.17209,,https://huggingface.co/papers/2303.17209,,,,6,0
|
1344 |
AG3D: Learning to Generate 3D Avatars from 2D Image Collections,"Dong, Zijian*; Chen, Xu; Yang, Jinlong; Black, Michael J.; Hilliges, Otmar; Geiger, Andreas",poster,2305.02312,https://arxiv.org/abs/2305.02312,,https://huggingface.co/papers/2305.02312,,,,6,0
|
1345 |
+
InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion,"Xu, Sirui*; Li, Zhengyuan; Wang, Yu-Xiong; Gui, Liangyan",poster,2308.16905,https://arxiv.org/abs/2308.16905,https://github.com/Sirui-Xu/InterDiff,https://huggingface.co/papers/2308.16905,,,,4,1
|
1346 |
SEFD: Learning to Distill Complex Pose and Occlusion,"Yang, ChangHee*; Kong, Kyeongbo; Min, Sung-Jun; Wee, Dongyoon; Jang, Ho-Deok; Cha, Geonho; Kang, Suk-Ju",poster,,,,,,,,,
|
1347 |
3D Human Mesh Recovery with Sequentially Global Rotation Estimation,"Wang, Dongkai; Zhang, Shiliang*",poster,,,,,,,,,
|
1348 |
Co-Evolution of Pose and Mesh for 3D Human Body Estimation from Video,"You, Yingxuan*; Liu, Hong; Wang, Ti; Li, Wenhao; Ding, Runwei; Li, Xia",poster,2308.10305,https://arxiv.org/abs/2308.10305,https://github.com/kasvii/PMCE,https://huggingface.co/papers/2308.10305,,,,6,0
|
|
|
1548 |
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers,"Chen, Mengzhao*; Shao, Wenqi; Xu, Peng; Lin, Mingbao; Zhang, Kaipeng; Chao, Fei; Ji, Rongrong; Qiao, Yu; Luo, Ping",poster,2305.17997,https://arxiv.org/abs/2305.17997,https://github.com/OpenGVLab/DiffRate,https://huggingface.co/papers/2305.17997,,,,9,0
|
1549 |
Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection,"Yang, Longrong; Zhou, Xianpan; Li, Xuewei; Qiao, Liang; Li, Zheyang; Yang, Ziwei; Wang, Gaoang; Li, Xi*",poster,2308.14286,https://arxiv.org/abs/2308.14286,https://github.com/TinyTigerPan/BCKD,https://huggingface.co/papers/2308.14286,,,,8,0
|
1550 |
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels,"Yang, Zhendong*; Zeng, Ailing; Li, Zhe; Zhang, Tianke; Yuan, Chun; Li, Yu",poster,2303.13005,https://arxiv.org/abs/2303.13005,https://github.com/yzd-v/cls_KD,https://huggingface.co/papers/2303.13005,,,,6,1
|
1551 |
+
Efficient 3D Semantic Segmentation with Superpoint Transformer,"ROBERT, Damien*; Raguet, Hugo; Landrieu, Loic",poster,2306.08045,https://arxiv.org/abs/2306.08045,,https://huggingface.co/papers/2306.08045,,,,3,2
|
1552 |
Dataset Quantization,"Zhou, Daquan; Wang, Kai*; Gu, Jianyang; Peng, Xiangyu; Lian, Dongze; Zhang, Yifan; You, Yang; Feng, Jiashi",poster,2308.10524,https://arxiv.org/abs/2308.10524,,https://huggingface.co/papers/2308.10524,,,,8,0
|
1553 |
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy,"Jie, Shibo*; Wang, Haoqing; Deng, Zhi-Hong",poster,2307.16867,https://arxiv.org/abs/2307.16867,https://github.com/JieShibo/PETL-ViT,https://huggingface.co/papers/2307.16867,,,,3,0
|
1554 |
RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers,"Li, Zhikai*; Xiao, Junrui; Yang, Lianwei; Gu, Qingyi",poster,,,,,,,,,
|
1555 |
Semantically Structured Image Compression via Irregular Group-Based Decoupling,"Feng, Ruoyu*; Gao, Yixin; Jin, Xin; Feng, Runsen; Chen, Zhibo",poster,2305.02586,https://arxiv.org/abs/2305.02586,,https://huggingface.co/papers/2305.02586,,,,5,0
|
1556 |
SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage,"Park, Song; Chun, Sanghyuk*; Heo, Byeongho; Kim, Wonjae; Yun, Sangdoo",poster,2303.11114,https://arxiv.org/abs/2303.11114,https://github.com/naver-ai/seit,https://huggingface.co/papers/2303.11114,,,,5,1
|
1557 |
SMMix: Self-Motivated Image Mixing for Vision Transformers,"Chen, Mengzhao*; Lin, Mingbao; Lin, Zhihang; Zhang, Yuxin; Chao, Fei; Ji, Rongrong",poster,2212.12977,https://arxiv.org/abs/2212.12977,https://github.com/ChenMnZ/SMMix,https://huggingface.co/papers/2212.12977,,,,6,0
|
1558 |
+
Multi-Label Knowledge Distillation,"Yang, Penghui*; Xie, Ming-Kun; Zong, Chen-Chen; Feng, Lei; Niu, Gang; Sugiyama, Masashi; Huang, Sheng-Jun",poster,2308.06453,https://arxiv.org/abs/2308.06453,https://github.com/penghui-yang/L2D,https://huggingface.co/papers/2308.06453,,,,7,1
|
1559 |
UGC: Unified GAN Compression for Efficient Image-to-Image Translation ,"Ren, Yuxi*; Wu, Jie; Zhang, Peng; Zhang, Manlin; Xiao, Xuefeng; He, Qian; Wang, Rui; Zheng, Min ; Pan, Xin",poster,,,,,,,,,
|
1560 |
MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos,"Parger, Mathias*; Tang, Chengcheng; Neff, Thomas; Twigg, Christopher D; Keskin, Cem; Wang, Robert; Steinberger, Markus",poster,2210.09887,https://arxiv.org/abs/2210.09887,,https://huggingface.co/papers/2210.09887,,,,7,0
|
1561 |
Lightweight Multi-Scale Attention for On-Device Semantic Segmentation,"Cai, Han*; Li, Junyan; Hu, Muyan; Gan, Chuang; Han, Song",poster,,,,,,,,,
|
|
|
1781 |
Improving Unsupervised Visual Program Inference with Code Rewriting Families,"Ganeshan, Aditya*; Jones, R. Kenny; Ritchie, Daniel",oral,,,,,,,,,
|
1782 |
Denoising Diffusion Autoencoders are Unified Self-supervised Learners,"Xiang, Weilai; Yang, Hongyu*; Huang, Di; Wang, Yunhong",oral,2303.09769,https://arxiv.org/abs/2303.09769,,https://huggingface.co/papers/2303.09769,,,,4,0
|
1783 |
Self-Ordering Point Clouds,"Yang, Pengwan*; Snoek, Cees; Asano, Yuki M",oral,2304.00961,https://arxiv.org/abs/2304.00961,,https://huggingface.co/papers/2304.00961,,,,3,0
|
1784 |
+
MOST: Multiple Object localization with Self-supervised Transformers for object discovery,"Rambhatla, Sai Saketh *; Misra, Ishan; Chellappa, Rama; Shrivastava, Abhinav",oral,2304.05387,https://arxiv.org/abs/2304.05387,,https://huggingface.co/papers/2304.05387,,,,4,2
|
1785 |
Self-supervised Learning for 3D Human-Object Spatial Relations from Unbounded Synthesized Images,"Han, Sookwan*; Joo, Hanbyul",oral,,,,,,,,,
|
1786 |
Identity-Seeking Self-Supervised Representation Learning for Generalizable Person Re-identification,"Dou, Zhaopeng*; Wang, Zhongdao; Li, Ya-Li; Wang, Shengjin",oral,2308.08887,https://arxiv.org/abs/2308.08887,https://github.com/dcp15/ISR_ICCV2023_Oral,https://huggingface.co/papers/2308.08887,,,,4,0
|
1787 |
Anatomical Invariance Modeling and Semantic Alignment for Self-supervised Learning in 3D Medical Image Analysis,"Jiang, Yankai*; Sun, Mingze; Guo, Heng; Bai, Xiaoyu; Yan, Ke; Lu, Le; Xu, Minfeng",oral,2302.05615,https://arxiv.org/abs/2302.05615,https://github.com/alibaba-damo-academy/alice,https://huggingface.co/papers/2302.05615,,,,7,0
|