PMMTalk Speech Driven 3D Facial Animation From Complementary Pseudo Multi Modal ...
FER Former Multimodal Transformer for Facial Expression Recognition
Focus Entirety and Perceive Environment for Arbitrary Shaped Text Detection
Token Masking Transformer for Weakly Supervised Object Localization
A Mobile Image Driven PM25 Estimation Framework Using Deep Learning Techniques
RHLS A Robust Hybrid Level Set Model Using Global Local Signed Energy Based Pres...
Explain Vision Focus Blending Human Saliency Into Synthetic Face Images
Implicit and Explicit Language Guidance for Diffusion Based Visual Perception
Adaptive Knowledge Distillation With Attention Based Multi Modal Fusion for Robu...
Progressive Pseudo Labeling for Multi Dataset Detection Over Unified Label Space
Scene Text Image Super Resolution Via Semantic Distillation and Text Perceptual ...
Unsupervised Low Light Image Enhancement With Self Paced Learning
Unleash the Power of Vision Language Models by Visual Attention Prompt and Multi...
Progressive Region to Boundary Exploration Network for Camouflaged Object Detection
Masked Attribute Description Embedding for Cloth Changing Person Re Identification
Category Contrastive Fine Grained Crowd Counting and Beyond