A 3D Self Awareness Diffusion Network for Multimodal Classification
Learning Shape Color Diffusion Priors for Text Guided 3D Object Generation
Text2Avatar Articulated 3D Avatar Creation With Text Instructions
Cross Modal Progressive Perspective Matching Network for Remote Sensing Image Te...
S3GAAR Segmented Spatiotemporal Skeleton Graph Attention for Action Recognition
Efficient Chroma Intra Prediction via Exemplar Colorization Network for Versatil...
Rectangling for Stitched Image via Pixel Wise Deformation Learning
Facial Action Units as a Joint Dataset Training Bridge for Facial Expression Rec...
TITFormer Combining Textual Modality and Simulating Infrared Modality Based on T...
Learning Intrinsic Invariance Within Intra Class for Domain Generalization
Adaptive Complex Wavelet Informed Transformer Operator
CLIP AE A Multi Modal Unsupervised Images Enhancement Method Based on High Order...
Prune and Merge Efficient Token Compression for Vision Transformer With Spatial ...
Detecting Adversarial Attacks Based on Tracking Differences in Frequency Bands
DetailRecon Focusing on Detailed Regions for Online Monocular 3D Reconstruction
Eliminating Moiré Patterns Across Diverse Image Resolutions via DMMNet