Suivre
Yiyuan Zhang
Yiyuan Zhang
Autres noms张 懿元
Adresse e-mail validée de ie.cuhk.edu.hk - Page d'accueil
Titre
Citée par
Citée par
Année
Meta-transformer: A Unified Framework for Multimodal Learning
Y Zhang, K Gong, K Zhang, H Li, Y Qiao, W Ouyang, X Yue
arXiv preprint arXiv:2307.10802, 2023
982023
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks
W Han, X Dong, Y Zhang, D Crandall, CZ Xu, J Shen
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
90*2024
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image Recognition
X Ding, Y Zhang, Y Ge, S Zhao, L Song, X Yue, Y Shan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
602024
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification
Y Zhang, S Zhao, Y Kang, J Shen
European Conference on Computer Vision (ECCV), 462-479, 2022
392022
OneLLM: One Framework to Align All Modalities with Language
J Han, K Gong, Y Zhang, J Wang, K Zhang, D Lin, Y Qiao, P Gao, X Yue
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
302024
Dual-Semantic Consistency Learning for Visible-Infrared Person Re-identification
Y Zhang, Y Kang, S Zhao, J Shen
IEEE Transactions on Information Forensics and Security 18, 1554-1565, 2022
262022
Unireplknet: A universal perception large-kernel convnet for audio, video, point cloud, time-series and image recognition. arXiv 2023
X Ding, Y Zhang, Y Ge, S Zhao, L Song, X Yue, Y Shan
arXiv preprint arXiv:2311.15599, 2024
162024
Online Vectorized HD Map Construction using Geometry
Z Zhang, Y Zhang, X Ding, F Jin, X Yue
European Conference on Computer Vision (ECCV), 2024
92024
Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors
L Ding, S Dong, Z Huang, Z Wang, Y Zhang, K Gong, D Xu, T Xue
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
52024
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
Y Zhang, Y Kang, Z Zhang, X Ding, S Zhao, X Yue
arXiv preprint arXiv:2402.03040, 2024
32024
Meta-transformer: A unified framework for multimodal learning. arXiv 2023
Y Zhang, K Gong, K Zhang, H Li, Y Qiao, W Ouyang, X Yue
arXiv preprint arXiv:2307.10802, 0
3
Multimodal pathway: Improve transformers with irrelevant data from other modalities
Y Zhang, X Ding, K Gong, Y Ge, Y Shan, X Yue
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
22024
Towards Unified and Effective Domain Generalization
Y Zhang, K Gong, X Ding, K Zhang, F Lv, K Keutzer, X Yue
arXiv preprint arXiv:2310.10008, 2023
12023
Explore the Limits of Omni-modal Pretraining at Scale
Y Zhang, H Li, J Liu, X Yue
arXiv preprint arXiv:2406.09412, 2024
2024
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–14