Zhi, H. (2026). Cross-Modal Contrastive Representation Learning for Multimedia Retrieval with Noisy Supervision. EAI Endorsed Transactions on Scalable Information Systems, 12(9). https://doi.org/10.4108/eetsis.10757