Fang, S. (2024). A Comprehensive Survey of Text Encoders for Text-to-Image Diffusion Models. EAI Endorsed Transactions on AI and Robotics, 3. https://doi.org/10.4108/airo.5566