Fang, S. (2024). A Comprehensive Survey of Text Encoders for Text-to-Image Diffusion Models. EAI Endorsed Transactions on AI and Robotics, 3. https://doi.org/10.4108/airo.5566