Tags 3D1 AI18 Attention2 clip1 Diffusion7 domain2 fine-tuning1 GAN7 generative17 multi-modal1 NLP1 Score-based1 Transformer1 VAE1 vision-language2