Abstract: Large-scale pretrained text-to-image models have made incredible progress recently. When synthesizing the appearance of subjects in given texts, existing works fine-tune pretrained models or ...