//=time() ?>
Introducing ERNIE-ViLG 2.0, a 24B text-to-image model with knowledge-enhanced Mixture-of-Denoising-Experts that obtains SOTA results on MS-COCO. This is also the largest text-to-image model at present.
Paper: https://t.co/YpSDABMvPI
Demo @huggingface: https://t.co/DrplqfSysk