When inpainting, the masked area is affected by the unmasked area. This script creates a 1024x512 or 1536x512 image and masks the current frame area to create a 512x512 image.
Since the test was designed based on 2D characters, realistic checkpoints do not work well. To use it with a real model, it is necessary to adjust the size and position of the eyes and mouth.