A Swin-Transformer-based backbone and a pixel-focus loss function are proposed to effectively demosaic event camera RAW images with missing pixel values.