Conceitos essenciais
Gaussian Pancakes synergizes 3D Gaussian Splatting with a colonoscopy-tailored SLAM system to generate high-quality, photorealistic 3D reconstructions of the colon from endoscopic video, enabling improved diagnosis and treatment of colorectal cancer.
Resumo
The paper introduces "Gaussian Pancakes", a method that combines 3D Gaussian Splatting (3D GS) with a Recurrent Neural Network-based Simultaneous Localization and Mapping (RNNSLAM) system to generate accurate and photorealistic 3D reconstructions of the colon from endoscopic video.
Key highlights:
- Integrates RNNSLAM to provide robust camera poses, depth maps, and a rudimentary surface reconstruction, addressing the limitations of traditional Structure-from-Motion (SfM) methods.
- Improves the base 3D GS method by incorporating geometric and depth regularizations, effectively "pancaking" the Gaussians to align them with the colon surface and reduce artifacts in novel view synthesis.
- Optimizes the training and rendering process for radiance fields in surgical scenes, reducing training times to around 2 minutes and improving image rendering speeds by over 100 times.
- Evaluations across three diverse datasets show that Gaussian Pancakes outperforms current leading methods in novel view synthesis quality, with an 18% boost in PSNR and a 16% improvement in SSIM.
- The explicit geometry and real-time rendering capabilities of Gaussian Pancakes make it a promising tool for practical applications in colorectal cancer diagnosis and treatment, such as automated surgical path planning, AR/VR training environments, and AI-based polyp detection.
Estatísticas
Colorectal cancer remains a major global health challenge, consistently ranking among the top three cancers in prevalence and mortality.
Up to three-quarters of all missed polyp cases are due to polyps hidden behind folds.
Colonoscopists encounter difficulties in thorough surface inspection due to the limited field of view and lack of 3D information.
Citações
"Generating full 3D reconstructions with high quality textures from endoscopic images in near real-time would enable improved diagnosis and treatment through downstream tasks like automated surgical path planning, AR/VR training environments, AI-based polyp detection, and for determining missing regions for re-inspection."
"NeRF and NeuS, including their colonoscopy-specific adaptations ColonNeRF, REIM-NeRF and LightNeuS, are limited by lengthy training times, slow rendering speeds, and a lack of explicit geometry, which complicates integration into practical workflows."