Integrating Vision Language and Foundation Models for Automated Estimation of Building Lowest Floor Elevation from Street View Imagery
This study integrates the Segment Anything model, a segmentation foundation model, with vision language models to conduct text-prompt image segmentation on street view images for automated estimation of building lowest floor elevation (LFE). The proposed method significantly enhances the availability of LFE estimation compared to the existing model.