Core Concepts
Developing a streamlined methodology for building datasets and deriving accurate models for automatic taxonomic identification of the Portuguese native flora using deep learning.
Abstract
The content discusses the development of Floralens, a deep learning model for identifying species in the Portuguese native flora. It outlines the methodology used to construct datasets from various sources like FloraOn, iNaturalist, Pl@ntNet, and Observation.org. The process of training the model using Google's AutoML Vision cloud service is detailed, along with the evaluation metrics such as precision, recall, Top-1, Top-5, and Mean Reciprocal Rank (MRR). The results are compared with Pl@ntNet API models and integrated into web and mobile applications.
Directory:
Introduction
Importance of Citizen Science platforms.
Dataset Construction
Creation of Floralens dataset from various sources.
Model Derivation
Training process using GAMLV.
Model Evaluation
Evaluation metrics and comparison with Pl@ntNet API models.
Software Artifacts
Integration into Biolens website and mobile app.
Conclusions
Future work on improving species coverage and model accuracy.
Stats
4 million images available for consideration.
2,712 species covered in the FloraOn catalog.
300,000 images in the Floralens dataset.
Quotes
"Machine-learning techniques are pivotal for image-based identification of biological species."
"We find that off-the-shelf machine-learning cloud services produce accurate models with relatively little effort."