Core Concepts
Capturing cell heterogeneity in cell population representations can significantly improve the performance of image-based cell profiling for mechanism of action prediction.
Abstract
The article introduces CytoSummaryNet, a Deep Sets-based approach that uses self-supervised contrastive learning in a multiple-instance learning framework to improve image-based cell profiling. The key insights are:
Typical cell profiling methods represent a sample by averaging across cells, failing to capture the heterogeneity within cell populations. CytoSummaryNet addresses this by learning a representation that preserves the diversity of single-cell features.
CytoSummaryNet achieves a 30-68% improvement in mean average precision for mechanism of action prediction compared to average profiling on a public dataset.
Interpretability analysis suggests the model achieves this by downweighting small mitotic cells or those with debris, and prioritizing large uncrowded cells.
The approach requires only perturbation labels for training, which are readily available in all cell profiling datasets, making it an easy-to-apply method for aggregating single-cell feature data.
CytoSummaryNet offers a straightforward post-processing step for single-cell profiles that can significantly boost retrieval performance on image-based profiling datasets.
Stats
CytoSummaryNet achieves a 30-68% improvement in mean average precision for mechanism of action prediction compared to average profiling on a public dataset.
Quotes
"Typical cell profiling methods represent a sample by averaging across cells, failing to capture the heterogeneity within cell populations."
"CytoSummaryNet achieves this improvement by downweighting small mitotic cells or those with debris and prioritizing large uncrowded cells."