toplogo
Sign In
insight - Cancer biology - # Chromatin accessibility profiling and cell-type deconvolution of bulk tumor samples

Accurate Deconvolution of Bulk Tumor ATAC-Seq Data Reveals Cancer and Immune Cell Composition


Core Concepts
The authors developed a robust computational framework, EPIC-ATAC, that accurately predicts the proportions of cancer, immune, stromal, and vascular cells from bulk tumor ATAC-Seq data by integrating cell-type specific chromatin accessibility markers and reference profiles.
Abstract

The authors collected 564 ATAC-Seq samples from sorted cell populations covering major immune, stromal, and vascular cell types relevant to cancer. They used this dataset to identify cell-type specific chromatin accessibility marker peaks and build reference profiles for each cell type.

The authors then integrated these markers and profiles into the EPIC deconvolution framework to develop EPIC-ATAC, a tool for accurately predicting the proportions of cancer, immune, stromal, and vascular cells from bulk tumor ATAC-Seq data.

EPIC-ATAC was validated on PBMC and tumor samples, showing high accuracy in predicting cell-type fractions compared to other deconvolution tools. It was able to simultaneously quantify the proportions of uncharacterized cells (a proxy for cancer cells) as well as immune, stromal, and vascular cells.

When applied to a breast cancer cohort, EPIC-ATAC accurately inferred the immune contexture of the main breast cancer subtypes, revealing differences in immune cell infiltration between the subtypes.

The authors also provide the annotated cell-type specific marker peaks, which can be used to expand the list of known marker genes and transcription factors associated with each cell type. The marker peaks were also found to be enriched for pathways involved in immune responses to the tumor microenvironment.

Overall, the EPIC-ATAC framework enables robust deconvolution of bulk tumor ATAC-Seq data, supporting the analysis of regulatory processes underlying tumor development and the tumor microenvironment composition.

edit_icon

Customize Summary

edit_icon

Rewrite with AI

edit_icon

Generate Citations

translate_icon

Translate Source

visual_icon

Generate MindMap

visit_icon

Visit Source

Stats
"Venous blood from five healthy donors was collected at the local blood transfusion center of Geneva in Switzerland, under the approval of the Geneva University Hospital's Institute Review Board, upon written informed consent and in accordance with the Declaration of Helsinki." "We collected 564 samples of sorted cell populations from twelve studies including eight immune cell types (B cells, CD4+ T cells, CD8+ T cells, natural killer (NK) cells, dendritic cells (DCs), macrophages, monocytes and neutrophils), as well as fibroblasts and endothelial cells."
Quotes
"A key determinant for accurate predictions of cell-type proportions by most deconvolution tools is the availability of reliable cell-type specific markers and reference profiles." "EPIC-ATAC shows reliable predictions of immune, stromal, vascular and cancer cell proportions. With the expected increase of ATAC-Seq studies in cancer, the EPIC-ATAC framework will enable researchers to deconvolve bulk ATAC-Seq data from tumor samples to support the analysis of regulatory processes underlying tumor development, and correlate the TME composition with clinical variables."

Deeper Inquiries

How could the EPIC-ATAC framework be extended to incorporate additional cell types or subtypes beyond the ones considered in this study

To extend the EPIC-ATAC framework to incorporate additional cell types or subtypes beyond the ones considered in this study, several steps can be taken: Collecting Additional ATAC-Seq Data: Gather ATAC-Seq data from pure cell populations representing the new cell types or subtypes of interest. Ensure that the data collection process follows the same standards and quality control measures as the original dataset. Identifying Cell-Type Specific Marker Peaks: Conduct differential accessibility analysis to identify cell-type specific marker peaks for the new cell types or subtypes. This involves comparing the chromatin accessibility patterns across different cell populations to pinpoint regions that are uniquely accessible in each cell type. Building Reference Profiles: Generate reference profiles for the new cell types or subtypes based on the identified marker peaks. Normalize the ATAC-Seq counts and compute the median accessibility levels for each peak to create the reference profiles. Integration with EPIC-ATAC: Integrate the new marker peaks and reference profiles into the EPIC-ATAC framework. Ensure that the new data aligns with the existing framework and can be used for accurate cell-type deconvolution in bulk ATAC-Seq samples. By following these steps, the EPIC-ATAC framework can be expanded to include additional cell types or subtypes, allowing for more comprehensive and accurate cell-type quantification in bulk tumor ATAC-Seq data.

What are the potential limitations of using chromatin accessibility data versus gene expression data for cell-type deconvolution, and how could these limitations be addressed

Using chromatin accessibility data for cell-type deconvolution has certain limitations compared to gene expression data: Resolution and Specificity: Chromatin accessibility data may not provide the same level of resolution and specificity as gene expression data in distinguishing between closely related cell types or subtypes. This can lead to challenges in accurately quantifying cell proportions, especially for cell populations with similar regulatory profiles. Interpretation of Regulatory Mechanisms: Chromatin accessibility data reflects the accessibility of DNA regions to transcription factors and regulatory proteins, providing insights into potential regulatory mechanisms. However, gene expression data directly captures the activity of these regulatory elements, offering a more direct link to cellular functions and processes. To address these limitations, several strategies can be employed: Integration of Multi-Omics Data: Combining chromatin accessibility data with gene expression data can provide a more comprehensive view of cellular states and regulatory networks. Integrating multiple omics layers can enhance the accuracy and interpretability of cell-type deconvolution analyses. Development of Advanced Computational Models: Implementing advanced machine learning algorithms that can effectively leverage chromatin accessibility data for cell-type quantification. These models can account for the inherent complexities and nuances of chromatin accessibility profiles to improve the accuracy of cell-type deconvolution. Validation and Benchmarking: Conducting thorough validation and benchmarking studies to compare the performance of chromatin accessibility-based deconvolution methods with gene expression-based approaches. This can help identify the strengths and limitations of each data type and guide the selection of the most appropriate method for specific research questions.

How could the insights gained from the cell-type specific marker peak annotations be leveraged to further understand the regulatory mechanisms underlying the tumor microenvironment

The insights gained from the cell-type specific marker peak annotations can be leveraged to further understand the regulatory mechanisms underlying the tumor microenvironment in the following ways: Identification of Regulatory Networks: By associating the marker peaks with specific transcription factors (TFs) and chromatin binding proteins (CBPs), researchers can uncover key regulatory networks that govern the gene expression patterns in different cell types within the tumor microenvironment. This information can provide valuable insights into the molecular mechanisms driving tumor progression and immune response. Functional Pathway Analysis: The enriched biological pathways identified through the marker peak annotations can offer clues about the functional roles of different cell types in the tumor microenvironment. By linking these pathways to known cellular processes and signaling pathways, researchers can elucidate the functional impact of specific cell populations on tumor development and immune evasion. Integration with Single-Cell Data: The marker peak annotations can be integrated with single-cell ATAC-Seq data to validate and refine cell-type specific profiles. By comparing the chromatin accessibility patterns at the single-cell level with the bulk ATAC-Seq marker peaks, researchers can gain a more detailed understanding of the heterogeneity and dynamics of cell populations within the tumor microenvironment. Overall, leveraging the information from the cell-type specific marker peak annotations can enhance our understanding of the regulatory landscape in the tumor microenvironment and provide valuable insights into the interactions between different cell types in the context of cancer progression and treatment response.
0
star