Anomaly detection for high-content image-based phenotypic cell profiling

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

High-content image-based phenotypic profiling combines automated microscopy and analysis to identify phenotypic alterations in cell morphology and provide insight into the cell’s physiological state. Classical representations of the phenotypic profile can not capture the full underlying complexity in cell organization, while recent weakly machine-learning based representation-learning methods are hard to biologically interpret. We used the abundance of control wells to learn the in-distribution of control experiments and use it to formulate a self-supervised reconstruction anomaly-based representation that encodes the intricate morphological inter-feature dependencies while preserving the representation interpretability. The performance of our anomaly-based representations was evaluated for downstream tasks with respect to two classical representations across four public Cell Painting datasets. Anomaly-based representations improved reproducibility, Mechanism of Action classification, and complemented classical representations. Unsupervised explainability of autoencoder-based anomalies identified specific inter-feature dependencies causing anomalies. The general concept of anomaly-based representations can be adapted to other applications in cell biology.

Related collections

Most cited references 56

Record: found
Abstract: found
Article: not found

A Unified Approach to Interpreting Model Predictions

Scott R. Lundberg, Su-In Lee (2017)

Understanding why a model makes a certain prediction can be as crucial as the prediction's accuracy in many applications. However, the highest accuracy for large modern datasets is often achieved by complex models that even experts struggle to interpret, such as ensemble or deep learning models, creating a tension between accuracy and interpretability. In response, various methods have recently been proposed to help users interpret the predictions of complex models, but it is often unclear how these methods are related and when one method is preferable over another. To address this problem, we present a unified framework for interpreting predictions, SHAP (SHapley Additive exPlanations). SHAP assigns each feature an importance value for a particular prediction. Its novel components include: (1) the identification of a new class of additive feature importance measures, and (2) theoretical results showing there is a unique solution in this class with a set of desirable properties. The new class unifies six existing methods, notable because several recent methods in the class lack the proposed desirable properties. Based on insights from this unification, we present new methods that show improved computational performance and/or better consistency with human intuition than previous approaches. To appear in NIPS 2017

0 comments Cited 1211 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles

Aravind Subramanian, Rajiv Narayan, Steven Corsello … (2017)

We previously piloted the concept of a Connectivity Map (CMap), whereby genes, drugs, and disease states are connected by virtue of common gene-expression signatures. Here, we report more than a 1,000-fold scale-up of the CMap as part of the NIH LINCS Consortium, made possible by a new, low-cost, high-throughput reduced representation expression profiling method that we term L1000. We show that L1000 is highly reproducible, comparable to RNA sequencing, and suitable for computational inference of the expression levels of 81% of non-measured transcripts. We further show that the expanded CMap can be used to discover mechanism of action of small molecules, functionally annotate genetic variants of disease genes, and inform clinical trials. The 1.3 million L1000 profiles described here, as well as tools for their analysis, are available at https://clue.io.

0 comments Cited 1007 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Anomaly detection: A survey

Varun Chandola, Arindam Banerjee, Vipin Kumar (2009)

Anomaly detection is an important problem that has been researched within diverse research areas and application domains. Many anomaly detection techniques have been specifically developed for certain application domains, while others are more generic. This survey tries to provide a structured and comprehensive overview of the research on anomaly detection. We have grouped existing techniques into different categories based on the underlying approach adopted by each technique. For each category we have identified key assumptions, which are used by the techniques to differentiate between normal and anomalous behavior. When applying a given technique to a particular domain, these assumptions can be used as guidelines to assess the effectiveness of the technique in that domain. For each category, we provide a basic anomaly detection technique, and then show how the different existing techniques in that category are variants of the basic technique. This template provides an easier and more succinct understanding of the techniques belonging to each category. Further, for each category, we identify the advantages and disadvantages of the techniques in that category. We also provide a discussion on the computational complexity of the techniques since it is an important issue in real application domains. We hope that this survey will provide a better understanding of the different directions in which research has been done on this topic, and how techniques developed in one area can be applied in domains for which they were not intended to begin with.

0 comments Cited 695 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): bioRxiv

Journal ID (publisher-id): BIORXIV

Title: bioRxiv

Publisher: Cold Spring Harbor Laboratory

Publication date (Electronic): 03 June 2024

Electronic Location Identifier: 2024.06.01.595856

Affiliations

[1 ]Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel

[2 ]Department of Computer Science, Jerusalem College of Technology, 91160 Jerusalem, Israel

[3 ]Imaging Platform, Broad Institute of MIT and Harvard, Cambridge (MA), USA

Author notes

[*]

Equal contribution

Author Contribution

AZ conceived the study. AS and NK developed the computational method and analyzed the data. AS, NK, SG, EW and AZ interpreted the results. AS and AZ drafted the manuscript. AZ acquired funding and mentored AS and NK. All authors edited the manuscript and approved its content.

Assaf Zaritsky, assafzar@ 123456gmail.com

Author information

Assaf Zaritsky http://orcid.org/0000-0002-1477-5478

Article

DOI: 10.1101/2024.06.01.595856

PMC ID: 11185510

PubMed ID: 38895267

SO-VID: e4662f8d-8559-41d7-b60e-478dfe7bee4f

License:

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.

Anomaly detection for high-content image-based phenotypic cell profiling

Read this article at

Abstract

Related collections

Recursive Rule based Visual Categorization

Most cited references 56

A Unified Approach to Interpreting Model Predictions

A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles

Anomaly detection: A survey

Author and article information

Journal

Affiliations

Author notes

Author information

Article

History

Categories

Comments

Comment on this article

Similar content 230

Most referenced authors 1,920