DELVE

Dynamic selection of locally covarying features

Introduction

DELVE is an unsupervised feature selection method for identifying a representative subset of dynamically-expressed molecular features that recapitulate cellular trajectories from single-cell data (e.g. single-cell RNA sequencing, protein iterative immunofluorescence imaging). In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effect of unwanted sources of feature variation confounding inference, and instead models cell states from dynamic feature modules that constitute core regulatory complexes. For more details on the method, please read the associated paper: Ranek JS, Stallaert W, Milner JJ, Redick M, Wolff SC, Beltran AS, Stanley N, and Purvis JE. DELVE: feature selection for preserving biological trajectories in single-cell data. Nature Communications. 2024.

For a comparison of alternative feature selection methods and the overall benchmarking pipeline, please see delve_benchmark.

Installation

Dependencies

Python >= 3.6, sketchKH == 0.1.1, anndata >= 0.7.6, numpy >= 1.19.5, scipy >= 1.7.1, pandas >= 1.1.5, umap-learn == 0.5.1, scikit-learn >= 0.23.2, tqdm

You can install the package and necessary dependencies with pip by,

pip install delve-fs

Alternatively, you can clone the git repository and install the necessary dependencies using the provided yml file. First clone the repository by,

git clone https://github.com/jranek/delve.git

Then change the working directory as,

cd delve

You can then create the conda environment using the provided yml file.

conda env create -f venv_delve.yml

Once the environment is created, you can activate it by,

conda activate venv_delve

Data access

You can download all of the preprocessed single-cell datasets (.h5ad files) from the Zenodo repository.

Example usage

To perform trajectory-preserving feature selection with DELVE, first read in a preprocessed .h5ad object. This .h5ad object contains a sample profiled with a single-cell technology (i.e. protein iterative indirect immunofluorescence imaging data).

import anndata
import os
adata = anndata.read_h5ad(os.path.join('data', 'adata_RPE.h5ad'))

Then simply perform DELVE feature selection by,

# Inputs:
# adata: annotated data object (dimensions = cells x features)
# k: number of nearest neighbors in the between-cell kNN affinity graph
# n_pcs: number of principal components. If None (default): will construct a between-cell affinity graph by computing pairwise Euclidean distances according to adata.X. Else: according to PCA of adata.X 
# num_subsamples: number of representative cellular neighborhoods. Neighborhoods are subsampled using kernel herding sketching (see https://dl.acm.org/doi/abs/10.1145/3535508.3545539, https://github.com/CompCy-lab/SketchKH)  
# n_clusters: number of feature modules
# n_random_state: number of random KMeans clustering initializations when identifying dynamic feature modules
# random_state: random state parameter 
# n_jobs: number of tasks
# -----------------------
    
# Returns:
# delta_mean: average pairwise change in expression across prototypical cellular neighborhoods (dimensions = num_subsamples x features)
# modules: dataframe containing feature-cluster assignments and permutation p-values (dimensions = features x 2)
# ranked_features: ranked set of features that best preserve the local trajectory structure (dimensions = features x 1)
# -----------------------
from delve import *
delta_mean, modules, ranked_features = delve_fs(adata = adata, k = 10, num_subsamples = 1000, n_clusters = 5, random_state = 0, n_jobs = -1)

Recent Updates and Changes

The project has been extended with several new features and analyses:

Attention Mechanism Implementation

Added an attention-based autoencoder to improve feature selection
The attention mechanism learns feature importance weights through unsupervised learning
Implemented in delve_ae.py and delve_ae_new_it.py

Multiple Affinity Graph Methods

Added various methods to construct affinity graphs (basic_affinity.py):

kNN-based affinity
K-means clustering-based affinity
RBF kernel-based affinity
GMM-based affinity

Trajectory Analysis

New tools for analyzing and comparing trajectories (trajectory_comparison.py):

Comparison of trajectories between different feature selection methods
Visualization of trajectory preservation
Quantitative metrics for trajectory similarity

Preservation Testing

Added comprehensive testing framework (test_preservation.py):

Cell state preservation analysis
Trajectory preservation testing
Visualization tools for comparing different feature selection approaches
Quantitative metrics including ARI scores and silhouette coefficients

These updates provide more robust ways to analyze and validate feature selection methods, with particular focus on comparing traditional approaches with attention-based methods.

The run_delve file contains the main function for performing DELVE feature selection and has our changes without attention. 
The changes folder has all the different files we changed and added attetnion to, the names show what is done in each of them! :D

License

This software is licensed under the MIT license (https://opensource.org/licenses/MIT).

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.github		.github
Changes		Changes
data		data
delve		delve
plots		plots
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
affinity_comparison.png		affinity_comparison.png
basic_affinity.py		basic_affinity.py
graph_metrics_comparison.png		graph_metrics_comparison.png
graph_structure.png		graph_structure.png
method_comparison.png		method_comparison.png
method_comparison_simple.png		method_comparison_simple.png
pipeline.png		pipeline.png
run_delve.py		run_delve.py
setup.py		setup.py
venv_delve.yml		venv_delve.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DELVE

Introduction

Installation

Data access

Example usage

Recent Updates and Changes

Attention Mechanism Implementation

Multiple Affinity Graph Methods

Trajectory Analysis

Preservation Testing

License

About

Uh oh!

Releases

Packages

Languages

License

GWeale/delve

Folders and files

Latest commit

History

Repository files navigation

DELVE

Introduction

Installation

Data access

Example usage

Recent Updates and Changes

Attention Mechanism Implementation

Multiple Affinity Graph Methods

Trajectory Analysis

Preservation Testing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages