An Organism Starts with a Single Pix-Cell: A Neural Cellular Diffusion for High-Resolution Image Synthesis

Authors: Marawan Elbatel, Konstantinos Kamnitsas, Xiaomeng Li

This repository contains the official implementation of the Generative Cellular Automata (GeCA) framework. GeCA targets high-resolution image synthesis for medical imaging and is designed for data-scarce domains such as retinal disease classification from OCT and Fundus images.

Abstract

Generative modeling seeks to approximate the statistical properties of real data, enabling synthesis of new data that closely resembles the original distribution. Generative Adversarial Networks (GANs) and Denoising Diffusion Probabilistic Models (DDPMs) represent significant advancements in generative modeling, drawing inspiration from game theory and thermodynamics, respectively. Nevertheless, the exploration of generative modeling through the lens of biological evolution remains largely untapped. In this paper, we introduce a novel family of models termed Generative Cellular Automata (GeCA), inspired by the evolution of an organism from a single cell. GeCAs are evaluated as an effective augmentation tool for retinal disease classification across two imaging modalities: Fundus and Optical Coherence Tomography (OCT). In the context of OCT imaging, where data is scarce and the distribution of classes is inherently skewed, GeCA significantly boosts classification performance across 11 different ophthalmological conditions, achieving a 12% increase in the average F1 score compared to conventional baselines. GeCAs outperform both diffusion methods that incorporate UNet and state-of-the-art variants with transformer-based denoising models, under similar parameter constraints.

Quantitative Results

Quantitative Image Quality Evaluation

KID values are reported in units of 1e-3. Models are trained and evaluated with classifier-free guidance (CFG) and T = 250 sampling steps.

Method	# Params (↓)	Fundus KID (↓)	Fundus LPIPS (↑)	Fundus GG (>0)	OCT KID (↓)	OCT LPIPS (↑)	OCT GG (>0)
LDM-B	17.3 M	11.64 ± 2.1	0.37 ± 0.09	-10.67	64.5 ± 10	0.39 ± 0.16	-2.31
DiT-S	32.7 M	12.45 ± 2.8	0.31 ± 0.09	-14.55	62.3 ± 5.9	0.37 ± 0.14	-0.44
GeCA-S (ours)	13.3 M	7.42 ± 1.6	0.39 ± 0.11	2.02	49.1 ± 8.0	0.53 ± 0.16	0.34
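
For readers unfamiliar with the KID columns above, here is a minimal sketch of the Kernel Inception Distance estimator: an unbiased squared MMD with the cubic polynomial kernel k(x, y) = (x·y/d + 1)^3. This is not the code in `evaluate.py`. The function names are hypothetical, and a real evaluation applies the estimator to Inception features of real and generated images, usually averaged over several subsets.

```python
from itertools import product


def poly_kernel(x, y):
    """Cubic polynomial kernel commonly used for KID: (x.y / d + 1) ** 3."""
    d = len(x)
    dot = sum(a * b for a, b in zip(x, y))
    return (dot / d + 1.0) ** 3


def kid_estimate(real_feats, gen_feats):
    """Unbiased squared-MMD estimate between two lists of feature vectors.

    Assumes each list holds at least two vectors of equal dimension.
    """
    m, n = len(real_feats), len(gen_feats)
    # Within-set terms skip i == j, which keeps the estimator unbiased.
    k_rr = sum(
        poly_kernel(real_feats[i], real_feats[j])
        for i in range(m) for j in range(m) if i != j
    ) / (m * (m - 1))
    k_gg = sum(
        poly_kernel(gen_feats[i], gen_feats[j])
        for i in range(n) for j in range(n) if i != j
    ) / (n * (n - 1))
    # The cross term averages over every real/generated pair.
    k_rg = sum(
        poly_kernel(x, y) for x, y in product(real_feats, gen_feats)
    ) / (m * n)
    return k_rr + k_gg - 2.0 * k_rg


if __name__ == "__main__":
    # Toy 2-D "features" standing in for Inception activations.
    real = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
    fake = [[0.1, 0.0], [1.0, 0.1], [0.0, 1.1]]
    # Multiply by 1e3 to match the units used in the table.
    print(kid_estimate(real, fake) * 1e3)
```

Because the estimator is unbiased, it can come out slightly negative when the two sets are very similar; lower values indicate closer distributions.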

Getting Started

Installation

Clone the repository:

git clone https://github.com/xmed-lab/GeCA
cd GeCA

Create a virtual environment:

conda create -n GeCA python=3.8
conda activate GeCA

Install PyTorch:

pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

Install additional dependencies by following the setup in Fast-DiT:
```
pip install -r req.txt
```

Training GeCA

Feature Extraction:

CUDA_VISIBLE_DEVICES=0 torchrun --nnodes=1 --master-port 29504 --nproc_per_node=1 extract_features.py --data-path data/oct_multilabel/ --features-path store/oct_features/ --global-batch-size 128 --fold 0

Model Training:

CUDA_VISIBLE_DEVICES=0,1,2,3 accelerate launch --main_process_port 29504 --multi_gpu --num_processes 4 --mixed_precision fp16 train.py --model GeCA-S --feature-path store/oct_features/ --num-classes 11 --global-batch-size 128 --epochs 14000 --fold 8 --validate_every 700 --data-path data/oct_multilabel/ --results-dir ./results_oct_GeCA/
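
The training command sets a global batch size of 128 and launches 4 processes. Assuming `train.py` follows the Fast-DiT convention of splitting the global batch evenly across processes (an assumption worth checking against the script), the per-GPU batch size can be worked out as in the sketch below. `per_process_batch_size` is a hypothetical helper, not part of the repository.

```python
def per_process_batch_size(global_batch_size: int, num_processes: int) -> int:
    """Split a global batch evenly across processes (assumed convention)."""
    if num_processes <= 0:
        raise ValueError("num_processes must be positive")
    if global_batch_size % num_processes != 0:
        raise ValueError("global batch size must be divisible by num_processes")
    return global_batch_size // num_processes


if __name__ == "__main__":
    # Values taken from the training command above.
    print(per_process_batch_size(128, 4))
```

When running on fewer GPUs, changing `--num_processes` and `CUDA_VISIBLE_DEVICES` together keeps this division valid, provided the per-GPU batch still fits in memory.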

Evaluating GeCA

Generate synthetic images from the best checkpoint:

CUDA_VISIBLE_DEVICES=0 torchrun --master-port 29506 --nnodes=1 --nproc_per_node=1 sample_ddp_val.py --expand_ratio 1 --model GeCA-S --data-path oct_multilabel/ --fold 0 --num-sampling-steps 250 --ckpt ./results_oct_GeCA/001-GeCA-S/checkpoints/best_ckpt.pt --sample-dir ./synthetic_oct/

Evaluate generated images:

python evaluate.py --fold 0 --image-size 256 --device_list cuda:0 --real ./oct_multilabel/ --gen ./synthetic_oct/GeCA-S-GS-fold-0-nstep-250-best_ckpt-size-256-vae-ema-cfg-1.5-seed-0/
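
The `--gen` path above encodes the sampling settings in its folder name. The sketch below rebuilds that name from its parts so the evaluation command can be scripted. The pattern is inferred from the single example in this README, and `sample_folder_name` with its parameter names is hypothetical; `sample_ddp_val.py` remains the authoritative source for the format. The `GS` token is copied verbatim because its meaning is not stated here.

```python
def sample_folder_name(
    model="GeCA-S",
    fold=0,
    num_steps=250,
    ckpt_name="best_ckpt",
    image_size=256,
    vae="ema",
    cfg_scale=1.5,
    seed=0,
):
    """Rebuild the sample folder name (pattern inferred from the README example)."""
    return (
        f"{model}-GS-fold-{fold}-nstep-{num_steps}-{ckpt_name}"
        f"-size-{image_size}-vae-{vae}-cfg-{cfg_scale}-seed-{seed}"
    )


if __name__ == "__main__":
    # Default arguments mirror the evaluation command above.
    print("./synthetic_oct/" + sample_folder_name() + "/")
```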

Expanding Dataset with Synthetic Images

To address data scarcity and class imbalance in OCT datasets, GeCA generates high-quality synthetic images that expand the dataset fivefold. This augmentation enhances model performance by adding diverse training samples while maintaining the original dataset’s class distribution.

CUDA_VISIBLE_DEVICES=0 torchrun --master-port 29506 --nnodes=1 --nproc_per_node=1 sample_ddp.py --per-proc-batch-size 64 --expand_ratio 5 --model GeCA-S --data-path ./store/oct_features/ --fold 0 --num-sampling-steps 250 --ckpt ./results_oct_GeCA/001-GeCA-S/checkpoints/best_ckpt.pt --sample-dir ./synthetic_oct/
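
One simple way to expand a dataset while keeping its class distribution is to reuse each real label vector as the conditioning label for a fixed number of synthetic samples. The sketch below illustrates that idea on multi-hot labels. It is an illustration under that assumption, not necessarily what `sample_ddp.py` does, and `expand_labels` and `label_frequencies` are hypothetical names.

```python
from collections import Counter


def expand_labels(labels, expand_ratio):
    """Repeat every label vector expand_ratio times as conditioning labels."""
    if expand_ratio < 1:
        raise ValueError("expand_ratio must be at least 1")
    return [tuple(label) for label in labels for _ in range(expand_ratio)]


def label_frequencies(labels):
    """Relative frequency of each distinct label vector."""
    counts = Counter(tuple(label) for label in labels)
    total = len(labels)
    return {label: count / total for label, count in counts.items()}


if __name__ == "__main__":
    # Toy multi-hot labels over three hypothetical classes.
    real_labels = [(1, 0, 0), (1, 0, 0), (0, 1, 1)]
    synthetic_labels = expand_labels(real_labels, expand_ratio=5)
    print(len(synthetic_labels))
    print(label_frequencies(synthetic_labels))
```

Because every real label is repeated the same number of times, the relative frequency of each label combination in the synthetic set matches the real set.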

OCT Dataset Classification Performance

Synthetic Data	Sen. (↑)	Spe. (↑)	AUC (↑)	F1 (↑)	F1 (sen/spe) (↑)	mAP (↑)	p-value
Baseline (Geometric Aug)	54.66 ± 1.53	96.50 ± 0.16	92.47 ± 0.85	55.47 ± 0.99	60.80 ± 1.49	68.85 ± 1.41	-
Baseline w/o Aug.	48.34 ± 1.45	96.39 ± 0.20	89.99 ± 0.82	54.56 ± 1.77	50.07 ± 0.89	64.58 ± 1.19	**
LDM-B	58.83 ± 1.90	96.12 ± 0.29	91.22 ± 0.74	59.65 ± 3.19	67.74 ± 2.97	70.49 ± 2.64	**
DiT-S	59.25 ± 4.54	95.87 ± 0.37	91.80 ± 1.74	59.11 ± 2.57	67.13 ± 4.87	69.89 ± 3.34	***
GeCA-S (ours)	59.95 ± 5.32	96.38 ± 0.40	92.74 ± 2.21	61.62 ± 3.93	68.38 ± 4.61	73.28 ± 5.58	****

Acknowledgment

The code is built on Fast-DiT, ViTCA, and DiT.

Citation

@misc{elbatel2024organismstartssinglepixcell,
      title={An Organism Starts with a Single Pix-Cell: A Neural Cellular Diffusion for High-Resolution Image Synthesis}, 
      author={Marawan Elbatel and Konstantinos Kamnitsas and Xiaomeng Li},
      year={2024},
      eprint={2407.03018},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2407.03018}, 
}

