extraCellularRNA

find DESeq data sets

anything with de.seq or de-seq is going to be de seq output of either the normalized counts or differential expression variety

find /public/groups/kimlab  -name "*de.seq*" -print |grep -v "permission denied"

conda references

start the env

cd extraCellularRNA
conda activate extraCellularRNA
export PYTHONPATH="${PYTHONPATH}:`pwd`/src"

create the extraCellularRNA environment from yaml file

conda env create -f environment.yml
pip install tensorflow

updating dependencies

see exporting-an-environment-file-across-platforms

cd extraCellularRNA
conda activate extraCellularRNA
conda env export --from-history > environment.yml

Running Unit test

cd extraCellularRNA
conda activate extraCellularRNA
export PYTHONPATH="${PYTHONPATH}:`pwd`/src"
cd src/test
python -m unittest discover .

example of how to run a specific test case

(extraCellularRNA) $ python plots/test/testUpsetPlots.py  TestUpsetPlots.testFindDegrees

Spark Install

download spark-3.1.2-bin-hadoop3.2.tgz from https://spark.apache.org/downloads.html

you could probably just install pyspark using pip if you like

Configuring Visual Studio Code (IDE)

doc/configureVisualStudio.md

Warning !!!!!

This branch contains config file for visual studio code. You will need to modify them to match your development enviroment
(extraCellularRNA) $ ls .vscode/ settings.json launch.json

Deconvolution Hyperparmeter Tunning Overview

9/8/23 : Status
We created some preliminary data that suggests our method may work. This was good enough to advance to candidacy last spring. The analysis was done using a series of notebooks. We will need to clean this up so we can consistently run the analysis using different parmaters, collect the output and meta data in a format that make it easy to track what works and does not work.

ref

assumptions

we ran salmonQuantWorkflow.wdl and 1vsAllTask.wdl on all the GTEx and TCGA sample
the results have been aggregated into count matrices (transcripts, and count by genes)
- data file can be found at /private/groups/kimlab/{GTEx,TCGA,GTEx_TCGA}

** preliminary data pipeline **

select signatue gene sets
- extraCellularRNA/terra/jupyterNotebooks/signatureGenesUpsetPlots.html
- selects set of 1vsAll genes. ie. best n=25, up regulated, down regulated, ...
- generates upset plots and gene set interections
- results where copied from GPC bucket to /private/groups/kimlab
Create CIBERSORTx mixture matrix
- extraCellularRNA/terra/jupyterNotebooks/cibersort/createCibersortMixtureMatrix.ipynb
- scales counts using DESeq estiamted scaling factors
- output:
  - mixture matrix
  - expected fractions
  - randomized mixture matirx. random shuffle. Does not contain any information we can use this to evaluate how well our model works
Create CIBERSORTx gene signature matrix
- extraCellularRNA/terra/jupyterNotebooks/cibersort/createCiberSortGeneSignatureMatrix.ipynb
- given results files extraCellularRNA/terra/jupyterNotebooks/signatureGenesUpsetPlots.ipynb and results form 1vsAll analysis, creates a gene signature matrix
Run CIBERSORTx see extraCellularRNA/terra/cibersortx/wdl/README.md for direction on how to run on phoenix/slurm or CLI

** preliminary analysis pipeline**

extraCellularRNA/terra/jupyterNotebooks/cibersort. See README.md

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.vscode		.vscode
R		R
bin		bin
data		data
deconvolutionAnalysis		deconvolutionAnalysis
doc		doc
intraExtraRNA_POC		intraExtraRNA_POC
juypterNotebooks		juypterNotebooks
src		src
terra		terra
.gitignore		.gitignore
.project		.project
.pydevproject		.pydevproject
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

extraCellularRNA

find DESeq data sets

conda references

start the env

create the extraCellularRNA environment from yaml file

updating dependencies

Running Unit test

Spark Install

Configuring Visual Studio Code (IDE)

Warning !!!!!

Deconvolution Hyperparmeter Tunning Overview

About

Releases

Packages

Contributors 2

Languages

aedavids/extraCellularRNA

Folders and files

Latest commit

History

Repository files navigation

extraCellularRNA

find DESeq data sets

conda references

start the env

create the extraCellularRNA environment from yaml file

updating dependencies

Running Unit test

Spark Install

Configuring Visual Studio Code (IDE)

Warning !!!!!

Deconvolution Hyperparmeter Tunning Overview

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages