Running UCE on new species #52
Comments
I'm able to change the
It seems like the 145469 is already fixed in the state_dict of the trained model. Is it still changeable?
Hi, UCE requires the full 15B ESM2 model, not the 36-layer 3B model. What command are you using to launch UCE for the new species?
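For anyone hitting the same dimension mismatch, here is a minimal sketch of a sanity check on the embedding width. It relies on the hidden sizes of the public ESM2 checkpoints (2560 for the 36-layer 3B model, 5120 for the 48-layer 15B model). The function name `find_wrong_width` and the gene-name-to-vector layout are assumptions for illustration, not part of UCE.

```python
# Hidden sizes of the two public ESM2 checkpoints discussed in this thread.
ESM2_EMBED_DIM = {
    "esm2_t36_3B_UR50D": 2560,   # 36 layers, 3B parameters
    "esm2_t48_15B_UR50D": 5120,  # 48 layers, 15B parameters
}


def find_wrong_width(embeddings, expected_dim=ESM2_EMBED_DIM["esm2_t48_15B_UR50D"]):
    """Return the sorted gene names whose embedding length differs from expected_dim.

    Assumption: `embeddings` maps gene name -> 1-D vector (a list, tuple,
    NumPy array or torch tensor; anything that supports len()).
    """
    return sorted(g for g, v in embeddings.items() if len(v) != expected_dim)


if __name__ == "__main__":
    # Toy data only: one vector of the 15B width, one of the 3B width.
    toy = {"GENE_A": [0.0] * 5120, "GENE_B": [0.0] * 2560}
    print(find_wrong_width(toy))
```

An empty result means every vector already has the 15B width; any names returned were probably embedded with a smaller checkpoint.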
Yes, I noticed that because the protein dimensions didn't match, and I have updated the protein embeddings to use the 15B ESM2 model. The command I used is as follows:
adata_path="/project/shared/UCE/marmoset_m1/Marmoset_M1_10xV3.h5ad"
dir="marmoset_m1/"
species="zzmarmoset"
spec_chrom_csv_path="/project/shared/UCE/model_files/species_chrom_wmarmoset.csv"
token_file="/project/shared/UCE/model_files/all_tokens_marmoset.torch"
offset_path="/project/shared/UCE/model_files/species_offsets_wmarmoset.pkl"
batch_size="50"
python eval_single_anndata.py --adata_path ${adata_path} --dir ${dir} --species ${species} --batch_size ${batch_size} --spec_chrom_csv_path ${spec_chrom_csv_path} --token_file ${token_file} --offset_pkl_path ${offset_path}
the |
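In case it helps others debugging a similar launch, here is a rough sketch that assembles the same command in Python and reports any input file that is missing before anything is run. It only uses the script name and flags that appear in the command above. The helper names `build_uce_command` and `missing_inputs` are hypothetical, and the sketch prints the command rather than executing it.

```python
import os
import shlex


def build_uce_command(adata_path, out_dir, species, batch_size,
                      spec_chrom_csv_path, token_file, offset_pkl_path):
    """Build the argv list for eval_single_anndata.py with the flags quoted in this thread."""
    return [
        "python", "eval_single_anndata.py",
        "--adata_path", adata_path,
        "--dir", out_dir,
        "--species", species,
        "--batch_size", str(batch_size),
        "--spec_chrom_csv_path", spec_chrom_csv_path,
        "--token_file", token_file,
        "--offset_pkl_path", offset_pkl_path,
    ]


def missing_inputs(*paths):
    """Return the subset of paths that do not exist on disk."""
    return [p for p in paths if not os.path.exists(p)]


if __name__ == "__main__":
    # Paths copied from the post above; adjust to your own layout.
    model_files = "/project/shared/UCE/model_files"
    adata = "/project/shared/UCE/marmoset_m1/Marmoset_M1_10xV3.h5ad"
    chrom_csv = model_files + "/species_chrom_wmarmoset.csv"
    tokens = model_files + "/all_tokens_marmoset.torch"
    offsets = model_files + "/species_offsets_wmarmoset.pkl"

    cmd = build_uce_command(adata, "marmoset_m1/", "zzmarmoset", 50,
                            chrom_csv, tokens, offsets)
    for path in missing_inputs(adata, chrom_csv, tokens, offsets):
        print("missing input:", path)
    # Print a shell-safe version of the command instead of running it.
    print(shlex.join(cmd))
```

Passing the arguments as a list avoids the quoting problems that unquoted `${...}` expansions can cause when a path contains spaces.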
I am also trying to run UCE on new species (C. elegans). I was following along with the notebook
Everything ran fine in the notebook "data proc/Create new Species Files.ipynb" until the Generate token file section, where I got an error. Do you have any updated instructions on how to use UCE for new species?
When you say "Please also add this line to the dictionary created on line 247 in the file
Hi, I'm trying to run UCE on a marmoset dataset, but the species is not included in the UCE data. I have obtained the protein embeddings using ESM2 (36 layers) and the chromosome locations of the marmoset genes, but I noticed that there is also an offset file. How am I supposed to give the value for the offset of marmoset? And do I need to modify other files too?
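To make the offset question concrete, here is a minimal sketch under one loud assumption: that a species' offset is the row index at which its protein embeddings begin in a single concatenated token table, so a new species appended at the end gets the current row count as its offset. This is an illustration of that bookkeeping only, not UCE's confirmed file layout. The function `append_species_offset` and the toy numbers are hypothetical.

```python
def append_species_offset(offsets, total_rows, species, n_new_rows):
    """Register a new species whose rows are appended at the end of the token table.

    Assumption (not confirmed by the UCE maintainers in this thread):
    `offsets` maps species name -> first row index of that species'
    embeddings, and `total_rows` is the current number of rows in the table.
    Returns (updated_offsets, updated_total_rows) without mutating the input.
    """
    if species in offsets:
        raise ValueError(f"species {species!r} already has an offset")
    if n_new_rows <= 0:
        raise ValueError("n_new_rows must be positive")
    updated = dict(offsets)
    updated[species] = total_rows  # new rows start right after the existing ones
    return updated, total_rows + n_new_rows


if __name__ == "__main__":
    # Toy numbers only; real tables are far larger.
    existing = {"human": 0, "mouse": 100}
    new_offsets, new_total = append_species_offset(existing, 180, "zzmarmoset", 50)
    print(new_offsets["zzmarmoset"], new_total)
```

If this assumption holds, the token file, the offsets file and the species chromosome CSV all have to be regenerated together so that the indices stay consistent.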