Hello,
I think that the downloadable file FASTA_GeckoV2_all.fasta contains a regex-related error, repeated 12 times.
In the original .csv files containing the GeCKOv2 sequences (downloadable from www.addgene.org), there are two gene identifiers which contain an underscore in the gene identifier code itself: "CD99_X" and "CD99_Y". Whatever conversion script you are using to transform from these .csv source files to .fasta format probably contains a regex which is not handling these underscores correctly, because the resulting lines in the fasta file look like this:
I.e., the label line fails to include a second underscore between the gene id (CD99_X / CD99_Y) and the 20 bp gRNA sequence, and furthermore the sequence line itself contains an additional prepended "x" or "y".
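For what it's worth, here is a minimal sketch of a conversion that should not be affected by underscores in the gene id. I have not seen your actual script, so everything here is an assumption: the column names `gene_id` and `seq`, the function names, the `>geneid_sequence` label layout (inferred from the description above), and the example sequence, which is a made-up placeholder rather than a real GeCKOv2 guide. The idea is to build the label from the CSV fields directly, and to split on the last underscore if a label ever has to be parsed back.

```python
import csv
import io
import re

# A gRNA is expected to be exactly 20 uppercase bases (assumption for this sketch).
VALID_SEQ = re.compile(r"[ACGT]{20}")


def row_to_fasta(gene_id, seq):
    """Build one FASTA record; gene_id is used as-is, underscores included."""
    if not VALID_SEQ.fullmatch(seq):
        # Catches stray characters such as a prepended "x" or "y".
        raise ValueError(f"unexpected gRNA sequence for {gene_id}: {seq!r}")
    return f">{gene_id}_{seq}\n{seq}\n"


def csv_to_fasta(csv_text):
    """Convert CSV text with hypothetical columns gene_id and seq to FASTA text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return "".join(row_to_fasta(row["gene_id"], row["seq"]) for row in reader)


def split_label(label):
    """Recover (gene_id, seq) by splitting on the LAST underscore only."""
    gene_id, seq = label.rsplit("_", 1)
    return gene_id, seq


# Placeholder row; the sequence is illustrative, not taken from the library.
example = "gene_id,seq\nCD99_X,ACGTACGTACGTACGTACGT\n"
print(csv_to_fasta(example), end="")
```

Validating each sequence against `[ACGT]{20}` would also flag the 12 affected entries in the existing file, since a sequence line with an extra leading "x" or "y" fails the check.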
Best regards,
Andrew L. Stachyra