Subtype assignment and primer trimming for antibody sequences with Error Correcting BarCodes (ECBC)
- R1: fwd 4N, leader, variable
- R2: rev subtype determination, constant, variable
- I1: ECBC (12)
- I2: sample index
- R1: fwd 4N, leader, variable
- R2: rev ECBC (12), spacer (4), constant, variable
- I1: LC index TAAGGCGAGAGC (12)
- I2: sample index
IgG_klMA_ECBC.py
- split into IgG vs. klMA
- for LC do klMA discrimination
- for HC determine IgG subtypes
- write ECBC in front of R1
- pandaseq R1 und R2
IgG_klMA_ECBC_V2.py NAME_L001_R1_001.FASTQ[.GZ]
###Helper Scripts
seq_count.sh
counts fasta and fastq sequences
batch.ch
to run multiple samples
imgt_combine.sh
to combine two IMGT output files from the same sample
primer_fwd_H.fasta
heavy chain forward primersprimer_fwd_k.fasta
kappa forward primersprimer_fwd_l.fasta
lambda forward primers
- run
IgG_klMA_ECBC_V2.py
- upload sequences to IMGT (split big files into files with 1000000 sequences each)
- download IMGT results, combine files which had been split before
- run
annotate_first.py