This repository provides Bash and R scripts to efficiently process and analyze genotype files.
- Work with genotype files, including
chromosome.data
andimputed_genotypefiles
.
- Compare genotype files to:
- Identify SNP removals.
- Detect updates in genotype files.
Perform quality control checks on genotype data, including:
- Call Rate: Ensure sufficient SNP call coverage.
- Call Frequency: Verify the frequency of SNP calls.
- Minor Allele Frequency (MAF): Exclude SNPs with low-frequency alleles.
- Compute:
- Linear kernels.
- Gaussian kernels.
- Bash: For processing pipeline scripts.
- R: Version 4.0+ recommended.