Provide locus, chromosome, and haplotype components of genetic, breeding, ... values #215

gregorgorjanc · 2024-12-23T20:53:57Z

For pedagogical and some research reasons, it would be neat to show how the whole-genome genetic value is composed of locus-specific values, chromosome-specific values, or haplotype values.

This clearly would not be very efficient memory-wise, but calculations are all already done at the C++ level.

I am not fully clear on how the API would look like. Say, something like:

whole-genome value (default): gv(pop) returning a matrix of nInd * nTraits
chromosome-specific value: gv(pop, return="chr") returning a matrix of nInd * nTraits * nChr
locus-specific value: gv(pop, return="loc") returning a matrix of nInd * nTraits * nLoc
haplotype-specific value:
- gv(pop, return="chr", returnHap=TRUE) returning a matrix of nInd * nTraits * nChr * ploidy
- gv(pop, return="loc", returnHap=TRUE) returning a matrix of nInd * nTraits * nLoc * ploidy

So, the function signature would be gv(pop, return="whole-genome-or-similar", returnHap=FALSE).

Similarly for other values (bv, dd, aa). Obviously for some not all of the above made sense.

Since everything is already calculated at C++ level, we would mostly have to think how to return these results to the R level ...

Thoughts?

The text was updated successfully, but these errors were encountered:

gaynorr · 2024-12-24T13:02:58Z

Epistasis will be an issue. Centering of the values is also a little tricky. What do you doe with the intercept?

Otherwise, genetic values are pretty straightforward to calculate in R code. The breeding values are fairly easy too with the allele substitution effect being exported.

I generally desire avoid too much functionality in existing functions. I find a large chunk of users try to read and understand every argument in a function. For example, I often see code where every argument to a function is explicitly written out, even though the intent in AlphaSimR has been to set sensible defaults to avoid users needing to worry about them in most cases. Thus, I think a design with a larger number of functions might be preferable to fewer functions with more arguments.

gregorgorjanc · 2024-12-24T14:15:03Z

I agree that gv and bv might be best targets, though eventually someone will ask for dd, while aa is not straightforward/possible.

I would follow the same centering as elsewhere in the package for consistency.

I know what you mean about arguments. Pros and cons as always. I prefer less functions, if possible, but also dislike too many arguments. I think the above proposal is not bad.

gregorgorjanc · 2024-12-25T20:08:50Z

Just to add that I always appreciate good defaults in AlphaSimR! I learned a lot from AlphaSimR;)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Provide locus, chromosome, and haplotype components of genetic, breeding, ... values #215

Provide locus, chromosome, and haplotype components of genetic, breeding, ... values #215

gregorgorjanc commented Dec 23, 2024 •

edited

Loading

gaynorr commented Dec 24, 2024

gregorgorjanc commented Dec 24, 2024

gregorgorjanc commented Dec 25, 2024

Provide locus, chromosome, and haplotype components of genetic, breeding, ... values #215

Provide locus, chromosome, and haplotype components of genetic, breeding, ... values #215

Comments

gregorgorjanc commented Dec 23, 2024 • edited Loading

gaynorr commented Dec 24, 2024

gregorgorjanc commented Dec 24, 2024

gregorgorjanc commented Dec 25, 2024

gregorgorjanc commented Dec 23, 2024 •

edited

Loading