Interesting problem: automatic phenotype group detection

tpoterba · December 18, 2019, 4:59pm

Pavlos brought up an interesting question on the user forum

The problem as I see it is: Given a list of phenotypes Y, each of which has N samples with defined phenotypes, compute a list of lists of phenotypes such that the number of samples included in each group, G, is not smaller than N by more than some small error term E for any phenotype in the group:

G/N > 1 - E

We want to minimize the number of groups, in order to take best advantage of the BLAS3 optimizations in linear regression rows.

Topic		Replies	Views
MNV project requirements	1	828	May 9, 2018
Thoughts on entry filtering	3	854	April 3, 2019
Randomized big linear algebra	3	1119	July 1, 2020
The curious case of `impute_sex`	14	852	February 6, 2018
The future of types	3	635	November 18, 2017

Interesting problem: automatic phenotype group detection

Related topics