On 2016 Apr 19, S. Hong Lee commented:
Many thanks for your comments. The two models (response variables) do not simply represent association between genome-wide markers, previously associated with schizophrenia, and woman’s age at first birth. The two models (response variables) are the functions derived from the relationship between the schizophrenia risk in children and their mother’s age, studied in a large national-wide study (please see reference #11 in the paper). So what we tested was that the relationship between the schizophrenia risk in children and their mother’s age found from the previous (totally independent) study was associated with the relationship between schizophrenia risk profile score (inferred from SNP effects estimated in another independent SCZ GWAS data set) and age at first birth. Although we used SNP effects to infer risk profile score, we did not think the test should be corrected for the number of SNPs. We used a response variable and one set of risk profile score (N x 1) in a single test (not multiple sets of risk profile score).
In addition, I would like to explain a bit more about the variance explained by the predictor in the target data set (R2 in Table 2). The variance explained by the predictor in the target data set is not the actual variance of the underlying effects. It means it can increase more when there is less sampling error (i.e. with a larger samples size). For example, if you estimate SNP effects in a discovery (or training) data set, and the estimated SNP effects are projected into an independent target data set, the R2 explained by the predictor (from the estimated SNPs effects) depends on the estimation error of the SNP effects. So, if the sample size in the discovery data set (SCZ GWAS in our case) is increased, the R2 can be increased (hence lower p value).
This comment, imported by Hypothesis from PubMed Commons, is licensed under CC BY.