On 2015 Feb 10, Pieter-Jan Volders commented:
At the time of submission, we were unaware of a specific RefSeq lncRNA subset. As a result, we initially used the NR_* records longer than 200 nucleotides. When Dr. Kim D. Pruitt contacted us regarding this issue, we repeated the analysis on the suggested RefSeq subset. This subset contains 4774 transcripts and was obtained through the UCSC Table Browser. As expected, the percentage of transcripts passing the PhyloCSF cutoff decreased from 48% to 14% of the RefSeq subset. The online manuscript was updated, and the current Figure 3 (online as of January 15, 2015) represents these new results. LNCipedia.org was also updated: the RefSeq records that do not represent lncRNAs were removed in database version 3.1.
It is worth noting that we could not find any information on the keyword used in the query suggested by Dr. Kim D. Pruitt (biomol_ncrna_lncrna), either on the RefSeq website or in the cited manuscript. It is unclear to us how many researchers are aware of this subset, and we would like to suggest that RefSeq document it on their website, as it is of great value to the lncRNA research community.
This comment, imported by Hypothesis from PubMed Commons, is licensed under CC BY.