Genome Res. 2009 Dec;19(12):2324-33. doi: 10.1101/gr.095976.109. Epub 2009 Sep 18.
Since its start, the Mammalian Gene Collection (MGC) has sought to provide at least one full-protein-coding sequence cDNA clone for every human and mouse gene with a RefSeq transcript, and at least 6200 rat genes. The MGC cloning effort initially relied on random expressed sequence tag screening of cDNA libraries. Here, we summarize our recent progress using directed RT-PCR cloning and DNA synthesis. The MGC now contains clones with the entire protein-coding sequence for 92% of human and 89% of mouse genes with curated RefSeq (NM-accession) transcripts, and for 97% of human and 96% of mouse genes with curated RefSeq transcripts that have one or more PubMed publications, in addition to clones for more than 6300 rat genes. These high-quality MGC clones and their sequences are accessible without restriction to researchers worldwide.
自成立以来,哺乳动物基因集(MGC)一直致力于为每个具有 RefSeq 转录本的人类和小鼠基因以及至少 6200 个大鼠基因提供至少一个全长蛋白编码序列 cDNA 克隆。MGC 的克隆工作最初依赖于 cDNA 文库的随机表达序列标签筛选。在这里,我们总结了我们最近使用定向 RT-PCR 克隆和 DNA 合成的进展。MGC 现在包含了具有完整蛋白编码序列的克隆,涵盖了 92%的人类和 89%的具有经过注释的 RefSeq(NM 访问号)转录本的小鼠基因,以及 97%的人类和 96%的具有经过注释的 RefSeq 转录本且有一个或多个 PubMed 出版物的小鼠基因,此外还有超过 6300 个大鼠基因的克隆。这些高质量的 MGC 克隆及其序列可供全球研究人员无限制地使用。