• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

黑腹果蝇常染色体基因组注释:一项系统综述。

Annotation of the Drosophila melanogaster euchromatic genome: a systematic review.

作者信息

Misra Sima, Crosby Madeline A, Mungall Christopher J, Matthews Beverley B, Campbell Kathryn S, Hradecky Pavel, Huang Yanmei, Kaminker Joshua S, Millburn Gillian H, Prochnik Simon E, Smith Christopher D, Tupy Jonathan L, Whitfied Eleanor J, Bayraktaroglu Leyla, Berman Benjamin P, Bettencourt Brian R, Celniker Susan E, de Grey Aubrey D N J, Drysdale Rachel A, Harris Nomi L, Richter John, Russo Susan, Schroeder Andrew J, Shu Sheng Qiang, Stapleton Mark, Yamada Chihiro, Ashburner Michael, Gelbart William M, Rubin Gerald M, Lewis Suzanna E

机构信息

Department of Molecular and Cell Biology, University of California, Life Sciences Addition, Berkeley, CA 94720-3200, USA.

出版信息

Genome Biol. 2002;3(12):RESEARCH0083. doi: 10.1186/gb-2002-3-12-research0083. Epub 2002 Dec 31.

DOI:10.1186/gb-2002-3-12-research0083
PMID:12537572
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC151185/
Abstract

BACKGROUND

The recent completion of the Drosophila melanogaster genomic sequence to high quality and the availability of a greatly expanded set of Drosophila cDNA sequences, aligning to 78% of the predicted euchromatic genes, afforded FlyBase the opportunity to significantly improve genomic annotations. We made the annotation process more rigorous by inspecting each gene visually, utilizing a comprehensive set of curation rules, requiring traceable evidence for each gene model, and comparing each predicted peptide to SWISS-PROT and TrEMBL sequences.

RESULTS

Although the number of predicted protein-coding genes in Drosophila remains essentially unchanged, the revised annotation significantly improves gene models, resulting in structural changes to 85% of the transcripts and 45% of the predicted proteins. We annotated transposable elements and non-protein-coding RNAs as new features, and extended the annotation of untranslated (UTR) sequences and alternative transcripts to include more than 70% and 20% of genes, respectively. Finally, cDNA sequence provided evidence for dicistronic transcripts, neighboring genes with overlapping UTRs on the same DNA sequence strand, alternatively spliced genes that encode distinct, non-overlapping peptides, and numerous nested genes.

CONCLUSIONS

Identification of so many unusual gene models not only suggests that some mechanisms for gene regulation are more prevalent than previously believed, but also underscores the complex challenges of eukaryotic gene prediction. At present, experimental data and human curation remain essential to generate high-quality genome annotations.

摘要

背景

黑腹果蝇基因组序列近期高质量完成,且有大量扩充的果蝇cDNA序列可供使用,这些序列与78%的预测常染色体基因匹配,这使FlyBase有机会显著改进基因组注释。我们通过直观检查每个基因、利用一套全面的编辑规则、要求每个基因模型有可追溯的证据,并将每个预测的肽段与SWISS-PROT和TrEMBL序列进行比较,使注释过程更加严格。

结果

虽然果蝇中预测的蛋白质编码基因数量基本保持不变,但修订后的注释显著改进了基因模型,导致85%的转录本和45%的预测蛋白质发生结构变化。我们将转座元件和非蛋白质编码RNA注释为新特征,并扩展了非翻译(UTR)序列和可变转录本的注释,分别涵盖了70%以上和20%以上的基因。最后,cDNA序列为双顺反子转录本、同一DNA序列链上具有重叠UTR的相邻基因、编码不同的非重叠肽段的可变剪接基因以及众多嵌套基因提供了证据。

结论

识别出如此多不寻常的基因模型不仅表明某些基因调控机制比以前认为的更为普遍,也凸显了真核基因预测的复杂挑战。目前,实验数据和人工编辑对于生成高质量的基因组注释仍然至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/9080ddd04fdc/gb-2002-3-12-research0083-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/d8d9008e32b2/gb-2002-3-12-research0083-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/d5e12151a9c7/gb-2002-3-12-research0083-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/5096cd6e3150/gb-2002-3-12-research0083-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/f33bfa3bb70b/gb-2002-3-12-research0083-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/1ba7fe1fd738/gb-2002-3-12-research0083-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/30bbdc07f3fe/gb-2002-3-12-research0083-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/214261fea99e/gb-2002-3-12-research0083-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/9080ddd04fdc/gb-2002-3-12-research0083-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/d8d9008e32b2/gb-2002-3-12-research0083-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/d5e12151a9c7/gb-2002-3-12-research0083-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/5096cd6e3150/gb-2002-3-12-research0083-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/f33bfa3bb70b/gb-2002-3-12-research0083-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/1ba7fe1fd738/gb-2002-3-12-research0083-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/30bbdc07f3fe/gb-2002-3-12-research0083-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/214261fea99e/gb-2002-3-12-research0083-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0812/151185/9080ddd04fdc/gb-2002-3-12-research0083-8.jpg

相似文献

1
Annotation of the Drosophila melanogaster euchromatic genome: a systematic review.黑腹果蝇常染色体基因组注释:一项系统综述。
Genome Biol. 2002;3(12):RESEARCH0083. doi: 10.1186/gb-2002-3-12-research0083. Epub 2002 Dec 31.
2
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
3
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
4
Tobacco cessation interventions for young people.针对年轻人的戒烟干预措施。
Cochrane Database Syst Rev. 2006 Oct 18(4):CD003289. doi: 10.1002/14651858.CD003289.pub4.
5
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
6
A systematic review of the effectiveness of adalimumab, etanercept and infliximab for the treatment of rheumatoid arthritis in adults and an economic evaluation of their cost-effectiveness.阿达木单抗、依那西普和英夫利昔单抗治疗成人类风湿关节炎有效性的系统评价及其成本效益的经济学评估。
Health Technol Assess. 2006 Nov;10(42):iii-iv, xi-xiii, 1-229. doi: 10.3310/hta10420.
7
Adefovir dipivoxil and pegylated interferon alfa-2a for the treatment of chronic hepatitis B: a systematic review and economic evaluation.阿德福韦酯与聚乙二醇化干扰素α-2a治疗慢性乙型肝炎:系统评价与经济学评估
Health Technol Assess. 2006 Aug;10(28):iii-iv, xi-xiv, 1-183. doi: 10.3310/hta10280.
8
Indoor residual spraying for preventing malaria in communities using insecticide-treated nets.室内残留喷洒用杀虫剂处理过的蚊帐预防疟疾在社区中使用。
Cochrane Database Syst Rev. 2022 Jan 17;1(1):CD012688. doi: 10.1002/14651858.CD012688.pub3.
9
A systematic review and economic evaluation of epoetin alpha, epoetin beta and darbepoetin alpha in anaemia associated with cancer, especially that attributable to cancer treatment.促红细胞生成素α、促红细胞生成素β和达比加群酯治疗癌症相关性贫血(尤其是癌症治疗所致贫血)的系统评价与经济学评估
Health Technol Assess. 2007 Apr;11(13):1-202, iii-iv. doi: 10.3310/hta11130.
10
Sertindole for schizophrenia.用于治疗精神分裂症的舍吲哚。
Cochrane Database Syst Rev. 2005 Jul 20;2005(3):CD001715. doi: 10.1002/14651858.CD001715.pub2.

引用本文的文献

1
A gene-based model of fitness and its implications for genetic variation: Genetic and inbreeding loads.一种基于基因的适合度模型及其对遗传变异的影响:遗传负荷与近亲繁殖负荷。
bioRxiv. 2025 Jul 9:2025.02.19.639162. doi: 10.1101/2025.02.19.639162.
2
A model of Hill-Robertson interference caused by purifying selection in a nonrecombining genome.由非重组基因组中的纯化选择导致的希尔-罗伯逊干涉模型。
Genetics. 2025 May 8;230(1). doi: 10.1093/genetics/iyaf048.
3
Small ORFs, Big Insights: as a Model to Unraveling Microprotein Functions.小开放阅读框,大发现:作为揭示微小蛋白质功能的模型。

本文引用的文献

1
Assessment of genome-wide protein function classification for Drosophila melanogaster.黑腹果蝇全基因组蛋白质功能分类评估
Genome Res. 2003 Sep;13(9):2118-28. doi: 10.1101/gr.771603.
2
Computational analysis of core promoters in the Drosophila genome.果蝇基因组中核心启动子的计算分析。
Genome Biol. 2002;3(12):RESEARCH0087. doi: 10.1186/gb-2002-3-12-research0087. Epub 2002 Dec 20.
3
Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome.评估比较基因组序列数据对果蝇基因组功能注释的影响。
Cells. 2024 Oct 2;13(19):1645. doi: 10.3390/cells13191645.
4
Multiple hybridization events and repeated evolution of homoeologue expression bias in parthenogenetic, polyploid New Zealand stick insects.孤雌生殖的多倍体新西兰竹节虫中的多次杂交事件和同源基因表达偏向的反复进化。
Mol Ecol. 2025 Aug;34(15):e17422. doi: 10.1111/mec.17422. Epub 2024 Jun 6.
5
RNA polymerase collisions and their role in transcription.RNA 聚合酶碰撞及其在转录中的作用。
Transcription. 2024 Feb-Apr;15(1-2):38-47. doi: 10.1080/21541264.2024.2316972. Epub 2024 Feb 15.
6
Using FlyBase: A Database of Drosophila Genes and Genetics.使用 FlyBase:果蝇基因和遗传学数据库。
Methods Mol Biol. 2022;2540:1-34. doi: 10.1007/978-1-0716-2541-5_1.
7
The Drosophila ribosome protein S5 paralog RpS5b promotes germ cell and follicle cell differentiation during oogenesis.果蝇核糖体蛋白 S5 同源物 RpS5b 在卵子发生过程中促进生殖细胞和滤泡细胞的分化。
Development. 2021 Oct 1;148(19). doi: 10.1242/dev.199511. Epub 2021 Oct 4.
8
Identification of the cross-strand chimeric RNAs generated by fusions of bi-directional transcripts.双向转录本融合产生的链间嵌合 RNA 的鉴定。
Nat Commun. 2021 Jul 30;12(1):4645. doi: 10.1038/s41467-021-24910-2.
9
Double triage to identify poorly annotated genes in maize: The missing link in community curation.双重分诊以鉴定玉米中注释不良的基因:社区策展中的缺失环节。
PLoS One. 2019 Oct 28;14(10):e0224086. doi: 10.1371/journal.pone.0224086. eCollection 2019.
10
Repertoire-wide gene structure analyses: a case study comparing automatically predicted and manually annotated gene models.全面的基因结构分析:自动预测和手动注释基因模型的比较案例研究。
BMC Genomics. 2019 Oct 17;20(1):753. doi: 10.1186/s12864-019-6064-8.
Genome Biol. 2002;3(12):RESEARCH0086. doi: 10.1186/gb-2002-3-12-research0086. Epub 2002 Dec 30.
4
Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.果蝇全基因组鸟枪法组装中的异染色质序列。
Genome Biol. 2002;3(12):RESEARCH0085. doi: 10.1186/gb-2002-3-12-research0085. Epub 2002 Dec 31.
5
The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective.黑腹果蝇常染色质中的转座元件:基因组学视角
Genome Biol. 2002;3(12):RESEARCH0084. doi: 10.1186/gb-2002-3-12-research0084. Epub 2002 Dec 23.
6
Apollo: a sequence annotation editor.阿波罗:一个序列注释编辑器。
Genome Biol. 2002;3(12):RESEARCH0082. doi: 10.1186/gb-2002-3-12-research0082. Epub 2002 Dec 23.
7
An integrated computational pipeline and database to support whole-genome sequence annotation.一个支持全基因组序列注释的集成计算流程和数据库。
Genome Biol. 2002;3(12):RESEARCH0081. doi: 10.1186/gb-2002-3-12-research0081. Epub 2002 Dec 23.
8
A Drosophila full-length cDNA resource.一个果蝇全长cDNA资源。
Genome Biol. 2002;3(12):RESEARCH0080. doi: 10.1186/gb-2002-3-12-research0080. Epub 2002 Dec 23.
9
Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence.完成全基因组鸟枪法测序:黑腹果蝇常染色质基因组序列第3版发布。
Genome Biol. 2002;3(12):RESEARCH0079. doi: 10.1186/gb-2002-3-12-research0079. Epub 2002 Dec 23.
10
Comparative genome and proteome analysis of Anopheles gambiae and Drosophila melanogaster.冈比亚按蚊和黑腹果蝇的比较基因组与蛋白质组分析。
Science. 2002 Oct 4;298(5591):149-59. doi: 10.1126/science.1077061.