利用 ChemDataExtractor 从科学文献中自动生成热激活延迟荧光分子数据库。

A database of thermally activated delayed fluorescent molecules auto-generated from scientific literature with ChemDataExtractor.

机构信息

Cavendish Laboratory, University of Cambridge, J. J. Thomson Avenue, Cambridge, CB3 0HE, UK.

ISIS Neutron and Muon Source, Rutherford Appleton Laboratory, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0QX, UK.

出版信息

Sci Data. 2024 Jan 17;11(1):80. doi: 10.1038/s41597-023-02897-3.

DOI:10.1038/s41597-023-02897-3

PMID:38233439

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10794197/

Abstract

A database of thermally activated delayed fluorescent (TADF) molecules was automatically generated from the scientific literature. It consists of 25,482 data records with an overall precision of 82%. Among these, 5,349 records have chemical names in the form of SMILES strings which are represented with 91% accuracy; these are grouped in a subsidiary database. Each data record contains one of the following four properties: maximum emission wavelength (λ), photoluminescence quantum yield (PLQY), singlet-triplet energy splitting (ΔE), and delayed lifetime (τ). The databases were created through text mining using ChemDataExtractor, a chemistry-aware natural-language-processing toolkit, which has been adapted for TADF research. The text-mined corpus consisted of 2,733 papers from the Royal Society of Chemistry and Elsevier. To the best of our knowledge, these databases are the first databases that have been auto-generated for TADF molecules from existing publications. The databases have been publicly released for experimental and computational applications in the TADF research field.

摘要

从科学文献中自动生成了一个热激活延迟荧光 (TADF) 分子数据库。它由 25482 条数据记录组成，整体精度为 82%。其中，5349 条记录具有 SMILES 字符串形式的化学名称，其表示准确度为 91%；这些记录被分组到一个附属数据库中。每个数据记录包含以下四个属性之一：最大发射波长 (λ)、光致发光量子产率 (PLQY)、单重态-三重态能量分裂 (ΔE) 和延迟寿命 (τ)。这些数据库是通过使用 ChemDataExtractor（一种具有化学意识的自然语言处理工具包）进行文本挖掘创建的，该工具包已针对 TADF 研究进行了改编。文本挖掘语料库由来自皇家化学学会和爱思唯尔的 2733 篇论文组成。据我们所知，这些数据库是第一个从现有出版物中自动生成 TADF 分子的数据库。这些数据库已公开发布，可供 TADF 研究领域的实验和计算应用使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5766/10794197/9e4f304e5478/41597_2023_2897_Fig1_HTML.jpg

相似文献

A database of thermally activated delayed fluorescent molecules auto-generated from scientific literature with ChemDataExtractor.

Sci Data. 2024 Jan 17;11(1):80. doi: 10.1038/s41597-023-02897-3.

A thermoelectric materials database auto-generated from the scientific literature using ChemDataExtractor.

Sci Data. 2022 Oct 22;9(1):648. doi: 10.1038/s41597-022-01752-1.

Auto-generated database of semiconductor band gaps using ChemDataExtractor.

Sci Data. 2022 May 3;9(1):193. doi: 10.1038/s41597-022-01294-6.

A database of battery materials auto-generated using ChemDataExtractor.

Sci Data. 2020 Aug 6;7(1):260. doi: 10.1038/s41597-020-00602-2.

TADF Material Design: Photophysical Background and Case Studies Focusing on Cu and Ag Complexes.

Chemphyschem. 2017 Dec 15;18(24):3508-3535. doi: 10.1002/cphc.201700872. Epub 2017 Dec 19.

Conjugation-Induced Thermally Activated Delayed Fluorescence: Photophysics of a Carbazole-Benzophenone Monomer-to-Tetramer Molecular Series.

J Phys Chem A. 2021 Feb 18;125(6):1345-1354. doi: 10.1021/acs.jpca.0c08977. Epub 2021 Feb 8.

Theoretical tuning of the singlet-triplet energy gap to achieve efficient long-wavelength thermally activated delayed fluorescence emitters: the impact of substituents.

Phys Chem Chem Phys. 2017 Aug 16;19(32):21639-21647. doi: 10.1039/c7cp02615c.

Reverse Designing the Wavelength-Specific Thermally Activation Delayed Fluorescent Molecules Using a Genetic Algorithm Coupled with Cheap QM Methods.

J Phys Chem A. 2023 Jul 20;127(28):5930-5941. doi: 10.1021/acs.jpca.3c01714. Epub 2023 Jul 7.

Perovskite- and Dye-Sensitized Solar-Cell Device Databases Auto-generated Using ChemDataExtractor.

Sci Data. 2022 Jun 17;9(1):329. doi: 10.1038/s41597-022-01355-w.

"Rate-limited effect" of reverse intersystem crossing process: the key for tuning thermally activated delayed fluorescence lifetime and efficiency roll-off of organic light emitting diodes.

Chem Sci. 2016 Jul 1;7(7):4264-4275. doi: 10.1039/c6sc00542j. Epub 2016 Mar 15.

引用本文的文献

Autogenerating a Domain-Specific Question-Answering Data Set from a Thermoelectric Materials Database to Enable High-Performing BERT Models.

J Chem Inf Model. 2025 Aug 25;65(16):8579-8592. doi: 10.1021/acs.jcim.5c00840. Epub 2025 Aug 7.

Thermally activated delayed fluorescence materials: innovative design and advanced application in biomedicine, catalysis and electronics.

RSC Adv. 2025 Mar 7;15(10):7383-7471. doi: 10.1039/d5ra00157a. eCollection 2025 Mar 6.

Cost-Efficient Domain-Adaptive Pretraining of Language Models for Optoelectronics Applications.

J Chem Inf Model. 2025 Mar 10;65(5):2476-2486. doi: 10.1021/acs.jcim.4c02029. Epub 2025 Feb 11.

MechBERT: Language Models for Extracting Chemical and Property Relationships about Mechanical Stress and Strain.

J Chem Inf Model. 2025 Feb 24;65(4):1873-1888. doi: 10.1021/acs.jcim.4c00857. Epub 2025 Jan 31.

本文引用的文献

A thermoelectric materials database auto-generated from the scientific literature using ChemDataExtractor.

Sci Data. 2022 Oct 22;9(1):648. doi: 10.1038/s41597-022-01752-1.

Delayed fluorescence from inverted singlet and triplet excited states.

Nature. 2022 Sep;609(7927):502-506. doi: 10.1038/s41586-022-05132-y. Epub 2022 Sep 14.

Perovskite- and Dye-Sensitized Solar-Cell Device Databases Auto-generated Using ChemDataExtractor.

Sci Data. 2022 Jun 17;9(1):329. doi: 10.1038/s41597-022-01355-w.

Efficient Adversarial Generation of Thermally Activated Delayed Fluorescence Molecules.

ACS Omega. 2022 May 20;7(21):18179-18188. doi: 10.1021/acsomega.2c02253. eCollection 2022 May 31.

Reconstructing Chromatic-Dispersion Relations and Predicting Refractive Indices Using Text Mining and Machine Learning.

J Chem Inf Model. 2022 Jun 13;62(11):2670-2684. doi: 10.1021/acs.jcim.2c00253. Epub 2022 May 19.

A database of refractive indices and dielectric constants auto-generated using ChemDataExtractor.

Sci Data. 2022 May 3;9(1):192. doi: 10.1038/s41597-022-01295-5.

Auto-generated database of semiconductor band gaps using ChemDataExtractor.

Sci Data. 2022 May 3;9(1):193. doi: 10.1038/s41597-022-01294-6.

Single Model for Organic and Inorganic Chemical Named Entity Recognition in ChemDataExtractor.

J Chem Inf Model. 2022 Mar 14;62(5):1207-1213. doi: 10.1021/acs.jcim.1c01199. Epub 2022 Feb 24.

ChemDataExtractor 2.0: Autopopulated Ontologies for Materials Science.

J Chem Inf Model. 2021 Sep 27;61(9):4280-4289. doi: 10.1021/acs.jcim.1c00446. Epub 2021 Sep 16.

Highly Efficient Near-Infrared Thermally Activated Delayed Fluorescence Molecules via Acceptor Tuning: Theoretical Molecular Design and Experimental Verification.

J Phys Chem Lett. 2021 Feb 25;12(7):1893-1903. doi: 10.1021/acs.jpclett.0c03805. Epub 2021 Feb 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用 ChemDataExtractor 从科学文献中自动生成热激活延迟荧光分子数据库。

A database of thermally activated delayed fluorescent molecules auto-generated from scientific literature with ChemDataExtractor.

机构信息

Cavendish Laboratory, University of Cambridge, J. J. Thomson Avenue, Cambridge, CB3 0HE, UK.

ISIS Neutron and Muon Source, Rutherford Appleton Laboratory, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0QX, UK.

出版信息

Sci Data. 2024 Jan 17;11(1):80. doi: 10.1038/s41597-023-02897-3.

DOI:10.1038/s41597-023-02897-3

PMID:38233439

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10794197/

Abstract

摘要

利用 ChemDataExtractor 从科学文献中自动生成热激活延迟荧光分子数据库。

A database of thermally activated delayed fluorescent molecules auto-generated from scientific literature with ChemDataExtractor.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用 ChemDataExtractor 从科学文献中自动生成热激活延迟荧光分子数据库。

A database of thermally activated delayed fluorescent molecules auto-generated from scientific literature with ChemDataExtractor.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献