Department of Chemistry, School of Sciences & Engineering , The American University in Cairo , New Cairo , 11835 , Egypt.
Department of Chemistry & Biochemistry , University of Texas at Arlington , Arlington , Texas 76019 , United States.
J Chem Inf Model. 2018 Nov 26;58(11):2214-2223. doi: 10.1021/acs.jcim.8b00534. Epub 2018 Oct 25.
A significant number of published databases and research papers exist in foreign languages and remain untranslated to date. Important sources of primary scientific information in German are Beilstein Handbuch der Organischen Chemie, Gmelin Handbuch der Anorganischen Chemie, Landolt-Börnstein Zahlenwerte und Funktionen, Houben-Weyl Methoden der Organischen Chemie, fundamental research papers, and patents. Although Reaxys has acquired Beilstein and Gmelin, many original references are still in German since 1770s, and the information presented in printed and online versions is often not duplicated. To read these resources, either costly professional translation services are needed or a reading knowledge of German has to be acquired. A convenient approach is to utilize machine translation for reading German texts; however, there is a question of translation reliability. In this work, several different platforms that employ neural network for machine translation (NMT) were tested for translation capability of scientific German. From a preliminary survey, Google Translate and DeepL were finalized for further studies (German to English). Excerpts from German documents spanning more than a century have been carefully chosen from standard works. DeepL Translator and Google Translate were found to be reliable for converting German scientific literature into English for a wide variety of technical passages. As a benchmark, human and machine translations are compared for complex sentences from old literature and a recent publication. Care and intuition should be used before relying on machine translation of methods and directions in general. Reagent addition (to or from) may be inverted in some synthetic procedures using machine translations.
目前存在大量的外文出版数据库和研究论文,且尚未被翻译成中文。德语中的重要原始科学信息来源有:《贝耶尔有机化学手册》《格林有机化学手册》《兰道尔-伯恩斯坦数值与函数》《霍本-魏尔有机化学方法》基础研究论文和专利。尽管 Reaxys 已经收购了贝耶尔和格林,但自 18 世纪以来,许多原始参考文献仍为德文,而且印刷版和在线版的信息往往没有重复。要阅读这些资源,要么需要昂贵的专业翻译服务,要么需要掌握德语阅读知识。一个方便的方法是使用机器翻译来阅读德文文本;但是,翻译的可靠性是一个问题。在这项工作中,我们测试了几个使用神经网络进行机器翻译(NMT)的不同平台,以评估其翻译德文科技文献的能力。初步调查后,选定 Google Translate 和 DeepL 进一步研究(德译英)。从标准著作中精心挑选了一个多世纪以来的德文文献摘录。研究发现 DeepL Translator 和 Google Translate 可以可靠地将德文科技文献转换为英文,适用于各种技术段落。作为基准,对来自旧文献和最新出版物的复杂句子进行了人工翻译和机器翻译的比较。在一般情况下,在依赖机器翻译方法和说明之前,应该谨慎并运用直觉。在某些使用机器翻译的合成过程中,试剂的添加(加入或除去)可能会颠倒。