• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CRIE:一款用于中文文本的自动分析器。

CRIE: An automated analyzer for Chinese texts.

作者信息

Sung Yao-Ting, Chang Tao-Hsing, Lin Wei-Chun, Hsieh Kuan-Sheng, Chang Kuo-En

机构信息

Department of Educational Psychology and Counseling, National Taiwan Normal University, Taipei, Taiwan.

Department of Computer Science and Information Engineering, National Kaohsiung University of Applied Sciences, 415, Jiangong Road, Kaohsiung City, 80778, Taiwan, Republic of China.

出版信息

Behav Res Methods. 2016 Dec;48(4):1238-1251. doi: 10.3758/s13428-015-0649-1.

DOI:10.3758/s13428-015-0649-1
PMID:26424442
Abstract

Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.

摘要

文本分析已应用于各个领域,如话语分析、语料库研究、文本分级和自动作文评估。已经开发了几种工具来分析用字母语言(如英语和西班牙语)编写的文本。然而,目前还没有可用于分析中文文本的工具。本文介绍了一种用于自动分析简体中文和繁体中文文本的工具,称为中文可读性指数浏览器(CRIE)。CRIE由四个子系统组成,包含82个多层次语言特征,能够执行分词、句法分析和特征提取等主要任务。此外,语言特征与机器学习模型的集成使CRIE能够为语言艺术文本、对外汉语学习文本和具有领域知识的文本提供分级和诊断信息。还介绍了CRIE所提供功能的用法和验证。

相似文献

1
CRIE: An automated analyzer for Chinese texts.CRIE:一款用于中文文本的自动分析器。
Behav Res Methods. 2016 Dec;48(4):1238-1251. doi: 10.3758/s13428-015-0649-1.
2
LATIC-A linguistic analyzer for text and item characteristics.LATIC-用于文本和项目特征的语言分析器。
PLoS One. 2022 Nov 3;17(11):e0277250. doi: 10.1371/journal.pone.0277250. eCollection 2022.
3
Constructing and validating readability models: the method of integrating multilevel linguistic features with machine learning.构建和验证可读性模型:将多层次语言特征与机器学习相结合的方法。
Behav Res Methods. 2015 Jun;47(2):340-54. doi: 10.3758/s13428-014-0459-x.
4
Coh-metrix: analysis of text on cohesion and language.Coh - metrix:衔接与语言文本分析
Behav Res Methods Instrum Comput. 2004 May;36(2):193-202. doi: 10.3758/bf03195564.
5
Integrating machine learning with linguistic features: A universal method for extraction and normalization of temporal expressions in Chinese texts.将机器学习与语言特征相结合:一种用于中文文本中时间表达式提取与规范化的通用方法。
Comput Methods Programs Biomed. 2023 May;233:107474. doi: 10.1016/j.cmpb.2023.107474. Epub 2023 Mar 11.
6
Easy-to-read texts for students with intellectual disability: linguistic factors affecting comprehension.智障学生易懂的文本:影响理解的语言因素。
J Appl Res Intellect Disabil. 2014 May;27(3):212-25. doi: 10.1111/jar.12065. Epub 2013 Jul 1.
7
The Role of Surface, Semantic and Grammatical Features on Simplification of Spanish Medical Texts: A User Study.表面、语义和语法特征对西班牙语医学文本简化的作用:一项用户研究。
AMIA Annu Symp Proc. 2018 Apr 16;2017:1322-1331. eCollection 2017.
8
Assessing Communicative Effectiveness of Public Health Information in Chinese: Developing Automatic Decision Aids for International Health Professionals.评估中文公共卫生信息的传播效果:为国际卫生专业人员开发自动决策辅助工具。
Int J Environ Res Public Health. 2021 Sep 30;18(19):10329. doi: 10.3390/ijerph181910329.
9
Languages for different health information readers: multitrait-multimethod content analysis of Cochrane systematic reviews textual summary formats.不同健康信息读者适用的语言:考科蓝系统评价文本摘要格式的多特质-多方法内容分析。
BMC Med Res Methodol. 2019 Apr 5;19(1):75. doi: 10.1186/s12874-019-0716-x.
10
Sentence and text comprehension: roles of linguistic structure.句子与文本理解:语言结构的作用
Annu Rev Psychol. 2001;52:167-96. doi: 10.1146/annurev.psych.52.1.167.

引用本文的文献

1
Enhancing Physician-Patient Communication in Oncology Using GPT-4 Through Simplified Radiology Reports: Multicenter Quantitative Study.通过简化放射学报告利用GPT-4加强肿瘤学领域医患沟通:多中心定量研究
J Med Internet Res. 2025 Apr 17;27:e63786. doi: 10.2196/63786.
2
Ability of ChatGPT to Replace Doctors in Patient Education: Cross-Sectional Comparative Analysis of Inflammatory Bowel Disease.ChatGPT在患者教育中替代医生的能力:炎症性肠病的横断面比较分析
J Med Internet Res. 2025 Mar 31;27:e62857. doi: 10.2196/62857.
3
Evaluation of the quality and readability of ChatGPT responses to frequently asked questions about myopia in traditional Chinese language.
评估ChatGPT对中文常见近视相关问题的回答质量和可读性。
Digit Health. 2024 Sep 2;10:20552076241277021. doi: 10.1177/20552076241277021. eCollection 2024 Jan-Dec.
4
Developing Machine Learning and Statistical Tools to Evaluate the Accessibility of Public Health Advice on Infectious Diseases among Vulnerable People.开发机器学习和统计工具,以评估弱势群体获取传染病公共卫生建议的可及性。
Comput Intell Neurosci. 2021 Dec 17;2021:1916690. doi: 10.1155/2021/1916690. eCollection 2021.
5
Influential Factors on Collective Anxiety of Online Topic-Based Communities.基于在线主题社区集体焦虑的影响因素
Front Psychol. 2021 Oct 5;12:740065. doi: 10.3389/fpsyg.2021.740065. eCollection 2021.
6
Assessing Communicative Effectiveness of Public Health Information in Chinese: Developing Automatic Decision Aids for International Health Professionals.评估中文公共卫生信息的传播效果:为国际卫生专业人员开发自动决策辅助工具。
Int J Environ Res Public Health. 2021 Sep 30;18(19):10329. doi: 10.3390/ijerph181910329.
7
Applying the Bell's Test to Chinese Texts.将贝尔测试应用于中文文本。
Entropy (Basel). 2020 Feb 28;22(3):275. doi: 10.3390/e22030275.
8
Exploring Knowledge and Experience of Health Literacy for Chinese-Speaking Nurses in Taiwan: A Cross-Sectional Study.探索台湾华语护士健康素养的知识与经验:一项横断面研究。
Int J Environ Res Public Health. 2020 Oct 19;17(20):7609. doi: 10.3390/ijerph17207609.
9
Cross-Cultural Translation of the nChinese Version of Pain Care Quality Surveys (C-PainCQ).中文版疼痛护理质量调查问卷(C-PainCQ)的跨文化翻译
Asian Pac Isl Nurs J. 2020;4(4):165-172. doi: 10.31372/20190404.1072.
10
CLAD: A corpus-derived Chinese Lexical Association Database.CLAD:基于语料库的汉语词汇联想数据库。
Behav Res Methods. 2019 Oct;51(5):2310-2336. doi: 10.3758/s13428-019-01208-2.