Jailingeswari I, Gopinathan S
Department of Computer Science, University of Madras, Chennai, India.
Data Brief. 2024 Jan 30;53:110100. doi: 10.1016/j.dib.2024.110100. eCollection 2024 Apr.
Most palm leaf manuscripts are generally accessible in deteriorated condition, including cracks, discoloration, moisture and humidity, and insects bite. Such a manuscript is considered challenging in the research field. We captured deteriorated Tamil palm leaves around 262 dataset samples are 'Naladiyar(27)',' Tholkappiyam(221)', and' Thirikadugam(14)' which are genned up mortal health, discipline, authoritative text on Tamil grammar. We contribute the high-quality raw dataset with the aid of a Nikon camera, pre-enhance samples by editing software tool, and applied the Otsu threshold to deliver the ground images through binarization as readily accessible content presenting a highly time-consuming task to play a vital role in Machine/Deep/ Transfer learning, AI, and ANN.
大多数棕榈叶手稿通常状况不佳,存在裂缝、变色、受潮以及遭昆虫叮咬等问题。在研究领域,这样的手稿被视为具有挑战性。我们采集了状况不佳的泰米尔棕榈叶,约262个数据集样本包括《纳拉迪亚尔》(27份)、《托勒卡皮亚姆》(221份)和《蒂里卡杜加姆》(14份),这些都是关于泰米尔语法中人类健康、学科、权威文本的内容。我们借助尼康相机提供高质量的原始数据集,通过编辑软件工具对样本进行预增强,并应用大津阈值通过二值化来生成地面图像,将其作为易于获取的内容,这是一项非常耗时的任务,在机器学习/深度学习/迁移学习、人工智能和人工神经网络中发挥着至关重要的作用。