Bunardzic A
B.C. InfoHealth Ltd., Vancouver, Canada.
Medinfo. 1995;8 Pt 1:520.
The focus of information processing requirements is shifting from the on-line transaction processing (OLTP) issues to the on-line analytical processing (OLAP) issues. While the former serves to ensure the feasibility of the real-time on-line transaction processing (which has already exceeded a level of up to 1,000 transactions per second under normal conditions), the latter aims at enabling more sophisticated analytical manipulation of data. The OLTP requirements, or how to efficiently get data into the system, have been solved by applying the Relational theory in the form of Entity-Relation model. There is presently no theory related to OLAP that would resolve the analytical processing requirements as efficiently as Relational theory provided for the transaction processing. The "relational dogma" also provides the mathematical foundation for the Centralized Data Processing paradigm in which mission-critical information is incorporated as 'one and only one instance' of data, thus ensuring data integrity. In such surroundings, the information that supports business analysis and decision support activities is obtained by running predefined reports and queries that are provided by the IS department. In today's intensified competitive climate, businesses are finding that this traditional approach is not good enough. The only way to stay on top of things, and to survive and prosper, is to decentralize the IS services. The newly emerging Distributed Data Processing, with its increased emphasis on empowering the end user, does not seem to find enough merit in the relational database model to justify relying upon it. Relational theory proved too rigid and complex to accommodate the analytical processing needs. In order to satisfy the OLAP requirements, or how to efficiently get the data out of the system, different models, metaphors, and theories have been devised. All of them are pointing to the need for simplifying the highly non-intuitive mathematical constraints found in the relational databases normalized to their 3rd normal form. Object-oriented approach insists on the importance of the common sense component of the data processing activities. But, particularly interesting, is the approach that advocates the necessity of 'flattening' the structure of the business models as we know them today. This discipline is called Dimensional Modeling and it enables users to form multidimensional views of the relevant facts which are stored in a 'flat' (non-structured), easy-to-comprehend and easy-to-access database. When using dimensional modeling, we relax many of the axioms inherent in a relational model. We focus on the knowledge of the relevant facts which are reflecting the business operations and are the real basis for the decision support and business analysis. At the core of the dimensional modeling are fact tables that contain the non-discrete, additive data. To determine the level of aggregation of these facts, we use granularity tables that specify the resolution, or the level/detail, that the user is allowed to entertain. The third component is dimension tables that embody the knowledge of the constraints to be used to form the views.
信息处理需求的重点正从联机事务处理(OLTP)问题转向联机分析处理(OLAP)问题。前者用于确保实时联机事务处理的可行性(在正常情况下,其每秒的事务处理量已超过1000笔),而后者旨在实现对数据更复杂的分析操作。OLTP需求,即如何有效地将数据输入系统,已通过应用实体关系模型形式的关系理论得到解决。目前还没有与OLAP相关的理论能像关系理论为事务处理提供的那样有效地解决分析处理需求。“关系教条”还为集中式数据处理范式提供了数学基础,在这种范式中,关键任务信息作为数据的“唯一实例”被纳入,从而确保数据完整性。在这种环境下,支持业务分析和决策支持活动的信息是通过运行信息系统部门提供的预定义报告和查询来获取的。在当今竞争激烈的环境中,企业发现这种传统方法不够好。要保持领先地位、生存和繁荣,唯一的办法是将信息系统服务去中心化。新出现的分布式数据处理越来越强调赋予终端用户权力,但似乎在关系数据库模型中找不到足够的优点来证明依赖它是合理的。关系理论被证明过于僵化和复杂,无法满足分析处理需求。为了满足OLAP需求,即如何有效地将数据输出系统,人们设计了不同的模型、隐喻和理论。所有这些都表明需要简化在规范化到第三范式的关系数据库中发现的高度不直观的数学约束。面向对象方法强调数据处理活动中常识成分的重要性。但是,特别有趣的是一种主张“扁平化”我们今天所知的业务模型结构的必要性的方法。这一学科被称为维度建模,它使用户能够对存储在“扁平”(非结构化)、易于理解和访问的数据库中的相关事实形成多维视图。在使用维度建模时,我们放宽了关系模型中固有的许多公理。我们关注反映业务操作且是决策支持和业务分析真正基础有关事实的知识。维度建模的核心是包含非离散、可加性数据的事实表。为了确定这些事实的聚合级别,我们使用粒度表来指定用户可以考虑的分辨率或级别/细节。第三个组件是维度表,它体现了用于形成视图的约束知识。