登陆注册
27082000000029

第29章 Database System(8)

Data Warehousing

Data warehouses contain consolidated data from many sources?? spanning long time periods?? and augmented with summary information. Warehouses are much larger than other kinds of databases; sizes ranging from several gigabytes to terabytes are common. Typical workloads involve ad hoc?? fairly complex queries?? and fast response time is important. These characteristics differentiate warehouse applications from OLTP applications?? and different DBMS design and implementation techniques must be used to achieve satisfactory results. Adistributed DBMS with good scalability and high availability ??achieved by storing tables redundantly at more than one site?? is required for very large warehouses.

An organization's daily operations access and modify operational databases. Data from these operational databases and other external sources ??e. g.?? customer profiles supplied by external consultants?? are extracted by using gateways?? or standard external interfaces supported by the underlying DBMS. Standards such as Open Database Connectivity ??ODBC?? from Microsoft are emerging for gateways;ODBC is an application program interface that allows client programs to generate SQL statements to be executed at a sewer.

There are many challenges in creating and maintaining a large data warehouse. A goad database schema must be designed to hold an integrated collection of data copied from diverse sources. For example?? a company warehouse might include the Inventory and Personnel departments' databases?? together with Sales databases maintained by offices in different countries. Since the source databases are often created and maintained by different groups?? there are a number of semantic mismatches across these databases?? such as different currency units?? different names for the same attribute?? and differences in how tables are normalized or structured;these differences must be reconciled when data is brought into the warehouse. After the warehouse schema is designed?? the warehouse must be populated?? and over time?? it must be kept consistent with the primary data sources.

Data extracted from operational databases and external sources is first cleaned to minimize errors and fin in missing information when possible?? and transformed to reconcile semantic mismatches. Transforming data is typically accomplished by defining a relational view over the tables in the data sources ??the operational databases and other external sources??. Loading data consists of materializing such views and storing them in the warehouse. Unlike a standard view in a relational DBMS?? therefore?? the view is stored in a database ??the warehouse?? that is different from the database ??s?? containing the tables it is defined over.

The cleaned and transformed data is finally loaded into the warehouse?? Additional preprocessing such as sorting and generation of summary information is carried out at this stage. Data is partitioned and indexes are built for efficiency. The large volume of data to be loaded means that loading is a slow process; loading a terabyte of data sequentially can take weeks. Parallelism is therefore important for loading warehouses.

同类推荐
  • 用耳朵听最优美的名著

    用耳朵听最优美的名著

    系列图书精选的各类故事、散文、演讲、时文及名著片段,均用词精准简洁,语句流畅优美,将引领你进入趣、情、爱与理的博大世界,使你更加充满信心地去追求梦想。这里有嘻嘻哈哈的幽默故事,有体会幸福与生活的感悟故事,有帮你战胜挫折给你勇气的故事,有闪烁着人性光辉的美德故事,有发人深省的智慧故事,也有在成长路上给你动力的哲理故事。相信本系列图书能为你展现一个美丽新世界并使您的英语学习更上一层楼。
  • 儿子和情人

    儿子和情人

    矿工瓦尔特原本性格开朗,充满活力,后因酗酒而日渐沉沦。妻子格特鲁德失望之余,转而将希望寄托在两个儿子身上,长子威廉又不幸早夭,遂对次子保罗产生了强烈的感情。面对情感变态的母亲,以及两个各有其不同恋爱观的女友,年轻的保罗一时颇感迷惘。
  • 国王和渔夫

    国王和渔夫

    古时候,巴格达城中有位大商人,名叫格尔诺,专门做珠宝生意。由于精通商术,才华超群,他很快受到国王哈里发的重用,成为哈里发在生意场上的代理人,并肩负为国王挑选王妃的重任。一天,格尔诺正在柜台上算账,有位商人带着一个年轻姑娘走了进来。商人开门见山地说明来意,想把姑娘献给国王。格尔诺仔细打量这位姑娘,见她年轻貌美,异常迷人,心里十分满意。
  • 当英语也成为时尚——生活全由你创造

    当英语也成为时尚——生活全由你创造

    本书摘取了若干耐人寻味、震撼人心的哲理美文和励志故事,包括:“成功永远不会太晚”、“假如我又回到童年”、“循序渐进”等。
  • 纳尼亚传奇:狮子、女巫与魔衣柜(双语译林)

    纳尼亚传奇:狮子、女巫与魔衣柜(双语译林)

    《纳尼亚传奇》是英国著名作家刘易斯于1951年至1956年间创作的系列魔幻故事,被公认为20世纪最佳儿童图书之一。在半个世纪里,《纳尼亚传奇》的销售达到8500万册,至今已被翻译成30多种语言文字。在老教授的房子里有许多间屋子,屋子里有许多扇门,但是只有一扇通向另一个世界……纳尼亚。那里流传着一个预言:两个亚当的儿子和两个夏娃的女儿将会现身,击败邪恶的白女巫,结束永恒的寒冬。狮王阿斯兰说:纳尼亚的未来系于他们的勇气。在这里,一种命运即将应验,一段传奇拉开序幕。
热门推荐
  • 雷武战神

    雷武战神

    传承世家平庸嫡子“雷宇”被逼无奈,入驻伽蓝学院,却幸得雷龙逆鳞。从此雷宇修武道,战天穹;灭天尊,驭神龙;以神雷淬体,修无上雷诀!从小人物迅速崛起,登战神云顶!于万域称尊!美女相伴,万强伏首,亿万诸天,唯我至尊!【新书《重生之武法无天》已经上传。一本很独特的玄幻,求鼎力支持!】
  • 妃常执着三世寻夫

    妃常执着三世寻夫

    她只是想好好结个婚,一世新郎莫名其妙被车撞死了?二世,还在热恋期就被两族战争破坏了?三世,凤卿玖好不容易找到了老公,又有小人嫉妒想破坏?那她也就不客气了!用凤卿玖的原话说就是:别逼我,否则我优秀起来一发不可收拾!
  • 陌

    追忆星星年岁,命陌;轮回双生血缘亲陌;破碎生命信仰友陌;轻描拾荒之旅尘世陌;这四出戏剧般的情节点燃了一个坚强女孩的生命,让她活得如此丰盛。人生如戏,还是戏如人生。我们所走过的路皆是如此,却都在当时只道是寻常的惶惑中,迷失,忘记。
  • 寻墓往事

    寻墓往事

    在湘西民间流传着一个说法,说是每每七月,在那个荒山野岭的深处,时常会有白冥幽鬼从老坟堆里钻出,不论人与牲畜,遇其都会嗜其灵魂。而就在那一个雨夜里,一个少女从此就被幽鬼封进了玉棺;至那以后,那个少年便带领着他的伙伴,就走上了一条永看不到尽头的黑路,他手持一柄青铜牛角,日夜追寻着少女的下落……一次偶然的惊现,一场离奇的经历,一回命运的改变。在那个多风多雨的年代,寻墓!还会发生什么。
  • 智覃正禅师语录

    智覃正禅师语录

    本书为公版书,为不受著作权法限制的作家、艺术家及其它人士发布的作品,供广大读者阅读交流。
  • 晓初晨光向我而来

    晓初晨光向我而来

    24世纪伟大的天才女博士林晓初,因为一场车祸,穿越到了一个不知名的古代时空,经历了各种宫斗,宅斗,绿茶,白莲,最终手握两只小球,偶遇白马王子,走上人生赢家。
  • 名今扬

    名今扬

    本文同在贴吧,汤圆创作,九库文学更新。—不曾想,为了一次活命,救了一个人,没想救起的是一只跟屁虫。他绝色妖异,手段狠辣,却对这个偶然救他一命的小家伙,情有独钟。—明明是我先遇到她的……为什么……
  • 生活就像一本故事书

    生活就像一本故事书

    本书从古今中外的众多经典故事中精心选编了300多篇,它们或说理生动,或寓意深刻,或思想犀利,或耐人寻味。通过这些故事来阐述生活中已经发生的、正在发生的、将来还要发生的种种的问题。愿书中的这些故事能给我们的生活带来一抹亮色、一丝快乐,把难言的忧伤变为沉醉的美酒,把午夜的黑暗化为黎明的曙光,让我们的人生之旅变得格外轻松、欢快、达观。
  • 美人皎皎

    美人皎皎

    一张美人图,引来一场血雨腥风?有没有搞错?明皎皎:如果,我是说如果,我能凭一己之力平息这场战乱,是不是能从中捞点什么好处啊?
  • 暗中遇见你

    暗中遇见你

    年轻的我们,在刚刚走出校门的时候,总是会迷失自己的方向。面对爱情、友情,也时常会有意想不到的收获。现实的境地,常常逼着不愿长大的我们,快速的成长,面对世间的繁华,有些人屈服了,有些人更倔强。在灯火通明时,人们往往关注长相、背景、能力、家世,只有在黑暗里,才能用一颗最清澈的真心,去感受聆听最真实的心跳。