《数据仓库及应用数据仓库无锡.pptx》由会员分享,可在线阅读,更多相关《数据仓库及应用数据仓库无锡.pptx(37页珍藏版)》请在taowenge.com淘文阁网|工程机械CAD图纸|机械工程制图|CAD装配图下载|SolidWorks_CaTia_CAD_UG_PROE_设计图分享下载上搜索。
1、 Data warehouse is a very large database that stores integrated data of one or more business subject areas Data warehouse is built to support data analysis for decision making Integrated customer data warehouse is a necessary step to the success of the business intelligence strategy Data warehouses
2、are also used for many other purposes such as product manufacturing data warehouses Data warehouses become the focal point in the enterprise-wide IT infrastructureWhat is Data Warehouse?第1页/共37页What is Data WarehouseWhat is Data WarehouseA data warehouse is simply a single,complete,and consistent st
3、ore of data obtained from a variety of sources and made available to end users in a way they can understand and use in a business context.第2页/共37页What is Data WarehouseWhat is Data Warehouse第3页/共37页Data Warehouse DefinitionsData Warehouse DefinitionsThe key elements in the definitions:Subject-Orient
4、ed:Presentation as business subjects,not as computer files.Integrated:A single source of information for and about the business.Non-Volatile:Stable information that doesnt change each time an operational process is executed.Time-Variant:Containing a history of the business,as well as current busines
5、s information.Accessible:The primary purpose of a data warehouse is to provide readily accessible information to business people.第4页/共37页Subject-OrientedSubject-Oriented第5页/共37页IntegratedIntegrated第6页/共37页IntegratedIntegrated网管系统财务管理系统市场分析决策系统Telcom DWSystem分析型大客户CRM系统信用度管理客服系统(Call Center 1000/180/
6、112/170)系统其它电信专业网业务系统“九七工程”之营业管理生产调度系统(配线/配号/开通)112故障查修系统资源管理系统170/催缴系统营业系统本地公司/IC卡/磁卡管理系统网上营业系统商务管理层省级计费结算计费帐务系统中心业务管理层网络及网元管理层电信网网元第7页/共37页Non-VolatileNon-Volatile第8页/共37页Time VariantTime Variant第9页/共37页AccessibleAccessible第10页/共37页Data Warehouse Data Warehouse CharacteristicsCharacteristicsData w
7、arehouse separates functions from operational systems.PropertyOperationalData WarehouseResponse TimeSub-second to secondsSeconds to hoursData OperationDMLPrimarily read onlyNature of Data30-60 daysSnapshots over timeData OrganizationApplicationSubject,TimeSizeSmall to largeLarge to very largeData So
8、urcesOperational,InternalOperational,Internal,ExternalActivitiesProcessesAnalysis第11页/共37页Data Warehouse Data Warehouse CharacteristicsCharacteristicsData warehouse serves as a central repository for recording everything about the business for information retrieval.Data is loaded from internal busin
9、ess operational system,and external systems.第12页/共37页Data Warehouse Data Warehouse CharacteristicsCharacteristicsA data warehouse has a fundamental effect on how the users see the data available about the organization,what to do with it and how to use it for decision making.第13页/共37页Data Warehouse D
10、ata Warehouse CharacteristicsCharacteristics第14页/共37页Data Warehouse CharacteristicsData Warehouse CharacteristicsA data warehouse is not a single software or hardware product you purchase to strategic.It is a computing environment where users can find strategic information to make better decisions.I
11、t is a user-centric environment.第15页/共37页Data Warehouse Data Warehouse CharacteristicsCharacteristicsData warehouse is a blend of many different technologies needed for supporting the various functions of a data warehouse environment.These different technologies all work together in a data warehouse
12、 environment.ApplicationAdministrationStorage ManagementAnalysisData ManagementData ModelingData AcquisitionData Warehouse第16页/共37页Enterprise Data WarehouseEnterprise Data WarehouseEnterprise data warehouses are funded on a corporate basis.Enterprise data warehouse covers the entire business(corpora
13、tion),incorporating data from all operational systems.Information is extracted from the operational environment,cleansed,and transformed into a central,integrated enterprise-wide data warehouse environment,so that all the departments and other internal organizations of the corporation can benefit fr
14、om a consistent,integrated source of decision support information.第17页/共37页Data MartData MartData marts are often funded on a departmental basis.Data mart is a collection of data tailored to the DSS processing needs of a particular department.It is a subset of a enterprise data warehouse that has be
15、en customized to fit the needs of a department.Data marts serve users at a specific level,or for a specific department.第18页/共37页Data Warehouse versus Data Data Warehouse versus Data MartMart PropertyData WarehouseData MartScopeEnterpriseDepartmentSubjectsMultipleSingle-subjectData SourceManyFewSize(
16、Typical)TB TBImplement Time Months to yearsMonths第19页/共37页Data MartData Mart第20页/共37页Data MartData MartControl:A department can completely control the data and processing that occurs inside a data mart.Cost:The cost of storage and processing is less,because the data marts machine is smaller than DWs
17、Customization:The data marts data is customized to suit the peculiar needs of the department.第21页/共37页Data MartData Mart第22页/共37页Data MartData Mart第23页/共37页Data MartData MartDependent Data Mart:The source is the data warehouse.The extraction,transformation,and loading process is easy.The data mart i
18、s part of the enterprise plan.Independent Data Mart:The source are operational system external source.The extraction,transformation,and loading process is difficult.The data mart is built to satisfy analytical needs.第24页/共37页Operational Data Store Operational Data Store(ODS)(ODS)Integrate informatio
19、n from the production system.Relieve the production systems reporting and analysis demands.Provide access to current data.第25页/共37页ODSODS第26页/共37页ODSODS ODS looks very much like a data warehouse,such as subject-oriented,and integration.However,the remaining characteristics of an ODS are quite differ
20、ent from a data warehouse:Volatile:An ODS can be updated as a normal part of processing.Current-Values:An ODS typically contains daily,weekly,or even monthly data,but the data ages very quickly.Detailed Data:An ODS contains detailed data only.第27页/共37页Different Classes of the Different Classes of th
21、e ODSODSClass I:A synchronous interface in which a very,very small amount of time lapses between an applications transaction and the reflection of the transaction in the ODS.Class II:If an hour or two passes from the time a transaction is created and interacted in the application environment until t
22、hat transaction is reflected in the ODS.Class III:There may be a time lag between 12 hours and a day as transaction data is collected in the I&T interface.Class IV:The data is fed into the ODS directly from the data warehouse.第28页/共37页Determining the ClassDetermining the ClassSpeed of movement of da
23、ta into the ODSVolume of data that must be movedVolume of data that must be stored in intermediate location during I&T processingUpdate of data and integrity of transaction processingThe time of day the movement needs to occur第29页/共37页Data ArchitectureData WarehouseOperational Data Store ODSOperatio
24、nal Data Store ODSLegacy System Legacy System Legacy System Legacy System Call CenterWebEmailATMSFASupport Operational CRMSupport Analytical CRM第30页/共37页Example:The Content of a Customer ODSIdentificationNameAddressPhoneE-mailPreferencesOpt in/outMediumData sharingTransactionsPurchasesCancellationsR
25、eturnsHH/Company AffiliationEventsComplaintsPre-approvalsInquiriesSales callsCustomer ODSCorporate HierarchyHousehold link第31页/共37页数据仓库系统的体系结构两层架构(Generic Two-Level Architecture)独立型数据集市(Independent Data Mart)依赖型数据集市和操作型数据存储(Dependent Data Mart and Operational Data Store)第32页/共37页两层数据仓库体系结构 第33页/共37页基于独立数据集市的数据仓库体系结构 第34页/共37页基于依赖型数据集市和操作型数据存储(ODS)的数据仓库体系结构 第35页/共37页第36页/共37页感谢您的观看!第37页/共37页