Lecture Administration and visualization: Chapter 3.2 - Data modelling and databases OLTP & OLAP
Số trang: 71
Loại file: pdf
Dung lượng: 3.98 MB
Lượt xem: 13
Lượt tải: 0
Xem trước 8 trang đầu tiên của tài liệu này:
Thông tin tài liệu:
Lecture "Administration and visualization: Chapter 3.2 - Data modelling and databases OLTP & OLAP" provides students with content about: Overview; OLTP vs OLAP; Data warehouse modeling; Data warehouse design; Data warehouse Implementation;... Please refer to the detailed content of the lecture!
Nội dung trích xuất từ tài liệu:
Lecture Administration and visualization: Chapter 3.2 - Data modelling and databases OLTP & OLAP Chapter 3Data modelling and databases OLTP & OLAP 1Outline• Overview• OLTP vs OLAP• Data warehouse modeling• Data warehouse design• Data warehouse Implementation 2Heterogeneous data sources 3Why data integration• To facilitate information access and reuse through a single information access point• Data from different complementing information systems is to be combined to gain a more comprehensive basis to satisfy the need • Improve decision making • Improve customer experience • Increase competitiveness, Streamline operations • Increase productivity • Predict the future 4Data integration challenges• Physical systems • Various hardwares, standards • Distributed deployment • Various data format• Logical structures • Different data models • Different data schemas• Business organization • Data security and privacy • Business rules and requirements • Different administrative zones in the business organization 5Data Warehouse• A single, complete and consistent store of data obtained from a variety of different sources made available to end users in a what they can understand and use in a business context. [Barry Devlin]• A data warehouse is a copy of transaction data specifically structured for query and analysis [Ralph Kimball]• Data from several operational sources (OLTP) are extracted, transformed, and loaded (ETL) into a data warehouseData Warehouse usage• Three kinds of data warehouse applications • Information processing • supports querying, basic statistical analysis, and reporting using crosstabs, tables, charts and graphs • Analytical processing • multidimensional analysis of data warehouse data • supports basic OLAP operations, slice-dice, drilling, pivoting • Data mining • knowledge discovery from hidden patterns • supports associations, constructing analytical models, performing classification and prediction, and presenting the mining results using visualization tools 7Data Warehouse usage Which are our lowest/highest margin customers ? Who are my customers What is the most and what products effective distribution are they buying? channel? What product prom- Which customers -otions have the biggest are most likely to go impact on revenue? to the competition ? What impact will new products/services have on revenue and margins?Advantages• High query performance • But not necessary most current information• Does not interfere with local processing at sources • Complex queries at warehouse • OLTP at information sources 9Characteristics of Data warehouse• Subject-Oriented• Integrated• Time-variant• Non-volatile 10Subject-Oriented• Offer information regarding a theme instead of companies ongoing operations • Subjects can be sales, marketing, distributions, etc. • A data warehouse never focuses on the ongoing operations• Emphasis on modeling and analysis of data for decision making • Provide a simple and concise view around the specific subject by excluding data which not helpful 11Integrated• Constructed by integrating multiple, heterogeneous data sources • Data needs to be stored in the Datawarehouse in a common and universally acceptable manner • This integration helps in effective analysis of data• Data cleaning and data integration techniques are applied. • Ensure consistency in naming conventions, encoding structures, attribute measures, etc. among different data sources • E.g., Hotel price: currency, tax, breakfast covered, etc. • When data is moved to the warehouse, it is converted. 12Time-Variant• The time horizon for the data warehouse is signif ...
Nội dung trích xuất từ tài liệu:
Lecture Administration and visualization: Chapter 3.2 - Data modelling and databases OLTP & OLAP Chapter 3Data modelling and databases OLTP & OLAP 1Outline• Overview• OLTP vs OLAP• Data warehouse modeling• Data warehouse design• Data warehouse Implementation 2Heterogeneous data sources 3Why data integration• To facilitate information access and reuse through a single information access point• Data from different complementing information systems is to be combined to gain a more comprehensive basis to satisfy the need • Improve decision making • Improve customer experience • Increase competitiveness, Streamline operations • Increase productivity • Predict the future 4Data integration challenges• Physical systems • Various hardwares, standards • Distributed deployment • Various data format• Logical structures • Different data models • Different data schemas• Business organization • Data security and privacy • Business rules and requirements • Different administrative zones in the business organization 5Data Warehouse• A single, complete and consistent store of data obtained from a variety of different sources made available to end users in a what they can understand and use in a business context. [Barry Devlin]• A data warehouse is a copy of transaction data specifically structured for query and analysis [Ralph Kimball]• Data from several operational sources (OLTP) are extracted, transformed, and loaded (ETL) into a data warehouseData Warehouse usage• Three kinds of data warehouse applications • Information processing • supports querying, basic statistical analysis, and reporting using crosstabs, tables, charts and graphs • Analytical processing • multidimensional analysis of data warehouse data • supports basic OLAP operations, slice-dice, drilling, pivoting • Data mining • knowledge discovery from hidden patterns • supports associations, constructing analytical models, performing classification and prediction, and presenting the mining results using visualization tools 7Data Warehouse usage Which are our lowest/highest margin customers ? Who are my customers What is the most and what products effective distribution are they buying? channel? What product prom- Which customers -otions have the biggest are most likely to go impact on revenue? to the competition ? What impact will new products/services have on revenue and margins?Advantages• High query performance • But not necessary most current information• Does not interfere with local processing at sources • Complex queries at warehouse • OLTP at information sources 9Characteristics of Data warehouse• Subject-Oriented• Integrated• Time-variant• Non-volatile 10Subject-Oriented• Offer information regarding a theme instead of companies ongoing operations • Subjects can be sales, marketing, distributions, etc. • A data warehouse never focuses on the ongoing operations• Emphasis on modeling and analysis of data for decision making • Provide a simple and concise view around the specific subject by excluding data which not helpful 11Integrated• Constructed by integrating multiple, heterogeneous data sources • Data needs to be stored in the Datawarehouse in a common and universally acceptable manner • This integration helps in effective analysis of data• Data cleaning and data integration techniques are applied. • Ensure consistency in naming conventions, encoding structures, attribute measures, etc. among different data sources • E.g., Hotel price: currency, tax, breakfast covered, etc. • When data is moved to the warehouse, it is converted. 12Time-Variant• The time horizon for the data warehouse is signif ...
Tìm kiếm theo từ khóa liên quan:
Lecture Administration and visualization Administration and visualization Data modelling Databases OLTP & OLAP Data warehouse modeling Data warehouse designTài liệu liên quan:
-
Lecture Administration and visualization: Chapter 5.1 - Exploratory data analysis
83 trang 22 0 0 -
41 trang 18 0 0
-
Lecture Administration and visualization: Chapter 7 - Data visualization charts
72 trang 17 0 0 -
Lecture Administration and visualization: Chapter 6 - Tools for data visualization
33 trang 16 0 0 -
Lecture Administration and visualization: Chapter 8.2 - Interactive visualization
31 trang 16 0 0 -
Lecture Administration and visualization: Chapter 3.3 - Data lake
45 trang 15 0 0 -
Lecture Administration and visualization: Chapter 2.1 - File management
29 trang 15 0 0 -
Lecture Administration and visualization: Chapter 2.2 - Hadoop distributed file system (HDFS)
31 trang 13 0 0 -
Lecture Administration and visualization: Chapter 3.1 - Data modelling and databases
56 trang 13 0 0 -
Lecture Administration and visualization: Chapter 8.1 - Interactive visualization
48 trang 12 0 0