Presentation is loading. Please wait.

Presentation is loading. Please wait.

5 Creating the Physical Model. Designing the Physical Model Phase IV: Defining the physical model.

Similar presentations


Presentation on theme: "5 Creating the Physical Model. Designing the Physical Model Phase IV: Defining the physical model."— Presentation transcript:

1 5 Creating the Physical Model

2 Designing the Physical Model Phase IV: Defining the physical model

3 Database Object Naming Conventions Keep the logical and physical names similar and descriptive. Capitalize table and attribute names. Use underscores instead of spaces to delineate separate words in an object’s name. Use a suffix of _PK to indicate primary keys. Use a suffix of _ID to indicate production keys. Find a good balance between using very specific and very vague names.

4 Database Object Naming Conventions Develop a reasonable list of abbreviations. List all the objects’ names, and work with the user community to define them. Resolve name disputes. Document your naming standards in the metadata document. Plan for the naming standards to be a living document.

5 Translating the Dimensional Model into a Physical Model Apply the naming standards to the tables and attributes of the dimensional model. List table columns with primary keys listed first. Label primary keys consistently. Identify the format and length of columns. Label unique keys with a (#). Label column optionality with NULL (o) or NOT NULL (*) constraints. Label foreign keys with _FK. Use synonyms for user tables.

6 Physical Model Product *PRODUCT_ID v(11) *PRODUCT_DESC v(125) *PRODUCT_NAME v(35) *CATEGORY_ID v(20) *CATEGORY_DESC v(25) *SUPPLIER_ID v(20) *PRODUCT_STATUS v(10) *LIST_PRICE n *CATALOG_ID v(20) *PRODCUT_TYPEv(20) *PRODUCT_CODE v(10) *PROMOTION_CODE v(10) *WHSE_LOCATION v(10) *VALID_FROM_DATE d *VALID_TO_DATE d # *Product _PK n # *Channel_PK n # *Promotion_PK n

7 Defining the Hardware Transforming the base dimensional data model into the physical model includes some of the following: Defining naming and database standards Performing an initial sizing Designing tablespaces Defining an initial indexing strategy Using partitioning to split table and index data into smaller, more manageable chunks Determining where to place database objects on disk (RAID, striping, disk mapping) Using parallel processing

8 Architectural Requirements Scalability Manageability Availability Extensibility Integrated Accessibility Reliability Flexibility User Budget Business Technology

9 Architecture Characteristics Robust Available Reliable Extensible Scalable Supportable Recoverable Parallel VLM (very large memory) 64-bit Connective Open

10 Hardware Requirements SMP (Symmetric multiprocessing) Cluster and MPP (massively parallel processing) Hybrids using SMP and MPP

11 Evaluation Criteria Determine the platform for your needs: SMP Clusters MPP Scalability Maturity Low High Low High

12 Application Database Operating system Hardware Parallel Processing Parallel daily operations Shared resources –Memory –Disk –Nothing Loosely or tightly coupled

13 Requirements differ from operational systems Benchmark –Available from vendors –Develop your own –Use realistic queries Scalability important Making the Right Choice

14 Shared disks Common bus CPU Shared memory Symmetric Multiprocessing (SMP) Communication by shared memory Disk controllers accessible to all CPUs Proven technology

15 Benefits: High concurrency Workload balancing Moderate scalability Easy administration Limitations: Memory (cluster for improvements) Bandwidth CPU Shared memory SMP

16 Clusters Node 1 Node 2Node 3 Common high-speed bus Shared disks Common high-speed bus Shared memory CPU Shared memory CPU Shared memory CPU

17 Clusters Shared disk, loosely coupled Dedicated memory High-speed bus Shared resources SMP node

18 Massively Parallel Processing (MPP) CPU Memory CPU Memory CPU Memory CPU Disk

19 MPP n Cube Arrangements A shared nothing architecture Many nodes Fast access Exclusive memory on a node Low cost per node Scalable n CUBE configuration

20 MPP Benefits Unlimited incremental growth Very scalable Fast access Low cost per node Good for DSS

21 MPP Limitations Rigid partitioning Cache consistency Restricted disk access High memory cost per node High management burden Careful data placement

22 Architectural Tiers Tiered structures: Modular Logical separation Distributed structures: Two-tier Three-tier Four-tier (and more) DB server Apps server Workstations Web server Internet

23 Sample System Architecture

24 Gateway Middleware Technologies for integration

25 Database Server Requirements Robust Available Reliable Extensible Scalable Supportable Recoverable Parallel

26 Parallelism Database Query Load Index Sort Backup Recovery

27 Further Considerations Optimization strategy Partitioning strategy Summarization strategy Indexing techniques Hardware and software scalability Availability Administration

28 Processor 1 Elapsed time Not parallel Processor 2 Processor 1 Processor 4 Processor 3 Parallel Parallel Processing A large task broken into smaller tasks: Concurrent execution One or more processors

29 Processor 2 Processor 1 Processor 4 Processor 3 Parallel Parallel Database Increased speed Improved scalability Performance gains –Availability –Flexibility –More users

30 Parallel Query SQL code split among server processes Query Subquery

31 Feb 98 Mar 98 Order table Jan 98 Parallel Load Bypass SQL processing to speed throughput

32 Parallel Processing Index: reduces the time to create Sort: allocates memory in cache efficiently

33 Parallel Processing Backup: runs simultaneously from any node (online and offline) Recovery: runs simultaneously from redo logs Summaries: uses the CREATE TABLE AS SELECT statement


Download ppt "5 Creating the Physical Model. Designing the Physical Model Phase IV: Defining the physical model."

Similar presentations


Ads by Google