Presentation on theme: "The complexity and correctness of algorithms (with binary trees as an example)"— Presentation transcript:
The complexity and correctness of algorithms (with binary trees as an example)
2 The word ‘algorithm’ does not derive, as classically- trained algorithmophobes might guess, from the Greek root algos, meaning ‘pain’. Rather, it is a corruption of the name of the 9th century Persian mathematician Abu Abd Allah Muhammad ibn Musa al-Khwarizmi, which literally translates as “Mohammad, father of Adbdulla, son of Moses, the Kwarizmian”. (Kwarizm is an ancient city located in what is now the Xorazm Province of Uzbekistan.) Al-Khwarizmi is perhaps best known as the writer of the treatise Kitab al-jabr wa’l-Muqˆabala, from which the modern word algebra derives. The word algorithm is a corruption of the older word algorism (by false connection to the Greek arithmos, meaning ‘number’, and the aforementioned), used to describe the modern decimal system for writing and manipulating numbers—in particular, the use of a small circle or sifr to represent a missing quantity— which al Khwarizmı brought into Persia from India.
3 An algorithm is an unambiguous sequence of simple, mechanically executable instructions.
4 Trees A tree is a data structure used to represent different kinds of data and help solve a number of algorithmic problems Game trees (i.e., chess ), UNIX directory trees, sorting trees etc. We will study at least two kinds of trees: Binary Search Trees and Red-Black Trees.
5 Trees Trees have nodes. They also have edges that connect the nodes. Between two nodes there is always only one path. Tree nodes Tree edges
6 Trees Trees are rooted. Once the root is defined (by the user) all nodes have a specific level. Trees have internal nodes and leaves. Every node (except the root) has a parent and it also has zero or more children. level 0 level 1 level 2 level 3 root internal nodes leaves parent and child
7 Binary trees A binary tree is a tree such that each node has at most 2 children.
8 Binary trees Recursive definition (important): 1.An empty tree is a binary tree 2.A node with two child sub-trees is a binary tree 3.[often implicit] Only what you get from 1 by a finite number of applications of 2 is a binary tree. 56 26200 18 28 190 213 12 24 27 Values stored at tree nodes are called keys.
9 Binary search trees A binary search tree (BST) is a binary tree with the following properties: –The key of a node is always greater than the keys of the nodes in its left subtree. –The key of a node is always smaller than the keys of the nodes in its right subtree.
10 Binary search trees Stored keys must satisfy the binary search tree property. – y in left subtree of x, then key[y] key[x]. – y in right subtree of x, then key[y] key[x]. 56 26200 18 28 190 213 12 24 27
11 Binary search trees: examples C AD 14 10 1511818 16 14 10 15118 16 root
12 Binary search trees Data structures that can support dynamic set operations. –Search, minimum, maximum, predecessor, successor, insert, and delete. Can be used to build –Dictionaries. –Priority Queues.
13 BST – Representation Represented by a linked data structure of nodes. root(T) points to the root of tree T. Each node contains fields: –key –left – pointer to left child: root of left subtree. –right – pointer to right child : root of right subtree. –p – pointer to parent; p[root[T]] = NIL (optional).
14 Inorder traversal of a BST Inorder-Tree-Walk (p) if p NIL then Inorder-Tree-Walk(left[p]) print key[x] Inorder-Tree-Walk(right[p]) Can you prove the correctness on in-order traversal? How long does an in-order walk take? The binary search tree property allows the keys of a binary search tree to be printed, in (monotonically increasing) order, recursively. 56 26200 18 28 190 213 12 24 27
15 Correctness of inorder traversal Must prove that it prints all elements, in order, and that it terminates. By induction on size of tree. Size=0: Easy. Size >1: –Prints left subtree in order by induction. –Prints root, which comes after all elements in left subtree (still in order). –Prints right subtree in order (all elements come after root, so still in order).
16 Notice how we used the recursive definition of a tree in our inductive proof. We exploit the recursive structure of a tree, and this approach - which is general to all recursive definitions, and not restricted to trees - is called structural induction.
17 Searching in a BST Tree-Search(x, k) if x = NIL or k = key[x] then return x if k < key[x] then return Tree-Search(left[x], k) else return Tree-Search(right[x], k) 56 26200 18 28 190 213 12 24 27 Time complexity is proportional with h is the height of the tree. Height 4 Height 3 Height 2 Height 1
18 Inserting an element in a BST Tree-Insert(Tree T, int z) y NIL x root[T] while x NIL do y x if key[z] < key[x] then x left[x] else x right[x] p[z] y if y = NIL then root[t] z else if key[z] < key[y] then left[y] z else right[y] z Change the dynamic set represented by a BST. Ensure the binary search tree property holds after change. Insertion is easier than deletion. 56 26200 18 28 190 213 12 24 27
19 Analysis of Insertion Initialization: constant time While loop in lines 3-7 searches for place to insert z, maintaining parent y. This takes time proportional with h Lines 8-13 insert the value: constant time TOTAL: C 1 + C 2 + C 3 *h Tree-Insert(T, z) y NIL x root[T] while x NIL do y x if key[z] < key[x] then x left[x] else x right[x] p[z] y if y = NIL then root[t] z else if key[z] < key[y] then left[y] z else right[y] z
20 Deletion Goal: Delete a given node z from a binary search tree. We need to consider several cases. –Case 1: z has no children. Delete z by making the parent of z point to NIL, instead of to z. 15 16 20 1823 6 5 12 3 7 10 15 16 20 1823 6 5 12 3 7 10 13 delete z
21 Deletion Case 2: z has one child. –Delete z by making the parent of z point to z’s child, instead of to z. –Update the parent of z’s child to be z’s parent. 15 16 20 1823 6 5 12 3 7 1013 delete 15 20 1823 6 5 12 3 7 10 z
22 Deletion Case 3: z has two children. –z’s successor (y) is the minimum node in z’s right subtree. –y has either no children or one right child (but no left child). Why? –Delete y from the tree (via Case 1 or 2). –Replace z’s key and satellite data with y’s. 15 16 20 1823 6 5 12 3 7 1013 delete z y 15 16 20 1823 7 6 12 3 1013 6
23 Successor node The successor of node x is the node y such that key[y] is the smallest key greater than key[x]. The successor of the largest key is NIL. Search for a successor consists of two cases. –If node x has a non-empty right subtree, then x’s successor is the minimum in the right subtree of x. –If node x has an empty right subtree, then: As long as we move to the left up the tree (move up through right children), we are visiting smaller keys. x’s successor y is the node that x is the predecessor of (x is the maximum in y’s left subtree). In other words, x’s successor y, is the lowest ancestor of x whose left child is also an ancestor of x. We can define the predecessor node similarly.
24 Exercise: Sorting using BSTs Sort (A) for i 1 to n do tree-insert(A[i]) inorder-tree-walk(root) –What are the worst case and best case running times? –In practice, how would this compare to other sorting algorithms?
25 Wrap-up Determining the complexity of algorithms is usually the first step in obtaining better algorithms. –Or in realizing we cannot do any better. What should you know? –Inductive proofs on trees. –Binary search trees and operations on BSTs. –Height of trees and how it influences efficiency.