Presentation is loading. Please wait.

Presentation is loading. Please wait.

PMLAB, IECS, FCU Designing Efficient Matrix Transposition on Various Interconnection Networks Using Tensor Product Formulation Presented by Chin-Yi Tsai.

Similar presentations


Presentation on theme: "PMLAB, IECS, FCU Designing Efficient Matrix Transposition on Various Interconnection Networks Using Tensor Product Formulation Presented by Chin-Yi Tsai."— Presentation transcript:

1 PMLAB, IECS, FCU Designing Efficient Matrix Transposition on Various Interconnection Networks Using Tensor Product Formulation Presented by Chin-Yi Tsai

2 PMLAB, IECS, FCU 2 Outline Introduction Tensor Product Notation Matrix Transposition Designing Matrix Transposition on Various Interconnection Networks Conclusions and Future Work

3 PMLAB, IECS, FCU 3 Introduction Matrix transposition is a simple, but an important computational problem. A matrix is a two-dimensional data structure which is stored in a one-dimensional computer memory. A simple double-loop transposition program will perform poorly in modern computer architecture with memory hierarchy.

4 PMLAB, IECS, FCU 4 Introduction (cont ’ d) We develop matrix transposition algorithms on various interconnection networks, including omega, baseline and hypercube networks. Tensor product has been successfully used for designing block recursive algorithm, such as FFT, Strassen ’ s matrix multiplication, parallel prefix algorithm, Hilbert space-filling curve, and Karatsuba ’ s multiplication. Tensor product formulas are also suitable for specifying interconnection networks.

5 PMLAB, IECS, FCU 5 Introduction (cont ’ d) Different interconnection networks have their own architectural characteristics and properties. Distributed-memory algorithms and VLSI circuit design. A major goal of this study is to provide an effective way for designing VLSI circuits of DSP algorithms.

6 PMLAB, IECS, FCU 6 Tensor Product Notation Let A and B be two matrices of size and, respectively Stride permutation

7 PMLAB, IECS, FCU 7 Matrix Transposition Matrix transposition can be viewed as changing the elements from the row-major order to column- major order. Matrix A is stored in a computer memory, the index scheme of element : –Row-major order –Column-major order Various matrix transposition algorithms can be designed by manipulating stride permutation:

8 PMLAB, IECS, FCU 8 Matrix Transposition (cont ’ d) Step1: blocks with qs elements of each block Step2: perform transposition of matrix for pr blocks Step3: transpose a block matrix with each block of qs elements Step4: convert a block structure order of blocks with qs elements of each blcok to the row- major order of the transposed matrix

9 PMLAB, IECS, FCU 9 Designing Matrix Transposition on Various Interconnection Networks We consider two kinds of networks: –multistage interconnection network, –direct interconnection network. The basic component of multistage interconnection network is a switching element. A direct interconnection network is a set of processors connected by a set of links. x0x0 x1x1 y1y1 y0y0 x0x0 x1x1 y1y1 y0y0

10 PMLAB, IECS, FCU 10 Designing Matrix Transposition on Various Interconnection Networks Suppose that N=2 n, Omega network Baseline network Hypercube network

11 PMLAB, IECS, FCU 11

12 PMLAB, IECS, FCU 12

13 PMLAB, IECS, FCU 13 04 6 5 1 2 7 3 812 14 13 9 10 15 11 01 5 3 2 4 7 6 89 13 11 10 12 15 14 0123

14 PMLAB, IECS, FCU 14 Deviation of Algorithm on Omega Interconnection Network

15 PMLAB, IECS, FCU 15 Omega Interconnection Network

16 PMLAB, IECS, FCU 16 Deviation of Algorithm on Baseline Interconnection Network Bit-reversal operation Partial bit-reversal operation

17 PMLAB, IECS, FCU 17 Baseline Interconnection Network

18 PMLAB, IECS, FCU 18 Hypercube Interconnection Network 0 23 1 01 3 2 0 23 1 0 3 1 2 0 23 1 0 3 1 2

19 PMLAB, IECS, FCU 19 Deviation of Algorithm on Hypercube Interconnection Network

20 PMLAB, IECS, FCU 20 01 5 3 2 4 7 6 89 13 11 10 12 15 14 Hypercube Interconnection Network (cont ’ d) 01 3 5 4 2 7 6 89 11 13 12 10 15 14 04 6 5 1 2 7 3 812 14 13 9 10 15 11 04 12 5 1 8 13 9 26 14 7 3 10 15 11 04 5 12 8 1 13 9 26 7 14 10 3 15 11

21 PMLAB, IECS, FCU 21 Conclusions and Future Work We use tensor product as the framework to design matrix transposition algorithms on various interconnection networks. To manipulate stride permutation operations to fit into networks. VLSI circuit design for DSP and image processing algorithms on various interconnection networks.


Download ppt "PMLAB, IECS, FCU Designing Efficient Matrix Transposition on Various Interconnection Networks Using Tensor Product Formulation Presented by Chin-Yi Tsai."

Similar presentations


Ads by Google