releases HotSpotAtHome

Slides:

Advertisements

Similar presentations

Advertisements

CS 61C L13Introduction to MIPS: Instruction Representation I (1) Garcia, Spring 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c.

Chapter 2 Instructions: Language of the Computer Part III.

CS2100 Computer Organisation MIPS Part III: Instruction Formats (AY2014/2015) Semester 2.

1 CS61C L9 Fl. Pt. © UC Regents CS61C - Machine Structures Lecture 9 - Floating Point, Part I September 27, 2000 David Patterson

Inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 08 MIPS Instruction Representation I Lecturer SOE Dan Garcia

CS61C L10 : MIPS Instruction Representation II (1) Pearce, Summer 2010 © UCB inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 10 MIPS.

Integer Multiplication (1/3)

CS61C L14 Introduction to MIPS: Instruction Representation II (1) Garcia © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c.

Inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 13 MIPS Instruction Representation I Shanghai Jiaotong University and Lanxiang.

CS 61C L09 Instruction Format (1) A Carle, Summer 2005 © UCB inst.eecs.berkeley.edu/~cs61c/su05 CS61C : Machine Structures Lecture #9: MIPS Instruction.

CS61C L14 MIPS Instruction Representation II (1) Garcia, Spring 2007 © UCB Lecturer SOE Dan Garcia inst.eecs.berkeley.edu/~cs61c.

CS61C L13 MIPS Instruction Representation I (1) Garcia, Spring 2007 © UCB Lecturer SOE Dan Garcia inst.eecs.berkeley.edu/~cs61c.

Instruction Representation II (1) Fall 2007 Lecture 10: Instruction Representation II.

CS 61C L14Introduction to MIPS: Instruction Representation II (1) Garcia, Fall 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c.

CS 61C L15 Floating Point I (1) Garcia, Spring 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C :

1  2004 Morgan Kaufmann Publishers Chapter Three.

CS61C L11 Floating Point I (1) Chae, Summer 2008 © UCB Albert Chae, Instructor inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures Lecture #11 – Floating.

Chap Instruction Representation Jen-Chang Liu, Spring 2006 Adapted from

CS61C L13 Introduction to MIPS: Instruction Representation I (1) Garcia © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c.

CS 61C L15 Floating Point I (1) Krause, Spring 2005 © UCB This day in history… TA Danny Krause inst.eecs.berkeley.edu/~cs61c-td inst.eecs.berkeley.edu/~cs61c.

Lecture 5 Sept 14 Goals: Chapter 2 continued MIPS assembly language instruction formats translating c into MIPS - examples.

Instruction Representation II (1) Fall 2005 Lecture 10: Instruction Representation II.

CS61C L9 MIPS Logical & Shift Ops, and Instruction Representation I (1) Beamer, Summer 2007 © UCB Scott Beamer, Instructor inst.eecs.berkeley.edu/~cs61c.

CS 61C L15 Floating Point I (1) Garcia, Fall 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C : Machine.

Instruction Representation I (1) Fall 2005 Lecture 05: INSTRUCTION REPRESENTATION.

Inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 13 MIPS Instruction Representation I The National Science Foundation (NSF)

CS 61C L10 Floating Point (1) A Carle, Summer 2005 © UCB inst.eecs.berkeley.edu/~cs61c/su05 CS61C : Machine Structures Lecture #10: Floating Point

CS61C L15 Floating Point I (1) Jacobs, Spring 2008 © UCB TA Ordinaire Dave Jacobs inst.eecs.berkeley.edu/~cs61c CS61C : Machine.

CS61C L15 Floating Point I (1) Garcia, Spring 2007 © UCB Lecturer SOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C : Machine.

CS61C L15 Floating Point I (1) Garcia, Fall 2006 © UCB #16 Cal rolls over OSU  Behind the arm of Nate Longshore’s 341 yds passing & 4 TDs, the Bears roll.

CS61C L14 MIPS Instruction Representation II (1) Garcia, Fall 2006 © UCB Intel: 80 cores/chip in 5 yrs!  At their developer’s forum in SF on Tuesday,

Inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 14 MIPS Instruction Representation II IBM wants to use “self-assembling”

IT 251 Computer Organization and Architecture Introduction to Floating Point Numbers Chia-Chi Teng.

Csci 136 Computer Architecture II – Floating-Point Arithmetic

CEN 316 Computer Organization and Design Computer Arithmetic Floating Point Dr. Mansour AL Zuair.

CS61C L11 Floating Point (1) Pearce, Summer 2010 © UCB inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 11 Floating Point

Computing Systems Basic arithmetic for computers.

CSCI-365 Computer Organization Lecture Note: Some slides and/or pictures in the following are adapted from: Computer Organization and Design, Patterson.

Data Representation in Computer Systems

Floating Point (a brief look) We need a way to represent –numbers with fractions, e.g., –very small numbers, e.g., –very large numbers,

CSC 221 Computer Organization and Assembly Language

Lecture 8. MIPS Instructions #3 – Branch Instructions #1 Prof. Taeweon Suh Computer Science Education Korea University 2010 R&E Computer System Education.

Chapter 10 The Assembly Process. What Assemblers Do Translates assembly language into machine code. Assigns addresses to all symbolic labels (variables.

CSCI-365 Computer Organization Lecture Note: Some slides and/or pictures in the following are adapted from: Computer Organization and Design, Patterson.

FLOATING POINT ARITHMETIC P&H Slides ECE 411 – Spring 2005.

CS 61C L3.2.1 Floating Point 1 (1) K. Meinz, Summer 2004 © UCB CS61C : Machine Structures Lecture Floating Point Kurt Meinz inst.eecs.berkeley.edu/~cs61c.

CDA 3101 Fall 2013 Introduction to Computer Organization

Computer Organization CS224 Fall 2012 Lessons 7 and 8.

CENG 311 Instruction Representation

1 ELEN 033 Lecture 4 Chapter 4 of Text (COD2E) Chapters 3 and 4 of Goodman and Miller book.

Chapter 2 — Instructions: Language of the Computer — 1 Memory Operands Main memory used for composite data – Arrays, structures, dynamic data To apply.

순천향대학교 정보기술공학부 이 상 정 1 3. Arithmetic for Computers.

Dan Garcia 1 CS 61C: Great Ideas in Computer Architecture MIPS Instruction Representation II.

Instruction Representation. Levels of Representation (abstractions) High Level Language Program (e.g., C) Assembly Language Program (e.g.,MIPS) Machine.

CSCI-365 Computer Organization Lecture Note: Some slides and/or pictures in the following are adapted from: Computer Organization and Design, Patterson.

Instructor: Dr. Mike Turi Department of Computer Science and Computer Engineering Pacific Lutheran University Lecture slides adapted from Part 8, EE 334.

Inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 08 MIPS Instruction Representation I Guest Lecturer Alan Christopher

Inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures Lecture 14 – Introduction to MIPS Instruction Representation II Lecturer PSOE.

There is one handout today at the front and back of the room!

April 2006 Saeid Nooshabadi

Lecturer PSOE Dan Garcia

CS 61C: Great Ideas in Computer Architecture Floating Point Arithmetic

Hello to Robb Neuschwander listening from Louisville, KY!

October 24 Programming problems? Read Section 6.1 for November 5

Instructions and Conditional Execution

CS61C : Machine Structures Lecture 3. 1

Lecturer SOE Dan Garcia

Floating Point Numbers

Lecturer PSOE Dan Garcia

Presentation transcript:

releases HotSpotAtHome inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures Lecture #10 – Instruction Representation II, Floating Point I 2007-7-11 Scott Beamer, Instructor releases HotSpotAtHome Greet class www.sfgate.com

Review… opcode rs rt immediate rd funct shamt R I J target address Logical and Shift Instructions Operate on individual bits (arithmetic operate on entire word) Use to isolate fields, either by masking or by shifting back & forth Use shift left logical, sll,for multiplication by powers of 2 Use shift right arithmetic, sra,for division by powers of 2 Simplifying MIPS: Define instructions to be same size as data word (one word) so that they can use the same memory (compiler can use lw and sw). Computer actually stores programs as a series of these 32-bit numbers. MIPS Machine Language Instruction: 32 bits representing a single instruction opcode rs rt immediate rd funct shamt R I J target address

Problem 0: Unsigned # sign-extended? I-Format Problems (0/3) Problem 0: Unsigned # sign-extended? addiu, sltiu, sign-extends immediates to 32 bits. Thus, # is a “signed” integer. Rationale addiu so that can add w/out overflow See K&R pp. 230, 305 sltiu suffers so that we can have ez HW Does this mean we’ll get wrong answers? Nope, it means assembler has to handle any unsigned immediate 215 ≤ n < 216 (I.e., with a 1 in the 15th bit and 0s in the upper 2 bytes) as it does for numbers that are too large. 

I-Format Problems (1/3) Problem 1: Chances are that addi, lw, sw and slti will use immediates small enough to fit in the immediate field. …but what if it’s too big? We need a way to deal with a 32-bit immediate in any I-format instruction.

I-Format Problems (2/3) Solution to Problem 1: New instruction: Handle it in software + new instruction Don’t change the current instructions: instead, add a new instruction to help out New instruction: lui register, immediate stands for Load Upper Immediate takes 16-bit immediate and puts these bits in the upper half (high order half) of the specified register sets lower half to 0s

Solution to Problem 1 (continued): I-Format Problems (3/3) Solution to Problem 1 (continued): So how does lui help us? Example: addi $t0,$t0, 0xABABCDCD becomes: lui $at, 0xABAB ori $at, $at, 0xCDCD add $t0,$t0,$at Now each I-format instruction has only a 16-bit immediate. Wouldn’t it be nice if the assembler would this for us automatically? (later)

Branches: PC-Relative Addressing (1/5) Use I-Format opcode rs rt immediate opcode specifies beq v. bne rs and rt specify registers to compare What can immediate specify? Immediate is only 16 bits PC (Program Counter) has byte address of current instruction being executed; 32-bit pointer to memory So immediate cannot specify entire address to branch to.

Branches: PC-Relative Addressing (2/5) How do we usually use branches? Answer: if-else, while, for Loops are generally small: typically up to 50 instructions Function calls and unconditional jumps are done using jump instructions (j and jal), not the branches. Conclusion: may want to branch to anywhere in memory, but a branch often changes PC by a small amount

Branches: PC-Relative Addressing (3/5) Solution to branches in a 32-bit instruction: PC-Relative Addressing Let the 16-bit immediate field be a signed two’s complement integer to be added to the PC if we take the branch. Now we can branch ± 215 bytes from the PC, which should be enough to cover almost any loop. Any ideas to further optimize this?

Branches: PC-Relative Addressing (4/5) Note: Instructions are words, so they’re word aligned (byte address is always a multiple of 4, which means it ends with 00 in binary). So the number of bytes to add to the PC will always be a multiple of 4. So specify the immediate in words. Now, we can branch ± 215 words from the PC (or ± 217 bytes), so we can handle loops 4 times as large.

Branches: PC-Relative Addressing (5/5) Branch Calculation: If we don’t take the branch: PC = PC + 4 PC+4 = byte address of next instruction If we do take the branch: PC = (PC + 4) + (immediate * 4) Observations Immediate field specifies the number of words to jump, which is simply the number of instructions to jump. Immediate field can be positive or negative. Due to hardware, add immediate to (PC+4), not to PC; will be clearer why later in course

beq branch is I-Format: Branch Example (1/3) MIPS Code: Loop: beq $9,$0,End add $8,$8,$10 addi $9,$9,-1 j Loop End: beq branch is I-Format: opcode = 4 (look up in table) rs = 9 (first operand) rt = 0 (second operand) immediate = ???

Branch Example (2/3) MIPS Code: Immediate Field: Loop: beq $9,$0,End addi $8,$8,$10 addi $9,$9,-1 j Loop End: Immediate Field: Number of instructions to add to (or subtract from) the PC, starting at the instruction following the branch. In beq case, immediate = 3

Branch Example (3/3) MIPS Code: Loop: beq $9,$0,End addi $8,$8,$10 addi $9,$9,-1 j Loop End: decimal representation: 4 9 3 binary representation: 000100 01001 00000 0000000000000011

Questions on PC-addressing Does the value in branch field change if we move the code? What do we do if destination is > 215 instructions away from branch? Since it’s limited to ± 215 instructions, doesn’t this generate lots of extra MIPS instructions? Why do we need all these addressing modes? Why not just one?

Administrivia Any questions on course issues?

Section 1: The Core Instruction Set Green Sheet Errors Section 1: The Core Instruction Set lb, lbu, lw scratch out 0/ sll, srl shift rt not rs so change R[rs] to R[rt] jal should be R[31] = PC + 8, not +4 Section 2: Register Name, Number, Use, Call Convention $ra is not preserved across calls so make yes a no

J-Format Instructions (1/5) For branches, we assumed that we won’t want to branch too far, so we can specify change in PC. For general jumps (j and jal), we may jump to anywhere in memory. Ideally, we could specify a 32-bit memory address to jump to. Unfortunately, we can’t fit both a 6-bit opcode and a 32-bit address into a single 32-bit word, so we compromise.

J-Format Instructions (2/5) Define “fields” of the following number of bits each: 6 bits 26 bits As usual, each field has a name: opcode target address Key Concepts Keep opcode field identical to R-format and I-format for consistency. Combine all other fields to make room for large target address.

J-Format Instructions (3/5) For now, we can specify 26 bits of the 32-bit bit address. Optimization: Note that, just like with branches, jumps will only jump to word aligned addresses, so last two bits are always 00 (in binary). So let’s just take this for granted and not even specify them.

J-Format Instructions (4/5) Now specify 28 bits of a 32-bit address Where do we get the other 4 bits? By definition, take the 4 highest order bits from the PC. Technically, this means that we cannot jump to anywhere in memory, but it’s adequate 99.9999…% of the time, since programs aren’t that long only if straddle a 256 MB boundary If we absolutely need to specify a 32-bit address, we can always put it in a register and use the jr instruction.

J-Format Instructions (5/5) Summary: New PC = { PC[31..28], target address, 00 } Understand where each part came from! Note: { , , } means concatenation { 4 bits , 26 bits , 2 bits } = 32 bit address { 1010, 11111111111111111111111111, 00 } = 10101111111111111111111111111100 Note: Book uses ||

Peer Instruction Question ABC 1: FFF 2: FFT 3: FTF 4: FTT 5: TFF 6: TFT 7: TTF 8: TTT (for A,B) When combining two C files into one executable, recall we can compile them independently & then merge them together. Jump insts don’t require any changes. Branch insts don’t require any changes. You now have all the tools to be able to “decompile” a stream of 1s and 0s into C! Answer: A: False B: True (it’s all relative) C: True

Branches use PC-relative addressing, Jumps use absolute addressing. In semi-conclusion… MIPS Machine Language Instruction: 32 bits representing a single instruction Branches use PC-relative addressing, Jumps use absolute addressing. Disassembly is simple and starts by decoding opcode field. (more in a week) opcode rs rt immediate rd funct shamt R I J target address

James Gosling Sun Fellow Java Inventor 1998-02-28 Quote of the day “95% of the folks out there are completely clueless about floating-point.” James Gosling Sun Fellow Java Inventor 1998-02-28

Computers are made to deal with numbers Review of Numbers Computers are made to deal with numbers What can we represent in N bits? Unsigned integers: 0 to 2N - 1 Signed Integers (Two’s Complement) -2(N-1) to 2(N-1) - 1

What about other numbers? Very large numbers? (seconds/century) 3,155,760,00010 (3.1557610 x 109) Very small numbers? (atomic diameter) 0.0000000110 (1.010 x 10-8) Rationals (repeating pattern) 2/3 (0.666666666. . .) Irrationals 21/2 (1.414213562373. . .) Transcendentals e (2.718...),  (3.141...) All represented in scientific notation

Scientific Notation (in Decimal) mantissa exponent 6.0210 x 1023 decimal point radix (base) Normalized form: no leadings 0s (exactly one digit to left of decimal point) Alternatives to representing 1/1,000,000,000 Normalized: 1.0 x 10-9 Not normalized: 0.1 x 10-8,10.0 x 10-10

Scientific Notation (in Binary) mantissa exponent 1.0two x 2-1 “binary point” radix (base) Computer arithmetic that supports it called floating point, because it represents numbers where the binary point is not fixed, as it is for integers Declare such variable in C as float

Floating Point Representation (1/2) Normal format: +1.xxxxxxxxxxtwo*2yyyytwo Multiple of Word Size (32 bits) 31 S Exponent 30 23 22 Significand 1 bit 8 bits 23 bits S represents Sign Exponent represents y’s Significand represents x’s Represent numbers as small as 2.0 x 10-38 to as large as 2.0 x 1038

Floating Point Representation (2/2) What if result too large? (> 2.0x1038 ) Overflow! Overflow  Exponent larger than represented in 8-bit Exponent field What if result too small? (>0, < 2.0x10-38 ) Underflow! Underflow  Negative exponent larger than represented in 8-bit Exponent field How to reduce chances of overflow or underflow? overflow underflow overflow 2x10-38 2x1038 1 -1 -2x10-38 -2x1038

Double Precision Fl. Pt. Representation Next Multiple of Word Size (64 bits) 31 S Exponent 30 20 19 Significand 1 bit 11 bits 20 bits Significand (cont’d) 32 bits Double Precision (vs. Single Precision) C variable declared as double Represent numbers almost as small as 2.0 x 10-308 to almost as large as 2.0 x 10308 But primary advantage is greater accuracy due to larger significand

QUAD Precision Fl. Pt. Representation Next Multiple of Word Size (128 bits) Unbelievable range of numbers Unbelievable precision (accuracy) This is currently being worked on The current version has 15 bits for the exponent and 112 bits for the significand Oct-Precision? It’s been implemented before… (256 bit) Half-Precision? Yep, that’s for a short (16 bit)

IEEE 754 Floating Point Standard (1/4) Single Precision, DP similar Sign bit: 1 means negative 0 means positive Significand: To pack more bits, leading 1 implicit for normalized numbers 1 + 23 bits single, 1 + 52 bits double always true: 0 < Significand < 1 (for normalized numbers) Note: 0 has no leading 1, so reserve exponent value 0 just for number 0

IEEE 754 Floating Point Standard (2/4) Kahan wanted FP numbers to be used even if no FP hardware; e.g., sort records with FP numbers using integer compares Could break FP number into 3 parts: compare signs, then compare exponents, then compare significands Wanted it to be faster, single compare if possible, especially if positive numbers Then want order: Highest order bit is sign ( negative < positive) Exponent next, so big exponent => bigger # Significand last: exponents same => bigger #

IEEE 754 Floating Point Standard (3/4) Negative Exponent? 2’s comp? 1.0 x 2-1 v. 1.0 x2+1 (1/2 v. 2) 1111 1111 000 0000 0000 0000 0000 0000 1/2 0000 0001 000 0000 0000 0000 0000 0000 2 This notation using integer compare of 1/2 v. 2 makes 1/2 > 2! Instead, pick notation 0000 0001 is most negative, and 1111 1111 is most positive 1.0 x 2-1 v. 1.0 x2+1 (1/2 v. 2) 1/2 0111 1110 000 0000 0000 0000 0000 0000 1000 0000 2

IEEE 754 Floating Point Standard (4/4) Called Biased Notation, where bias is number subtract to get real number IEEE 754 uses bias of 127 for single prec. Subtract 127 from Exponent field to get actual value for exponent 1023 is bias for double precision Summary (single precision): 31 S Exponent 30 23 22 Significand 1 bit 8 bits 23 bits (-1)S x (1 + Significand) x 2(Exponent-127) Double precision identical, except with exponent bias of 1023

What is the decimal equivalent of the floating pt # above? Peer Instruction 1 1000 0001 111 0000 0000 0000 0000 0000 1: -1.75 2: -3.5 3: -3.75 4: -7 5: -7.5 6: -15 7: -7 * 2^129 8: -129 * 2^7 What is the decimal equivalent of the floating pt # above?

Peer Instruction Answer What is the decimal equivalent of: 1 1000 0001 111 0000 0000 0000 0000 0000 S Exponent Significand (-1)S x (1 + Significand) x 2(Exponent-127) (-1)1 x (1 + .111) x 2(129-127) -1 x (1.111) x 2(2) -111.1 1: -1.75 2: -3.5 3: -3.75 4: -7 5: -7.5 6: -15 7: -7 * 2^129 8: -129 * 2^7 -7.5

Floating Point numbers approximate values that we want to use. “And in conclusion…” Floating Point numbers approximate values that we want to use. IEEE 754 Floating Point Standard is most widely accepted attempt to standardize interpretation of such numbers Every desktop or server computer sold since ~1997 follows these conventions Summary (single precision): 31 S Exponent 30 23 22 Significand 1 bit 8 bits 23 bits (-1)S x (1 + Significand) x 2(Exponent-127) Double precision identical, bias of 1023