Homework 5 // Due at Lecture Mon Apr 3
Problem 1 (15 points)
The simple, bus-based multiprocessor illustrated below represents a
commonly-implemented symmetric shared-memory architecture. Each processor has a
single, private cache with coherence maintained using the snooping coherence
protocol of Figure 4.7. Each cache is direct-mapped, with four blocks each
holding two words. To simplify the illustration, the cache address tag contains
the full address and each word shows only two hex characters, with the least
significant word on the right. The coherence states are denoted M, S, and I for
Modified, Shared, and Invalid.

For each subproblem below, assume the initial cache and memory state as
illustrated in the figure.
Each subproblem specifies a sequence of one or
more CPU operations of the form:
P#: <op> <address> [ <-- <value> ]
where P# designates the CPU (e.g., P0), <op> is the CPU operation (e.g., read
or write), <address> denotes the memory address, and <value> indicates the new
word to be assigned on a write operation.
What is the final state (i.e., coherence state, tags, and data) of the caches
and memory after the given sequence of CPU operations has completed? Show only
the blocks that change, e.g., P0.B0: (I, 120, 00 01) indicates that CPU P0's
block B0 has the final state of I, tag of 120, and data words 00 and 01. Also,
what value is returned by each read operation?
- P15: read 118
- P15: write 118 <-- 80
- P0: read 128
- P1: write 108 <-- 80
- P0: write 108 <-- 80
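The bus transactions implied by such a sequence can be traced with a minimal sketch of the MSI snooping state machine (states only; the tags, block geometry, and initial contents from the figure are not modeled here, and the helper names are made up for illustration):

```python
# Minimal MSI snooping sketch: one coherence state per block address,
# per cache. States: 'M' (Modified), 'S' (Shared), 'I' (Invalid).

class Cache:
    def __init__(self):
        self.state = {}                      # block address -> 'M' | 'S' | 'I'

    def get(self, block):
        return self.state.get(block, 'I')    # unseen blocks start Invalid

def read(caches, p, block):
    """Processor p reads a block; returns its resulting coherence state."""
    c = caches[p]
    if c.get(block) == 'I':                  # read miss -> bus read
        for i, other in enumerate(caches):
            if i != p and other.get(block) == 'M':
                other.state[block] = 'S'     # owner supplies data, downgrades
        c.state[block] = 'S'
    return c.get(block)

def write(caches, p, block):
    """Processor p writes a block; all other copies are invalidated."""
    c = caches[p]
    if c.get(block) != 'M':                  # miss or upgrade -> bus invalidate
        for i, other in enumerate(caches):
            if i != p:
                other.state[block] = 'I'
        c.state[block] = 'M'

caches = [Cache() for _ in range(2)]
read(caches, 0, 0x108)     # P0 reads 108: P0 holds it Shared
write(caches, 1, 0x108)    # P1 writes 108: P1 Modified, P0 invalidated
```

This captures only the state transitions of the Figure 4.7 protocol; the subproblems additionally require tracking tags, data words, and writebacks to memory.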
Problem 2 (4 points)
In SMT processors, at each cycle, the processor must select which thread(s) to issue instruction(s) from.
- What properties should a good thread selection policy have?
- Describe how the "ICOUNT" policy may perform with respect to the properties you listed above.
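As a reminder of the policy being asked about, ICOUNT favors the thread with the fewest instructions in the front-end pipeline stages; a one-line sketch (the dictionary layout is an illustrative assumption):

```python
# ICOUNT sketch: pick the thread with the fewest in-flight instructions
# in the decode/rename/issue-queue stages.

def icount_select(in_flight):
    """in_flight maps thread_id -> count of front-end instructions."""
    return min(in_flight, key=in_flight.get)

# Thread 1 has the fewest queued instructions, so it is selected.
icount_select({0: 7, 1: 3, 2: 5})
```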
Problem 3 (4 points)
In current FLASH drive technology, data must be erased before new data can be written.
- How does erasing work in a FLASH drive?
- Why does erasing make FLASH drives different from conventional magnetic disks?
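The erase-before-write constraint can be modeled with a small sketch (the class and block geometry are made up for illustration; real NAND erases in multi-page blocks while writes program individual pages):

```python
# Illustrative FLASH model: a page can be programmed only once after an
# erase, and the erase granularity is the entire block of pages.

PAGES_PER_BLOCK = 64

class FlashBlock:
    def __init__(self):
        self.written = [False] * PAGES_PER_BLOCK

    def write(self, page):
        if self.written[page]:
            # Overwriting in place is not allowed, unlike a magnetic disk.
            raise ValueError("page must be erased before rewriting")
        self.written[page] = True

    def erase(self):
        # Erase clears every page in the block at once.
        self.written = [False] * PAGES_PER_BLOCK
```

This asymmetry (small writes, large erases) is what motivates wear leveling and out-of-place updates in FLASH drives, in contrast to in-place sector rewrites on magnetic disks.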
Problem 4 (5 points)
Describe the evolution of the GPU pipeline over the last decade.