Fourth edition of Computer Architecture: A Quantitative Approach



Merrit Malloy of Morgan Kaufman asked me to pass this along. For more
details and a sample chapter, visit http://www.mkp.com, where the book
is the first thing on the home page. In a previous message she said
that comp.arch readers can write to her for a discount code if you
want to buy the book.

Computer Architecture: A Quantitative Approach, Fourth Edition
by Dave Patterson and John Hennessy
(publishing Sept. 15th, 2006)

Features & Benefits

---------------------------------
Increased coverage on achieving parallelism with multiprocessors.

Case studies of latest technology from industry including the Sun Niagara
Multiprocessor, AMD Opteron, and Pentium 4.

Three review appendices, included in the printed volume, review the basic
and intermediate principles the main text relies upon.

Eight reference appendices, collected on the CD, cover a range of topics
including specific architectures, embedded systems, application specific
processors--some guest authored by subject experts.

Reviews

---------------------------------

"If Neil Armstrong offers to give you a tour of the lunar
module, or Tiger Woods asks you to go play golf with him, you should
do it. When Hennessy and Patterson offer to lead you on a tour of
where computer architecture is going, they call it Computer
Architecture: A Quantitative Approach, 4th Edition. You need
one. Tours leave on the hour." --Robert Colwell, Intel lead designer

---------------------------------

The era of seemingly unlimited growth in processor performance is
over: single chip architectures can no longer overcome the performance
limitations imposed by the power they consume and the heat they
generate. Today, Intel and other semiconductor firms are abandoning
the single fast processor model in favor of multi-core
microprocessors--chips that combine two or more processors in a single
package. In the fourth edition of Computer Architecture, the authors
focus on this historic shift, increasing their coverage of
multiprocessors and exploring the most effective ways of achieving
parallelism as the key to unlocking the power of multiple processor
architectures. Additionally, the new edition has expanded and updated
coverage of design topics beyond processor performance, including
power, reliability, availability, and dependability.

Contents

---------------------------------
1 Fundamentals of Computer Design
1.1 Introduction
1.2 The Changing Face of Computing and the Task of the Computer Designer
1.3 Technology Trends
1.4 Power in Integrated Circuits
1.5 Trends in Cost
1.6 Reliability, Availability and Dependability
1.7 Measuring and Reporting Performance
1.8 Quantitative Principles of Computer Design
1.9 Putting It All Together: Performance and Price-Performance
1.10 Fallacies and Pitfalls
1.11 Concluding Remarks

2 Instruction Level Parallelism and Its Exploitation
2.1 Instruction-Level Parallelism: Concepts and Challenges
2.2 Basic Compiler Techniques for Exposing ILP
2.3 Reducing Branch Costs with Prediction
2.4 Overcoming Data Hazards with Dynamic Scheduling
2.5 Dynamic Scheduling: Examples and the Algorithm
2.6 Hardware-Based Speculation
2.7 Exploiting ILP using Multiple Issue and Static Scheduling
2.8 Exploiting ILP using Dynamic Scheduling, Multiple Issue, and Speculation
2.9 Advanced Techniques for Instruction Delivery and Speculation
2.10 Putting It All Together: The Intel Pentium 4
2.11 Fallacies and Pitfalls
2.12 Concluding Remarks

3 Advanced Techniques for Exploiting Instruction-Level Parallelism and Their
Limits
3.1 Introduction
3.2 Studies of the Limitations of ILP
3.3 Limitations on ILP for Realizable Processors
3.4 Crosscutting Issues: Hardware versus Software Speculation
3.5 Multithreading: Using ILP Support to Exploit Thread-level Parallelism
3.6 Putting It All Together: Performance and Efficiency in
Advanced Multiple Issue Processors
3.7 Fallacies and Pitfalls
3.8 Concluding Remarks

4 Multiprocessors and Thread-Level Parallelism
4.1 Introduction
4.2 Symmetric Shared-Memory Architectures
4.3 Performance of Symmetric Shared-Memory Multiprocessors
4.4 Distributed Shared Memory and Directory-Based Coherence
4.5 Synchronization: The Basics
4.6 Models of Memory Consistency: An Introduction
4.7 Crosscutting Issues
4.8 Putting It All Together: The Sun T1 Multiprocessor
4.9 Fallacies and Pitfalls
4.10 Concluding Remarks

5 Memory Hierarchy Design
5.1 Introduction
5.2 Eleven Advanced Optimizations of Cache Performance
5.3 Memory Technology and Optimizations
5.4 Protection: Virtual Memory and Virtual Machines
5.5 Crosscutting Issues: The Design of Memory Hierarchies
5.6 Putting It All Together: AMD Opteron Memory Hierarchy
5.7 Fallacies and Pitfalls
5.8 Concluding Remarks

6 Storage Systems
6.1 Introduction
6.2 Advanced Topics in Disk Storage
6.3 Definition and Examples of Real Faults and Failures
6.4 I/O Performance, Reliability Measures, and Benchmarks
6.5 A Little Queuing Theory
6.6 Crosscutting Issues
6.7 Designing and Evaluating an I/O System - The Internet Archive Cluster
6.8 Putting It All Together: NetApp FAS6000 Filer
6.9 Fallacies and Pitfalls
6.10 Concluding Remarks

Appendix A: Pipelining: Basic and Intermediate Concepts
A.1 Introduction
A.2 The Major Hurdle of Pipelining~WPipeline Hazards
A.3 How Is Pipelining Implemented?
A.4 What Makes Pipelining Hard to Implement?
A.5 Extending the MIPS Pipeline to Handle Multicycle Operations
A.6 Putting It All Together: The MIPS R4000 Pipeline
A.7 Crosscutting Issues
A.8 Fallacies and Pitfalls
A.9 Concluding Remarks

Appendix B: Instruction Set Principles and Examples
B.1 Introduction
B.2 Classifying Instruction Set Architectures
B.3 Memory Addressing
B.4 Addressing Modes for Signal Processing 1
B.5 Type and Size of Operands
B.6 Operations in the Instruction Set
B.7 Instructions for Control Flow
B.8 Encoding an Instruction Set
B.9 Crosscutting Issues: The Role of Compilers
B.10 Putting It All Together: The MIPS Architecture
B.11 Fallacies and Pitfalls
B.12 Concluding Remarks

Appendix C: Introduction to Memory Hierarchy
C.1 Introduction
C.2 Cache Performance
C.3 Seven Basic Cache Optimizations
C.4 Virtual Memory
C.5 Protection and Examples of Virtual Memory
C.6 Fallacies and Pitfalls
C.7 Concluding Remarks
.



Relevant Pages

  • Fourth edition of Computer Architecture: A Quantitative Approach
    ... Computer Architecture: ... Increased coverage on achieving parallelism with multiprocessors. ... Advanced Techniques for Instruction Delivery and Speculation ... Distributed Shared Memory and Directory-Based Coherence ...
    (comp.arch)
  • Re: PIC vs ARM assembler (no flamewar please)
    ... At least architectures with different instruction word and data word ... length must be harvard architecture. ... They would likely be Harvard indeed. ... 16-bit reads to a 14-bit instruction memory. ...
    (comp.arch.embedded)
  • Re: Learning to use PICS
    ... -If your objective is to understand the architecture of a 16F84, ... It facilitates a uniform instruction set where virtually every ... banking is required in order to access them. ... Not true for the data memory, and mostly no longer true for program memory ...
    (sci.electronics.basics)
  • Re: Learning to use PICS
    ... -If your objective is to understand the architecture of a 16F84, ... It facilitates a uniform instruction set where virtually every ... banking is required in order to access them. ... Not true for the data memory, and mostly no longer true for program memory ...
    (sci.electronics.design)
  • Re: [Lit.] Buffer overruns
    ... > floating point support or a memory expansion option. ... had virtual memory support grafted on. ... > where the modified instruction was fetched from. ... vis-a-vis the official coporate strategic operating system TSS/360. ...
    (sci.crypt)

Loading