Hot Chips Bringing Workstation Graphics Performance to a Desktop Near You. S3 Incorporated August 18-20, 1996

Similar documents
0;L$+LJK3HUIRUPDQFH ;3URFHVVRU:LWK,QWHJUDWHG'*UDSKLFV

AGP Rev. A+ Graphics Card User s Manual

Module Introduction. Content 15 pages 2 questions. Learning Time 25 minutes

Optimizing Games for ATI s IMAGEON Aaftab Munshi. 3D Architect ATI Research

Module 13C: Using The 3D Graphics APIs OpenGL ES

Structure. Woo-Chan Park, Kil-Whan Lee, Seung-Gi Lee, Moon-Hee Choi, Won-Jong Lee, Cheol-Ho Jeong, Byung-Uck Kim, Woo-Nam Jung,

CS451Real-time Rendering Pipeline

Barracuda. Technical Specification. Barracuda Issue 1.0 Page 1 of 10

VISUALIZE Workstation Graphics for Windows NT. By Ken Severson HP Workstation System Lab

Windowing System on a 3D Pipeline. February 2005

Falanx Microsystems. Company Overview

1. Introduction 2. Methods for I/O Operations 3. Buses 4. Liquid Crystal Displays 5. Other Types of Displays 6. Graphics Adapters 7.

Pipeline Operations. CS 4620 Lecture 14

TVP4010 USERS GUIDE Architecture Overview

CS4620/5620: Lecture 14 Pipeline

Spring 2009 Prof. Hyesoon Kim

The Need for Programmability

Graphics Processing Unit Architecture (GPU Arch)

GoForce 3D: Coming to a Pixel Near You

POWERVR MBX. Technology Overview

Pipeline Operations. CS 4620 Lecture Steve Marschner. Cornell CS4620 Spring 2018 Lecture 11

2D/3D Graphics Accelerator for Mobile Multimedia Applications. Ramchan Woo, Sohn, Seong-Jun Song, Young-Don

Building scalable 3D applications. Ville Miettinen Hybrid Graphics

Standard Graphics Pipeline

Architectures. Michael Doggett Department of Computer Science Lund University 2009 Tomas Akenine-Möller and Michael Doggett 1

Architecture Overview

Rasterization Overview

CS427 Multicore Architecture and Parallel Computing

- Rasterization. Geometry. Scan Conversion. Rasterization

Texture Compression. Jacob Ström, Ericsson Research

Real-Time Rendering (Echtzeitgraphik) Michael Wimmer

Optimizing and Profiling Unity Games for Mobile Platforms. Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June

Lecture 2. Shaders, GLSL and GPGPU

Drawing Fast The Graphics Pipeline

Overview. Technology Details. D/AVE NX Preliminary Product Brief

Graphics Hardware. Instructor Stephen J. Guy

Spring 2011 Prof. Hyesoon Kim

GeForce4. John Montrym Henry Moreton

Post processing techniques to accelerate assertion development Ajay Sharma

Graphics Hardware, Graphics APIs, and Computation on GPUs. Mark Segal

What Next? Kevin Walsh CS 3410, Spring 2010 Computer Science Cornell University. * slides thanks to Kavita Bala & many others

Pipeline Operations. CS 4620 Lecture 10

Scanline Rendering 2 1/42

Mali-400 MP: A Scalable GPU for Mobile Devices Tom Olson

Lets assume each object has a defined colour. Hence our illumination model is looks unrealistic.

3D Graphics and OpenGl. First Steps

AMD Embedded PCIe ADD-IN BOARD E6760/E6460 Datasheet. (ER93FLA/ER91FLA-xx)

AMD HD5450 PCIe ADD-IN BOARD. Datasheet AEGX-A3T5-01FST1

Introduction to Computer Graphics (CS602) Lecture No 03 Graphics Systems

User Guide. NVIDIA Quadro FX 4700 X2 BY PNY Technologies Part No. VCQFX4700X2-PCIE-PB

MB86290 Series Graphics Driver V02 User's Manual Rev. 3.1 Lime

CS 464 Review. Review of Computer Graphics for Final Exam

Support Triangle rendering with texturing: used for bitmap rotation, transformation or scaling

Digital Blocks Semiconductor IP

Denali Technical Overview

AMD HD5450 PCI ADD-IN BOARD. Datasheet. Advantech model number:gfx-a3t5-61fst1

Mattan Erez. The University of Texas at Austin

Advanced Rendering Techniques

CS 112 The Rendering Pipeline. Slide 1

Grafica Computazionale: Lezione 30. Grafica Computazionale. Hiding complexity... ;) Introduction to OpenGL. lezione30 Introduction to OpenGL

Whiz-Bang Graphics and Media Performance for Java Platform, Micro Edition (JavaME)

1.2.3 The Graphics Hardware Pipeline

Real - Time Rendering. Graphics pipeline. Michal Červeňanský Juraj Starinský

Cornell University CS 569: Interactive Computer Graphics. Introduction. Lecture 1. [John C. Stone, UIUC] NASA. University of Calgary

Mattan Erez. The University of Texas at Austin

Graphics Hardware and Display Devices

Optimizing DirectX Graphics. Richard Huddy European Developer Relations Manager

CS 498 VR. Lecture 19-4/9/18. go.illinois.edu/vrlect19

X. GPU Programming. Jacobs University Visualization and Computer Graphics Lab : Advanced Graphics - Chapter X 1

The VISUALIZE fx family of graphics subsystems consists of three

A HIGHLY-INTEGRATED WORKSTATION GRAPHICS SYSTEM DESIRED FUNCTIONALITY

3-D Accelerator on Chip

A 120mW Embedded 3D Graphics Rendering Engine with 6Mb Logically Local Frame-Buffer and 3.2GByte/s Run-time Reconfigurable Bus for PDA-Chip

Copyright 1999 Fire GL Graphics. All rights reserved

Drawing Fast The Graphics Pipeline

Optimizing for DirectX Graphics. Richard Huddy European Developer Relations Manager

Vertex Shader Design I

ECE 571 Advanced Microprocessor-Based Design Lecture 20

100MHz Front Side Bus Yes: 512KB Pipeline Burst SRAM Integrated into CPU Data Bus Width/Address Bus Width 64-bit/64-bit

The Rasterization Pipeline

AMD E M PCIEx16 GFX-AE6460F16-5C

Module Contact: Dr Stephen Laycock, CMP Copyright of the University of East Anglia Version 1

GV-N52128D / GV-N52128T-P

From Vertices to Fragments: Rasterization. Reading Assignment: Chapter 7. Special memory where pixel colors are stored.

CS130 : Computer Graphics. Tamar Shinar Computer Science & Engineering UC Riverside

Shadow Algorithms. CSE 781 Winter Han-Wei Shen

Rendering. Converting a 3D scene to a 2D image. Camera. Light. Rendering. View Plane

Hardware-driven visibility culling

Build cost-effective, reliable signage solutions with the 8 display output, single slot form factor NVIDIA NVS 810

Upgrade your Life AV32DG USER'S MANUAL. RADEON TM 7000E Graphics Accelerator REV MD-AV32DG-102

E.Order of Operations

2.11 Particle Systems

AMD E8870 4GB PCIEX16 Mini DP X4 Low profile ER24FL-SK4 GFX-AE8870L16-5J

Enabling immersive gaming experiences Intro to Ray Tracing

The F-Buffer: A Rasterization-Order FIFO Buffer for Multi-Pass Rendering. Bill Mark and Kekoa Proudfoot. Stanford University

Reading. 18. Projections and Z-buffers. Required: Watt, Section , 6.3, 6.6 (esp. intro and subsections 1, 4, and 8 10), Further reading:

AMD HD5450 PCIe X1 ADD-IN BOARD. Datasheet. Advantech model number: GFX-A3T5-71FST1

A fixed-point 3D graphics library with energy-efficient efficient cache architecture for mobile multimedia system

CS230 : Computer Graphics Lecture 4. Tamar Shinar Computer Science & Engineering UC Riverside

CIS 581 Interactive Computer Graphics (slides based on Dr. Han-Wei Shen s slides) Requirements. Reference Books. Textbook

Transcription:

Hot Chips 1996 Bringing Workstation Graphics Performance to a Desktop Near You S3 Incorporated August 18-20, 1996

Agenda ViRGE/VX Marketing Slide! Overview of ViRGE/VX accelerator features 3D rendering Live video and MPEG YUV stream display C-Model and verification

ViRGE/VX Marketing slide The Only Marketing Slide in This Presentation! ViRGE / VX is the worlds only integrated NT accelerator High performance 2D, Video, and 3D engine Worlds only integrated 220MHz RAMDAC Supports 1600x1200x16 at 85Hz! 1600x1200x24 at 60Hz is achievable Workstation Software Support OpenGL via 3D-DDI, ICD, MCD and TCD NT3.51, and NT4.0

ViRGE/VX Architectural Features Bus Interface Unit LPB Command FIFO VGA Regs S3D FIFO S3d Regs LPB FIFO LPB Regs Memory Interface Unit Video Memory Streams Processor I G A DAC

Video Playback 2 memory display streams Primary: VGA modes, 8 bit palletized; 16 and 24 bit color Secondary: 8, 16, and 24 bit as well as YUV Secondary stream is very flexible Horizontal and vertical stretching Horizontal and vertical filtering Secondary stream can be overlaid, blended or color-keyed onto primary stream

3D Rendering Features of the S3d Engine Architecture Triangle based rendering Shading features Flat shading Gouraud shading Texturing features Perspective correction (w. divide) Point, fast bi-linear and tri-linear filtering MIP-mapping Alpha blending Z-buffering Depth cueing Compressed texture formats

S3D 2D/3D Engine Block Diagram FROM BUS INTERFACE UNIT FROM COMMAND FIFO FROM TRIANGLE PROCESSOR SPAN ENGINE DMA FORMAT COMMAND FIFO 4 PORT REGISTER FILE 4 PORT REGISTER FILE 2D AND Z ENGINE - IMAGE WRITE - COLOR EXPANSION - SCREEN-SCREEN BLT/ROP - TEXTURE READ FORMAT - Z-COMPARE - BLOCK WRITE FIFO (BLT) BLENDER FIFO ADDRESS GENERATOR AND CLIPPING INTERPOLATOR INTERPOLATOR NON-LINEAR DIVIDE PIPE PIXEL FIFO TRIANGLE PROCESSOR PIXEL PROCESSOR TO 2D AND Z ENGINE TO PIXEL PROCESSOR MEMORY READ MEMORY WRITE TO MEMORY INTERFACE UNIT ADDRESS/BE

Multi-Pass Sub-Span Rendering Sub-spans are 16 to 64 pixels long Low gate count is achieved by matching the production order (in TP) with consumption order (in PP) Up to 4 pass rasterization on each sub-span Z, if Z-buffering is enabled (u, v, w), if texture mapping (r, g, b), if gouraud or lit-textures a, if fogging or alpha-transparency Sub-spans

MIP-Mapping MIP-mapping is the basic texture map format A rectangular flat map is a subset of this mode MIP-mapping allows: improved texture fetch performance by increasing locality Reduced texture download bandwidth Texture anti-aliasing LOD calculation is computed on a pixel-by- pixel basis Up Up to to 9 levels levels on on powers powers of of 2 sizes sizes

C-Model Architecture Application Driver Script Parser test.tri S3d Engine Modules Initial frame- buffer contents PCI Master/Slave Triangle Processor Pixel Processor Memory Interface Framebuffer test.hwk (PCI Trans.) test.hwc (Command Trans.) test.hwp (Pixel Trans.) test.hwm (Memory Trans.)

Development Project Phases Functional Spec & Algorithm Design [Algorithmic C-Model] Detailed Microarch. Transaction accurate C-Model VHDL Entry and Verification Full Chip Integration and Verification (pre-route) Final Place and route Post Route Simulation Silicon and Driver Verification Algorithm and Functional Validation Accounted for Over 60% of Elapsed Time Ability to Lay Down Gates Is Getting Easier. Validating the Functionality is Getting Harder

C-Model Architecture Application Driver Script Parser PCI Master/Slave test.tri test.hwk (PCI trans.) S3d Engine Modules Triangle Processor Pixel Processor Memory Interface Framebuffer test.hwc (Command trans.) test.hwp (Pixel trans.) test.hwm (Memory trans.) Initial frame- buffer contents

C-Model Properties Bit and Register Accurate Transaction/Order Accurate at Module Partitions Verified Visually Using Pictures/ Animations

C-Model Properties (cont.) Allows the Same Test to Be: Run at module, full chip, and silicon level Run on behavioral and gate level models IC test vectors Allows Drivers and Applications to Run Using C-Model Before Silicon Is Available

Emulation Full Chip Emulated at 1 MHz on P5 PCI Motherboard Using Quickturn System Realizer Box Display Capability Using Quickturn Picasso Module Emulation Started When ViRGE- Core Was Stable. Allowed Net- List Conversion and Physical Setup Debugging Before Engine Logic Stable.

Emulation (cont.) Major Objectives Were: Run VGA compliance tests Debug S/W: drivers and diags Run Winbench and Windows and other Large Tests. Use to probe internal nodes during silicon debug and metal mask fixes

Summary ViRGE/VX provides the complete feature set for a high-end PC graphics solution High performance & quality 3D acceleration Outstanding 2D acceleration for windows Enabled by an integrated 220MHz RAMDAC Complete VGA compatibility Video capture and display Low cost, Single-chip solution