Filtering Approaches for Real-Time Anti-Aliasing

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Destruction Masking in Frostbite 2 using Volume Distance Fields
Visible-Surface Detection(identification)
CRYTEK © 2010 Crytek GmbH BRINGING STEREO TO CONSOLES Nicolas Schulz, R&D Graphics Engineer GDC Europe 2010, AAA Stereo-3D in CryENGINE.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
and 6.855J Cycle Canceling Algorithm. 2 A minimum cost flow problem , $4 20, $1 20, $2 25, $2 25, $5 20, $6 30, $
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
Multiplying binomials You will have 20 seconds to answer each of the following multiplication problems. If you get hung up, go to the next problem when.
0 - 0.
ALGEBRAIC EXPRESSIONS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULTIPLYING MONOMIALS TIMES POLYNOMIALS (DISTRIBUTIVE PROPERTY)
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Year 6 mental test 5 second questions
Environmental Remote Sensing GEOG 2021
Around the World AdditionSubtraction MultiplicationDivision AdditionSubtraction MultiplicationDivision.
7.1 Vis_04 Data Visualization Lecture 7 3D Scalar Visualization Part 2 : Volume Rendering- Introduction.
16.1 Si23_03 SI23 Introduction to Computer Graphics Lecture 16 – Some Special Rendering Effects.
Robust Window-based Multi-node Technology- Independent Logic Minimization Jeff L.Cobb Kanupriya Gulati Sunil P. Khatri Texas Instruments, Inc. Dept. of.
Grass, Fur and all things hairy Nicolas ThibierozKarl Hillesland Gaming Engineering Manager, AMDSenior Research Engineer, AMD.
Re-examining Instruction Reuse in Pre-execution Approaches By Sonya R. Wolff Prof. Ronald D. Barnes June 5, 2011.
Real-Time Projector Tracking on Complex Geometry Using Ordinary Imagery Tyler Johnson and Henry Fuchs University of North Carolina – Chapel Hill ProCams.
Deferred Lighting and Post Processing on PLAYSTATION®3
Filtering Approaches for Real-Time Anti-Aliasing
Chapter 4 Memory Management Basic memory management Swapping
ABC Technology Project
Project 5: Virtual Memory
1 Overview Assignment 4: hints Memory management Assignment 3: solution.
2 |SharePoint Saturday New York City
O X Click on Number next to person for a question.
25 July, 2014 Martijn v/d Horst, TU/e Computer Science, System Architecture and Networking 1 Martijn v/d Horst
25 July, 2014 Martijn v/d Horst, TU/e Computer Science, System Architecture and Networking 1 Martijn v/d Horst
5 August, 2014 Martijn v/d Horst, TU/e Computer Science, System Architecture and Networking 1 Martijn v/d Horst
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
8 2.
© 2012 National Heart Foundation of Australia. Slide 2.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
Properties of Exponents
Addition 1’s to 20.
25 seconds left…...
Test B, 100 Subtraction Facts
Week 1.
Vector Algebra One Mark Questions PREPARED BY:
SIGGRAPH 2010 Dmitry Andreev (AND) Real-time Frame Rate Up-conversion for Video Games or how to get from 30 to 60 fps for “free”
Group Meeting Presented by Wyman 10/14/2006
We will resume in: 25 Minutes.
Bottoms Up Factoring. Start with the X-box 3-9 Product Sum
O X Click on Number next to person for a question.
PSSA Preparation.
DirectCompute Accelerated Separable Filtering 28th February 20112AMD‘s Favorite Effects.
X-box Factoring. X- Box 3-9 Product Sum Factor the x-box way Example: Factor 3x 2 -13x (3)(-10)= x 2x 3x 2 x-5 3x +2.
CS123 | INTRODUCTION TO COMPUTER GRAPHICS Andries van Dam © 1/16 Deferred Lighting Deferred Lighting – 11/18/2014.
Adaptive Segmentation Based on a Learned Quality Metric
Technische Universität München Computer Graphics SS 2014 Graphics Effects Rüdiger Westermann Lehrstuhl für Computer Graphik und Visualisierung.
Filtering Approaches for Real-Time Anti-Aliasing
Filtering Approaches for Real-Time Anti-Aliasing
Filtering Approaches for Real-Time Anti-Aliasing /
Xbox MB system memory IBM 3-way symmetric core processor ATI GPU with embedded EDRAM 12x DVD Optional Hard disk.
Stencil Routed A-Buffer
Graphics Processing Unit
Deferred Lighting.
The Graphics Rendering Pipeline
UMBC Graphics for Games
RADEON™ 9700 Architecture and 3D Performance
Presentation transcript:

Filtering Approaches for Real-Time Anti-Aliasing

Filtering Approaches for Real-Time Anti-Aliasing FXAA 3.11 in 15 Slides Timothy Lottes NVIDIA

What Is FXAA 3.11? Fast approXimate Anti-Aliasing –Two algorithms, FXAA 3.11 Console (360 and PS3) FXAA 3.11 Quality (PC) Fixed set of constraints –One shader pass, only color input, only color output –Run on all APIs (GL, DX9, through DX11, etc) –Certainly better can be done under other constraints!

Why FXAA 3.11? Resolution + Deferred + MSAA = Problem! –5760 x 1080 x Stereo = 12.5 Mpix Memory 12.5 Mpix –238 MB for just one non-MSAA G-buffer tiny 20B/pixel) Texture 12.5 Mpix –Only 6.25 tex/pix/ms (GTX590) –Compare to 8 tex/pix/ms for 1 Mpix (~720p)

What Does MSAA Cost? Cost varies based on scene, type of engine, GPU, etc Example average extra ms/frame and %frame for MSAA –8xMSAA, World Of 1920x ms (GTX 570) = 17%, 2.2 ms (HD 6950) = 17% –4xMSAA, Lost Planet 1920x ms (GTX 570) = 14%, 3.3 ms (HD 6950) = 13% –4xMSAA, 1280x ms (GTS 450) = 18%, 1.4 ms (HD 6850) = 11% –4xMSAA, Just Cause 1280x ms (GTS 450) = 11%, 3.1 ms (HD 6850) = 16% –4xMSAA, Metro 1280x ms (GTS 450) = 32%, 3.5 ms (HD 6850) = 23%

FXAA 3.11 Console FXAANo AA Test Image from NVIDIA Stencil Routed K-Buffer SDK 10 Sample

FXAA 3.11 Console Early Exit Early exit for pixels not needing AA –Fetch 4 filtered luma values, and luma for M Need AA if contrast is high relative to maxLuma –maxLuma = max(nw,ne,sw,se) –contrast = max(nw,ne,sw,se,m) - min(nw,ne,sw,se,m) –if(contrast >= max(minThreshold, maxLuma * threshold)) high ratio (edge) medium ratio (edge) low ratio (no edge) N WME S

All pixels which do not exit get this 2 tap filter –Direction perpendicular to local luma gradient Use the four 2x2 box filtered luma values –dir.x = -((NW+NE)-(SW+SE)) –dir.y = ((NW+SW)-(NE+SE)) –dir.xy = normalize(dir.xy) * scale Optional extra 2 taps –Scale dir.xy by 1/minDir minDir = min(|dir.x|, |dir.y|) * sharpness –Then limit filter width to 8 pixels FXAA 3.11 Console Taps WME S N EWM S N EWM S N

FXAA 3.11 Console Extra Taps Check if the full 4-tap filter is invalid –Compare 4-tap filter luma to neighborhood luma, Use the min and max luma range of the original 4 samples – {NW, NE, SW, SE} If 4-tap filter luma exceeds this range, –Assume invalid and use just the first 2 taps luma range

FXAA 3.11 Console on Hz = 3% –0.8 ms in shader ms for EDRAM resolve –Using FXAA_GREEN_AS_LUMA Optimizations –Use free texture sampler exponent bias Alias multiple samplers to same input texture –Manual tfetch2D assembly to include offsets –Use early-exit branch –Optimize constant usage

FXAA 3.11 Console on PS Hz = 3.6% –Using RGBL input and FXAA_EARLY_EXIT –Very close to estimated NVShaderPerf of 15 clk/pixel Optimizations –Increase from 3 to 7 registers saves 0.15 ms/frame Increases TEX$ hits –Optimize for PS3 RSX pixel pipeline including, FP16 precision, non-perspective interpolation De-vectorize and hand schedule scalar ops at shader entry Re-vectorize from half4 to half2 xy and zw pairs Take advantage of free power-of-2 multiply and divide Turned early-exit into a conditional assignment (no branch)

FXAA 3.11 Console FSS FXAA FSSNo AA Image captured from NVIDIA Hair SDK 11 Sample

FXAA 3.11 Quality Preset 13 FXAANo AA Image from modified NVIDIA Stochastic Transparency Demo

FXAA 3.11 Quality on PC Default preset performance, –Note performance will vary Based on preset, settings, GPU, and image source –GTX Hz = 2.3% –GTX Hz = 5.3%

FXAA 3.11 Quality FSS Image from modified NVIDIA Endless City Demo

Teaser for FXAA TSSAA Low Motion Fast Motion NoAA

Thanks Thanks again for all the developer feedback. –FXAA has been greatly improved thanks to your comments!