WebGL Deferred Shading

University of Pennsylvania, CIS 565: GPU Programming and Architecture, Project 6

Tongbo Sui
Tested on: Google Chrome 46.0.2490.71, Firefox 41.0.2, on Windows 10, i5-3320M @ 2.60GHz 8GB, NVS 5400M 2GB (Personal)

Overview

WebGL based deferred lighting, with bloom and toon effects.

Live Online

Demo Video

Features and Analysis

Effects

Deferred Blinn-Phong shading [4]
- Blinn-Phong lighting with normal mapping support, rendered in a separate pass outside fragment shader
- Optimization: remove out surface normal calculations (see Optimized g-buffer)
- Possible further improvements: better falloff computation

Deferred render

Debug views for deferred lighting

Bloom
- Post-process Gaussian blur [1] used to simulate glow effect
- Optimization: use two-pass Gaussian blur instead of one single convolution kernel
  - Single convolution kernel requires significantly more time to compute. For a 5x5 kernel, that would require 25 reads per fragment
  - On the other hand, two-pass convolution only needs 10 reads per fragment, which yields a 2.5x speedup
- Possible further improvements: none
- Performance impact: minor performance impact; see graph below

Bloom

Toon shading
- Ramp shading with an edge detector for rendering outlines [2][3]
- Uses Sobel operator as edge detectors. The operator requires two convolution passes, each of which is a 3x3 kernel
- Possible further improvements: more passes for faster convolution. E.g. break each pass further into 2 passes, so that in each pass the shader is only applying convolution with a 1D kernel
- Performance impact: medium performance impact; see graph below

Toon

Additional material properties
- Supports specular exponent as an input to g-buffers
- Performance impact: no visible impact

Additional material property

Frame time (ms) with varying # of lights

Optimizations

Optimized g-buffer format, reduce the number and size of g-buffers:
- Surface normal pre-calculation
  - Applying the normal map in the copy shader pass instead of copying both geometry normals and normal maps
  - Performance analysis
    - Significant improvement over non-optimized g-buffer for the following two reasons:
      - 1 less g-buffer to read/write. This results in overall shorter memory access time
      - Since the Blinn-Phong shader shades fragments in a loop, pre-calculating surface normal avoids recalculating surface normal over and over again inside the loop. This greatly reduces the computation time during deferred lighting
    - Tradeoff: less flexibility inside the shader. The shader won't get geometry normal as input.
- RGBA packing; code derived from [5]
  - On top of pre-calculating surface normal, packing RGBA information into a single floating point value in g-buffer
  - Color channels are packed into one single float by simulating bit-shifts with multiplication
  - Performance analysis
    - Minor performance improvement, due to one less g-buffer and hence overall faster memory access
    - In general, this provides only marginal performance improvement, while reducing color quality greatly during color reconstruction
    - Tradeoff: significant loss in color quality
- Improved RGBA packing; same code structure of [5], but different packing method
  - On top of pre-calculating surface normal, packing RGBA information into a single floating point value in g-buffer
  - Instead of simulating bit-shifts, operate in base 10 and allocate each channel 3 digits in the floating point value
    - Due to some unknown issue with GLSL on floating point values (either precision or interpretation), more digits per channel is not feasible
    - This doesn't include alpha channel. Therefore it's a 9-digit integer stored as a floating point value. With alpha, each channel will only get 2 digits
  - Performance analysis
    - Exact same performance benefit as "old" color-packing method, but without visible color quality loss
    - Color quality losses do happen. In fact, due to the ordering of the channels during packing, blue channel suffers from greater quality loss than red and green channels.
    - However compared to the bit-shift method, quality loss is much less visible, if not indistinguishable
    - Tradeoff
      - Screenshot feature doesn't work (returns black image); Need to press print screen on the keyboard for anything to be captured
      - Alpha channel is excluded
- master-backup has the version without color-packing. The render logic differs greatly in fragment shader and other shaders. Therefore they are in separate branches

Render with RGBA packing (Old). Notice the reduced color quality of the reconstructed color map

Quality of improved RGBA packing. Notice the almost identical color quality

G-buffer optimization effects

Scissoring
- Scissor the screen for each light, such that it only renders in a rectangle around the light
- Performance analysis
  - Effectivity
    - Effective in most cases, as instead of calculating lighting for all fragments, only a small portion of the fragments participate in the actual computation. Hence if a fragment is lit by only a few lights, less computation is needed.
    - However, it becomes ineffective when number of lights increases to a significant amount. As seen in "Frame time" graph above, at 120 lights, the frame time increases dramatically. At this point, lots of lights are covering almost every fragment in the scene. Even though for each light, it's still scissored, but since there is too much overlapping, all fragments need to participate in the computation many times. Therefore the scissoring is much less effective
  - Tradeoff: some artifacts due to loose AABB calculation (fixed below)

Scissoring. Light boxes are in red. Darker means less light overlaps

Improved screen-space AABB for scissor test
- Calculate bounding box at sphere's front-facing max circle to minimize distortion due to transformations
- Performance analysis
  - Faster scissor tests due to more accurate (smaller) AABB calculation. Potentially much less overlapping on the edge of each light, for each fragment
  - Fixes artifacts caused by default AABB calculation

AABB size comparison. Improved AABB calculation yields smaller scissor boxes

Scissoring performance (no effects)

Scissoring performance (effects)

Two-pass Gaussian blur [1]
- Uses separate passes to apply convolution in both x and y directions
- Performance analysis
  - Faster post-processing
  - As stated previoiusly, single convolution kernel requires significantly more time to compute. For a 5x5 kernel, two-pass convolution yields a 2.5x speedup
  - Tradeoff: one more render pass. This requires one more buffer manipulation and memory setup

References

[1] Bloom: GPU Gems, Ch. 21
[2] Edge detector: Sobel Operator
[3] Toon ramping: Cel shading
[4] Blinn-Phong: Blinn–Phong shading model

Third-party codes

[5] RGBA packing: Aras Pranckevičius / StackOverflow

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
glsl		glsl
img		img
js		js
lib		lib
models		models
.gitignore		.gitignore
README.md		README.md
favicon.ico		favicon.ico
index.html		index.html
server.bat		server.bat
server.py		server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WebGL Deferred Shading

Overview

Live Online

Demo Video

Features and Analysis

Effects

Deferred render

Debug views for deferred lighting

Bloom

Toon

Additional material property

Frame time (ms) with varying # of lights

Optimizations

Render with RGBA packing (Old). Notice the reduced color quality of the reconstructed color map

Quality of improved RGBA packing. Notice the almost identical color quality

G-buffer optimization effects

Scissoring. Light boxes are in red. Darker means less light overlaps

AABB size comparison. Improved AABB calculation yields smaller scissor boxes

Scissoring performance (no effects)

Scissoring performance (effects)

References

Third-party codes

About

Uh oh!

Releases

Packages

Languages

StoneBird/Project6-WebGL-Deferred-Shading

Folders and files

Latest commit

History

Repository files navigation

WebGL Deferred Shading

Overview

Live Online

Demo Video

Features and Analysis

Effects

Deferred render

Debug views for deferred lighting

Bloom

Toon

Additional material property

Frame time (ms) with varying # of lights

Optimizations

Render with RGBA packing (Old). Notice the reduced color quality of the reconstructed color map

Quality of improved RGBA packing. Notice the almost identical color quality

G-buffer optimization effects

Scissoring. Light boxes are in red. Darker means less light overlaps

AABB size comparison. Improved AABB calculation yields smaller scissor boxes

Scissoring performance (no effects)

Scissoring performance (effects)

References

Third-party codes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages