© Tobias Goetz, Fraunhofer ITWM
1
3D VIZ WITHOUT THE GPU
The PV-4D Render Engine
© Tobias Goetz, Fraunhofer ITWM
2
Content
1. Introduction
2. Features
3. Examples
4. Technology
5. Target Applications & Competition
6. Implementation: XtreemView
© Tobias Goetz, Fraunhofer ITWM
3
Introduction
PV-4D is
 a parallel 3D render engine, i.e. software/code to generate images from three (or
more) dimensional datasets.
 completely CPU based, i.e. does not require graphic hardware
 capable of rendering huge datasets
 scalable, i.e. adjust the hardware to the problem not vice versa.
 able to deliver photorealistic renderings as well as game/animation style images
 easy and intuitive to implement in a software solution
© Tobias Goetz, Fraunhofer ITWM
4
Introduction
PV-4D directly supports three different kind of data types:
1. Volumetric Data (e.g. seismic data, MRI/CT datasets, 3D X-Rays, …)
2. Triangulated Objects (e.g. CAD/CAM, Games, Architecture, Film, …)
3. Polygon Objects (i.e. hexahedrons / tetrahedrons, e.g. reservoir simulations)
Besides those datasets, PV-4D has an interface for regular OpenGL programming,
thus users can add custom objects easily into and/or on top of a 3D scene
© Tobias Goetz, Fraunhofer ITWM
5
Features
 High quality display of full real amplitude values (32bit float) in HD quality.
 Easy and gradual blending between two volumetric datasets.
 Volume rendering on entire datasets. Transparency set by alpha value for the
color, associated to certain values.
 Real-time support for full quality zoom, pan, and rotate, as well as scaling and
translating individual objects within the scene
 Perspective and parallel projection for easy orientation and fast, distortion free
browsing through large datasets.
 Easy cross-section displays and slicing of x, y and z planes.
 Easy advanced slicing along I, j, k hexa-planes with range selection in real-time
using rebuilt multi bounding volume hierarchy.
 Instant read-out of cursor position and amplitude value(s) at any given position.
© Tobias Goetz, Fraunhofer ITWM
6
Features
 PV-4D directly supports triangles, voxels, hexahedrons; all other objects indirectly
with OpenGL
 Full scene graph for easy management of multiple objects
 OpenGL can co-exist in both directions using depth buffer management:
 PV-4D  OpenGL
 OpenGL  PV-4D
 Available 32bit z-Buffer for integration of user objects with OpenGL/Mesa
 On single node systems, OpenGL can be used directly on the GPU, on multi-node
systems, Mesa3D is used. Engine switches automatically
 Engine comes with documentation as well as example code snippets to illustrate
how it is used.
© Tobias Goetz, Fraunhofer ITWM
7
Features
 State-of-the-art geometry handling (compiler, traversal, intersection)
 Full HDR pipeline
 Scene lighting using HDR environment maps (individual light sources under
development)
 High-Quality texture filtering
 High-Quality anti-aliasing
 C/C++ library (Intel compiler required to link because of specific intrinsics)
 Builds under Linux, Windows and Mac (parallel version only for Linux)
© Tobias Goetz, Fraunhofer ITWM
8
Examples
High-Quality CAD Visualization
 ~ 25 million triangles
 Resolution: 2800 × 1050
 Full ray differentials
 HQ texture and normalmap filtering
 Un-compressed textures (up to 4k x 4k)
 16x anti-aliasing
 Mitchell-Netravali-Filter reconstruction
 Fully interactive framerate
© Tobias Goetz, Fraunhofer ITWM
9
Examples
Dynamic Fairy With Shadows
 ~ 170,000 triangles (< 5 ms rebuild)
 Keyfreame animation with 100k triangles
 Resolution: 2800 × 1050
 Full ray differentials
 HQ texture and normalmap filtering
 Un-compressed textures (up to 2 k × 2 k)
 8 × anti-aliasing
 Cone-filter reconstruction
 > 60 fps on average
© Tobias Goetz, Fraunhofer ITWM
10
Examples
Triangulated CAD Model
 ~ 350 million triangles
 Resolution: 2800 × 1050
 Triangles distorted/displaced on
purpose by model provider
 No textures, but up to uncompressed
4k x 4k possible
 Fully interactive framerate on a single
workstation
© Tobias Goetz, Fraunhofer ITWM
11
Examples
Interactive Volume Rendering
 2x 57GB seismic dataset
 First rendered using volume rendering,
focusing on interesting layers with high
amplitude values
 Second rendered as regular volume and cut
back to reveal transparent one
 Resolution: 1900 x 1050
 Full 32bit float value used
 Fully interactive framerate on a dual
workstation setting
© Tobias Goetz, Fraunhofer ITWM
12
Technology
 Realtime CPU-based parallel
Raytracing, i.e.
 no special graphics hardware
required, and
 no bandwidth bottleneck between
main memory and GPU memory.
 Keep all data in memory: instant
access at all times
 Parallel and scalable: match hardware
to problem!
 Parallel handling of data: double the
hardware, double the speed
Main Memory, RAM (256GB)
CPU
File System
Graphic Card
4 – 8 GB
~100 GB/s
~8 GB/s
Main Memory, RAM (256GB)
CPU
Main Memory, RAM (256GB)
CPU
Main Memory, RAM (256GB)
CPU
Main Memory, RAM (256GB)
CPU
Main Memory, RAM (256GB)
CPU
Main Memory, RAM (256GB)
CPU
Main Memory, RAM (256GB)
CPU
Main Memory, RAM (256GB)
CPU
Parallel File
System
~100 GB/s each
Classic approach:
Data runs through
GPU for visualization
and needs to go
through 8GB/s
bottleneck.
PV-4D approach:
Omit GPU and do all
visualization on CPU
and in parallel.
© Tobias Goetz, Fraunhofer ITWM
13
Technology: GPI2
 Partitioned Global Address Space (PGAS) solution by Fraunhofer ITWM
 Allows to create large block of memory over many compute nodes with direct
read and write access for all nodes  asynchronous communication
 Uses Ethernet (10GB / 40GB) or Infiniband Interconnects
 Allows fastest image composition even with large number of compute nodes
(the more nodes are calculating an image, the harder the compositing step)
© Tobias Goetz, Fraunhofer ITWM
14
Technology
 Several patented algorithms for under-the-hood tasks
 Fast algorithms for bounding volume hierarchy (BVH) traversing
 Hybrid acceleration structures with dual multi BVH
 State of the art quad / hexahedron intersection detection
 Realtime primitive compilers
 Fastest software based image compositing over many nodes
© Tobias Goetz, Fraunhofer ITWM
15
Anyone who needs fast,
large scale data
visualization
Oil & Gas
Communication
Processing & Interpretation
Interpretation Departments
Processing DepartmentsAutomotive
Design Stage
Virtual Showroom
Medical X-Ray, CRT, MRI
Filming/
Gaming
Animated Movies
Fast CGI
Architecture
Life walk-through
Photorealistic
…
Possible Application Fields
© Tobias Goetz, Fraunhofer ITWM
16
QUESTIONS?
Tobias Goetz
North-America Representative
Fraunhofer Institute for Industrial Mathematics (ITWM)
Competence-Center High-Performance Computing
San Francisco, CA 94103
cell: +1 (510) 908-0867
mail: tobias.goetz@itwm.fraunhofer.de

More Related Content

PPTX
Parallel implementation of geodesic distance transform with application in su...
PDF
Making of-the-logistic-map-bifurcation-diagram
PPTX
High Performance Pedestrian Detection On TEGRA X1
PPTX
View-Dependent Texture Atlases (EG 2010)
PPTX
Real-Time Visual Simulation of Smoke
PDF
PhD defense talk (portfolio of my expertise)
PDF
Fine grained asynchronism for pseudo-spectral codes - with application to tur...
PDF
Point cloud mesh-investigation_report-lihang
Parallel implementation of geodesic distance transform with application in su...
Making of-the-logistic-map-bifurcation-diagram
High Performance Pedestrian Detection On TEGRA X1
View-Dependent Texture Atlases (EG 2010)
Real-Time Visual Simulation of Smoke
PhD defense talk (portfolio of my expertise)
Fine grained asynchronism for pseudo-spectral codes - with application to tur...
Point cloud mesh-investigation_report-lihang

What's hot (20)

PDF
Bryan Thompson, Chief Scientist and Founder at SYSTAP, LLC at MLconf NYC
PDF
cnsm2011_slide
PPTX
Geometry Batching Using Texture-Arrays
PDF
Siggraph2016 - The Devil is in the Details: idTech 666
PPT
Stable SSAO in Battlefield 3 with Selective Temporal Filtering
PPSX
Advancements in-tiled-rendering
PPTX
FlameWorks GTC 2014
PPTX
Frostbite on Mobile
PPTX
Rendering of Complex 3D Treemaps (GRAPP 2013)
PPTX
OTOY Presentation - 2016 NVIDIA GPU Technology Conference - April 5 2016
PDF
OpenGL 4.4 - Scene Rendering Techniques
PPTX
2.5D Clip-Surfaces for Technical Visualization
PPT
The Intersection of Game Engines & GPUs: Current & Future (Graphics Hardware ...
PDF
NVIDIA effects GDC09
PPTX
Tips and Tricks for Data Visualization in Python
PPTX
PPTX
Applying Recursive Temporal Blocking for Stencil Computations to Deeper Memor...
PPTX
Beyond porting
PDF
ARCHES ICF
PPTX
OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015
Bryan Thompson, Chief Scientist and Founder at SYSTAP, LLC at MLconf NYC
cnsm2011_slide
Geometry Batching Using Texture-Arrays
Siggraph2016 - The Devil is in the Details: idTech 666
Stable SSAO in Battlefield 3 with Selective Temporal Filtering
Advancements in-tiled-rendering
FlameWorks GTC 2014
Frostbite on Mobile
Rendering of Complex 3D Treemaps (GRAPP 2013)
OTOY Presentation - 2016 NVIDIA GPU Technology Conference - April 5 2016
OpenGL 4.4 - Scene Rendering Techniques
2.5D Clip-Surfaces for Technical Visualization
The Intersection of Game Engines & GPUs: Current & Future (Graphics Hardware ...
NVIDIA effects GDC09
Tips and Tricks for Data Visualization in Python
Applying Recursive Temporal Blocking for Stencil Computations to Deeper Memor...
Beyond porting
ARCHES ICF
OTOY Presentation - 2015 NVIDIA GPU Technology Conference - March 17 2015
Ad

Viewers also liked (20)

PDF
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
PPTX
Esphera 3D visualization plataform
PDF
Act 00085 i towns, nouveau framework pour la visualisation 3d web
PPT
2009 Opti Tex for online and virtual stores
PDF
Virtual Reality / Real Data: Immersive 3D Visualization and Salesforce
PDF
Need 3D Visualization for your project
PDF
Mago3D: A Brand-New Live 3D Geo-Platform
PDF
Introduction of MAGO3D
PDF
EUGM 2013 - Timea Polgar (ChemAxon) - 3D visualization for medicinal chemists
PDF
GPU - Basic Working
PPTX
Lec04 gpu architecture
PPTX
How Augmented Reality can Boost Print Book Sales!
PDF
GPU - An Introduction
PPTX
Graphics processing unit (gpu)
PDF
CPU vs. GPU presentation
PPT
Graphics Processing Unit - GPU
PPTX
GRAPHICS PROCESSING UNIT (GPU)
PPTX
Graphic Processing Unit (GPU)
PPTX
Graphics processing unit (GPU)
PPTX
Graphics processing unit ppt
論文紹介"DynamicFusion: Reconstruction and Tracking of Non-­‐rigid Scenes in Real...
Esphera 3D visualization plataform
Act 00085 i towns, nouveau framework pour la visualisation 3d web
2009 Opti Tex for online and virtual stores
Virtual Reality / Real Data: Immersive 3D Visualization and Salesforce
Need 3D Visualization for your project
Mago3D: A Brand-New Live 3D Geo-Platform
Introduction of MAGO3D
EUGM 2013 - Timea Polgar (ChemAxon) - 3D visualization for medicinal chemists
GPU - Basic Working
Lec04 gpu architecture
How Augmented Reality can Boost Print Book Sales!
GPU - An Introduction
Graphics processing unit (gpu)
CPU vs. GPU presentation
Graphics Processing Unit - GPU
GRAPHICS PROCESSING UNIT (GPU)
Graphic Processing Unit (GPU)
Graphics processing unit (GPU)
Graphics processing unit ppt
Ad

Similar to Realtime 3D Visualization without GPU (20)

PDF
Unite 2013 optimizing unity games for mobile platforms
PPTX
Presentation NBMP and PCC
PPTX
Tutorial on Point Cloud Compression and standardisation
PDF
Scalability for All: Unreal Engine* 4 with Intel
PPT
Your Game Needs Direct3D 11, So Get Started Now!
PDF
Lecture 15 ryuzo okada - vision processors for embedded computer vision
PPT
FIR filter on GPU
PPTX
Programmable Exascale Supercomputer
PPTX
graphics processing unit ppt
PPTX
Exascale Capabl
PPT
OpenGL ES based UI Development on TI Platforms
PDF
Данило Ульянич “C89 OpenGL for ARM microcontrollers on Cortex-M. Basic functi...
PPT
NVIDIA Graphics, Cg, and Transparency
PDF
Why Networked FICON Storage Is Better Than Direct Attached Storage
PDF
Mod 2 hardware_graphics.pdf
PPT
Vpu technology &gpgpu computing
PDF
[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe
PPTX
Transformer Zoo (a deeper dive)
PPT
Vpu technology &gpgpu computing
PPT
Vpu technology &gpgpu computing
Unite 2013 optimizing unity games for mobile platforms
Presentation NBMP and PCC
Tutorial on Point Cloud Compression and standardisation
Scalability for All: Unreal Engine* 4 with Intel
Your Game Needs Direct3D 11, So Get Started Now!
Lecture 15 ryuzo okada - vision processors for embedded computer vision
FIR filter on GPU
Programmable Exascale Supercomputer
graphics processing unit ppt
Exascale Capabl
OpenGL ES based UI Development on TI Platforms
Данило Ульянич “C89 OpenGL for ARM microcontrollers on Cortex-M. Basic functi...
NVIDIA Graphics, Cg, and Transparency
Why Networked FICON Storage Is Better Than Direct Attached Storage
Mod 2 hardware_graphics.pdf
Vpu technology &gpgpu computing
[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe
Transformer Zoo (a deeper dive)
Vpu technology &gpgpu computing
Vpu technology &gpgpu computing

Recently uploaded (20)

PDF
IT-ITes Industry bjjbnkmkhkhknbmhkhmjhjkhj
PDF
Lung cancer patients survival prediction using outlier detection and optimize...
PDF
Transform-Your-Streaming-Platform-with-AI-Driven-Quality-Engineering.pdf
PPTX
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
PDF
Dell Pro Micro: Speed customer interactions, patient processing, and learning...
PDF
Auditboard EB SOX Playbook 2023 edition.
PDF
Planning-an-Audit-A-How-To-Guide-Checklist-WP.pdf
PDF
LMS bot: enhanced learning management systems for improved student learning e...
PDF
Transform-Your-Factory-with-AI-Driven-Quality-Engineering.pdf
PPT
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
PPTX
Configure Apache Mutual Authentication
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PPTX
Module 1 Introduction to Web Programming .pptx
PPTX
MuleSoft-Compete-Deck for midddleware integrations
PDF
Enhancing plagiarism detection using data pre-processing and machine learning...
PPTX
Microsoft User Copilot Training Slide Deck
PPTX
agenticai-neweraofintelligence-250529192801-1b5e6870.pptx
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
PDF
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
PDF
Early detection and classification of bone marrow changes in lumbar vertebrae...
IT-ITes Industry bjjbnkmkhkhknbmhkhmjhjkhj
Lung cancer patients survival prediction using outlier detection and optimize...
Transform-Your-Streaming-Platform-with-AI-Driven-Quality-Engineering.pdf
AI-driven Assurance Across Your End-to-end Network With ThousandEyes
Dell Pro Micro: Speed customer interactions, patient processing, and learning...
Auditboard EB SOX Playbook 2023 edition.
Planning-an-Audit-A-How-To-Guide-Checklist-WP.pdf
LMS bot: enhanced learning management systems for improved student learning e...
Transform-Your-Factory-with-AI-Driven-Quality-Engineering.pdf
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
Configure Apache Mutual Authentication
Improvisation in detection of pomegranate leaf disease using transfer learni...
Module 1 Introduction to Web Programming .pptx
MuleSoft-Compete-Deck for midddleware integrations
Enhancing plagiarism detection using data pre-processing and machine learning...
Microsoft User Copilot Training Slide Deck
agenticai-neweraofintelligence-250529192801-1b5e6870.pptx
Custom Battery Pack Design Considerations for Performance and Safety
The-2025-Engineering-Revolution-AI-Quality-and-DevOps-Convergence.pdf
Early detection and classification of bone marrow changes in lumbar vertebrae...

Realtime 3D Visualization without GPU

  • 1. © Tobias Goetz, Fraunhofer ITWM 1 3D VIZ WITHOUT THE GPU The PV-4D Render Engine
  • 2. © Tobias Goetz, Fraunhofer ITWM 2 Content 1. Introduction 2. Features 3. Examples 4. Technology 5. Target Applications & Competition 6. Implementation: XtreemView
  • 3. © Tobias Goetz, Fraunhofer ITWM 3 Introduction PV-4D is  a parallel 3D render engine, i.e. software/code to generate images from three (or more) dimensional datasets.  completely CPU based, i.e. does not require graphic hardware  capable of rendering huge datasets  scalable, i.e. adjust the hardware to the problem not vice versa.  able to deliver photorealistic renderings as well as game/animation style images  easy and intuitive to implement in a software solution
  • 4. © Tobias Goetz, Fraunhofer ITWM 4 Introduction PV-4D directly supports three different kind of data types: 1. Volumetric Data (e.g. seismic data, MRI/CT datasets, 3D X-Rays, …) 2. Triangulated Objects (e.g. CAD/CAM, Games, Architecture, Film, …) 3. Polygon Objects (i.e. hexahedrons / tetrahedrons, e.g. reservoir simulations) Besides those datasets, PV-4D has an interface for regular OpenGL programming, thus users can add custom objects easily into and/or on top of a 3D scene
  • 5. © Tobias Goetz, Fraunhofer ITWM 5 Features  High quality display of full real amplitude values (32bit float) in HD quality.  Easy and gradual blending between two volumetric datasets.  Volume rendering on entire datasets. Transparency set by alpha value for the color, associated to certain values.  Real-time support for full quality zoom, pan, and rotate, as well as scaling and translating individual objects within the scene  Perspective and parallel projection for easy orientation and fast, distortion free browsing through large datasets.  Easy cross-section displays and slicing of x, y and z planes.  Easy advanced slicing along I, j, k hexa-planes with range selection in real-time using rebuilt multi bounding volume hierarchy.  Instant read-out of cursor position and amplitude value(s) at any given position.
  • 6. © Tobias Goetz, Fraunhofer ITWM 6 Features  PV-4D directly supports triangles, voxels, hexahedrons; all other objects indirectly with OpenGL  Full scene graph for easy management of multiple objects  OpenGL can co-exist in both directions using depth buffer management:  PV-4D  OpenGL  OpenGL  PV-4D  Available 32bit z-Buffer for integration of user objects with OpenGL/Mesa  On single node systems, OpenGL can be used directly on the GPU, on multi-node systems, Mesa3D is used. Engine switches automatically  Engine comes with documentation as well as example code snippets to illustrate how it is used.
  • 7. © Tobias Goetz, Fraunhofer ITWM 7 Features  State-of-the-art geometry handling (compiler, traversal, intersection)  Full HDR pipeline  Scene lighting using HDR environment maps (individual light sources under development)  High-Quality texture filtering  High-Quality anti-aliasing  C/C++ library (Intel compiler required to link because of specific intrinsics)  Builds under Linux, Windows and Mac (parallel version only for Linux)
  • 8. © Tobias Goetz, Fraunhofer ITWM 8 Examples High-Quality CAD Visualization  ~ 25 million triangles  Resolution: 2800 × 1050  Full ray differentials  HQ texture and normalmap filtering  Un-compressed textures (up to 4k x 4k)  16x anti-aliasing  Mitchell-Netravali-Filter reconstruction  Fully interactive framerate
  • 9. © Tobias Goetz, Fraunhofer ITWM 9 Examples Dynamic Fairy With Shadows  ~ 170,000 triangles (< 5 ms rebuild)  Keyfreame animation with 100k triangles  Resolution: 2800 × 1050  Full ray differentials  HQ texture and normalmap filtering  Un-compressed textures (up to 2 k × 2 k)  8 × anti-aliasing  Cone-filter reconstruction  > 60 fps on average
  • 10. © Tobias Goetz, Fraunhofer ITWM 10 Examples Triangulated CAD Model  ~ 350 million triangles  Resolution: 2800 × 1050  Triangles distorted/displaced on purpose by model provider  No textures, but up to uncompressed 4k x 4k possible  Fully interactive framerate on a single workstation
  • 11. © Tobias Goetz, Fraunhofer ITWM 11 Examples Interactive Volume Rendering  2x 57GB seismic dataset  First rendered using volume rendering, focusing on interesting layers with high amplitude values  Second rendered as regular volume and cut back to reveal transparent one  Resolution: 1900 x 1050  Full 32bit float value used  Fully interactive framerate on a dual workstation setting
  • 12. © Tobias Goetz, Fraunhofer ITWM 12 Technology  Realtime CPU-based parallel Raytracing, i.e.  no special graphics hardware required, and  no bandwidth bottleneck between main memory and GPU memory.  Keep all data in memory: instant access at all times  Parallel and scalable: match hardware to problem!  Parallel handling of data: double the hardware, double the speed Main Memory, RAM (256GB) CPU File System Graphic Card 4 – 8 GB ~100 GB/s ~8 GB/s Main Memory, RAM (256GB) CPU Main Memory, RAM (256GB) CPU Main Memory, RAM (256GB) CPU Main Memory, RAM (256GB) CPU Main Memory, RAM (256GB) CPU Main Memory, RAM (256GB) CPU Main Memory, RAM (256GB) CPU Main Memory, RAM (256GB) CPU Parallel File System ~100 GB/s each Classic approach: Data runs through GPU for visualization and needs to go through 8GB/s bottleneck. PV-4D approach: Omit GPU and do all visualization on CPU and in parallel.
  • 13. © Tobias Goetz, Fraunhofer ITWM 13 Technology: GPI2  Partitioned Global Address Space (PGAS) solution by Fraunhofer ITWM  Allows to create large block of memory over many compute nodes with direct read and write access for all nodes  asynchronous communication  Uses Ethernet (10GB / 40GB) or Infiniband Interconnects  Allows fastest image composition even with large number of compute nodes (the more nodes are calculating an image, the harder the compositing step)
  • 14. © Tobias Goetz, Fraunhofer ITWM 14 Technology  Several patented algorithms for under-the-hood tasks  Fast algorithms for bounding volume hierarchy (BVH) traversing  Hybrid acceleration structures with dual multi BVH  State of the art quad / hexahedron intersection detection  Realtime primitive compilers  Fastest software based image compositing over many nodes
  • 15. © Tobias Goetz, Fraunhofer ITWM 15 Anyone who needs fast, large scale data visualization Oil & Gas Communication Processing & Interpretation Interpretation Departments Processing DepartmentsAutomotive Design Stage Virtual Showroom Medical X-Ray, CRT, MRI Filming/ Gaming Animated Movies Fast CGI Architecture Life walk-through Photorealistic … Possible Application Fields
  • 16. © Tobias Goetz, Fraunhofer ITWM 16 QUESTIONS? Tobias Goetz North-America Representative Fraunhofer Institute for Industrial Mathematics (ITWM) Competence-Center High-Performance Computing San Francisco, CA 94103 cell: +1 (510) 908-0867 mail: [email protected]