Caustic Graphics

Caustic Graphics was a computer graphics and fabless semiconductor company that developed technologies to bring real-time ray-traced computer graphics to the mass market.

The company name derives from caustics, the optical effect produced when light is concentrated onto a surface after being focused by reflection or refraction.

Caustic was founded on the premise that realistic 3D graphics would be easier to create if GPU hardware were as efficient at processing a ray as it was at processing a vertex or fragment using existing rasterisation methods.

History
Caustic was founded in 2006 by three ex-Apple engineers, one of whom had developed graphics technologies for the fifth-generation iPod and first-generation iPhone. The founding business plan was to build a complete real-time ray-traced graphics system, initially to accelerate professional 3D visualisation and later to serve entertainment applications on PCs and gaming consoles.

In 2008, Caustic Graphics acquired Splutterfish, a company that developed and sold a production rendering software product called Brazil R/S.

Caustic later released a series of commercial software products called "Visualiser Plugins" that leveraged the Brazil R/S rendering technology and Caustic hardware to enable interactive and offline ray-traced rendering inside Autodesk 3ds Max, Autodesk Maya, McNeel Rhino and, later, SketchUp.

In 2010, Caustic Graphics was acquired by Imagination Technologies.

Hardware
The Caustic ray tracing hardware was not a replacement for existing rasterisation GPUs; rather, it added several new hardware functions to increase the efficiency of ray tracing:
 * 1) Coherence gathering: The ability for the GPU to generate work for itself and for those work-units to be deferred and sorted into coherent groups that were requesting a common resource, such as a data or shader program address in memory.
 * 2) Fixed-function triangle intersection units: A unit was developed that could intersect a ray with an edge-sharing triangle pair and return barycentric coordinates and other hit information.
 * 3) Fixed-function bounding volume intersection units: A unit was developed that could intersect a ray with an axis-aligned bounding box (AABB) and return hit information.
 * 4) Framebuffer accumulator: This unit would receive commands from the ALUs to add colour to a given pixel in the frame buffer.
 * 5) Scene hierarchy generator: This unit would read lists of triangles (or other scene geometry representations) and stream out to memory a directed acyclic graph of 3D volumes surrounding the geometry, known as a bounding volume hierarchy.
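
The two intersection tests in items 2 and 3 can be illustrated in software. The following is a minimal sketch, not the Caustic hardware design: it assumes the widely used Möller–Trumbore algorithm for the triangle test and the slab method for the AABB test, and all function names are hypothetical.

```python
from typing import Optional, Tuple

Vec3 = Tuple[float, float, float]

def sub(a: Vec3, b: Vec3) -> Vec3:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def dot(a: Vec3, b: Vec3) -> float:
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a: Vec3, b: Vec3) -> Vec3:
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def intersect_triangle(origin: Vec3, direction: Vec3,
                       v0: Vec3, v1: Vec3, v2: Vec3,
                       eps: float = 1e-9) -> Optional[Tuple[float, float, float]]:
    """Moller-Trumbore test. Returns (t, u, v): the hit distance and the
    barycentric weights of v1 and v2, or None on a miss."""
    e1, e2 = sub(v1, v0), sub(v2, v0)
    p = cross(direction, e2)
    det = dot(e1, p)
    if abs(det) < eps:            # ray is parallel to the triangle plane
        return None
    inv = 1.0 / det
    s = sub(origin, v0)
    u = dot(s, p) * inv
    if u < 0.0 or u > 1.0:
        return None
    q = cross(s, e1)
    v = dot(direction, q) * inv
    if v < 0.0 or u + v > 1.0:
        return None
    t = dot(e2, q) * inv
    return (t, u, v) if t > eps else None

def intersect_triangle_pair(origin: Vec3, direction: Vec3,
                            a: Vec3, b: Vec3, c: Vec3, d: Vec3):
    """Test triangles (a, b, c) and (a, c, d), which share the edge a-c.
    Returns (triangle_index, t, u, v) for the nearest hit, or None."""
    best = None
    for index, tri in enumerate(((a, b, c), (a, c, d))):
        hit = intersect_triangle(origin, direction, *tri)
        if hit is not None and (best is None or hit[0] < best[1]):
            best = (index,) + hit
    return best

def intersect_aabb(origin: Vec3, direction: Vec3,
                   lo: Vec3, hi: Vec3) -> Optional[Tuple[float, float]]:
    """Slab test. Returns the (entry, exit) distances along the ray, or None."""
    t_near, t_far = 0.0, float("inf")
    for axis in range(3):
        o, d = origin[axis], direction[axis]
        if d == 0.0:              # ray is parallel to this pair of slabs
            if o < lo[axis] or o > hi[axis]:
                return None
            continue
        t0, t1 = (lo[axis] - o) / d, (hi[axis] - o) / d
        if t0 > t1:
            t0, t1 = t1, t0
        t_near, t_far = max(t_near, t0), min(t_far, t1)
        if t_near > t_far:
            return None
    return (t_near, t_far)
```

Testing two triangles that share an edge in one call lets the shared vertices be fetched once, which is one plausible reason for pairing them; the source does not state the motivation.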

Coherence gathering
The Caustic architecture parallelised execution not on pixels, but on individual rays. This allowed the hardware to gather rays into coherent groups that would perform a common operation, such as accessing the same memory address within the scene acceleration structure or underlying geometry, or executing the same material shader program.
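
The gathering idea can be sketched in software as a set of bins keyed by the resource each ray requests. This is an illustrative model only: the class name, the fixed batch size and the dispatch callback are assumptions and do not describe the actual hardware.

```python
from collections import defaultdict
from typing import Any, Callable, Dict, Hashable, List

class CoherenceGatherer:
    """Defers rays and groups them by the resource they request
    (for example a BVH node address or a shader program address)."""

    def __init__(self, batch_size: int,
                 dispatch: Callable[[Hashable, List[Any]], None]) -> None:
        self.batch_size = batch_size      # assumed fixed group width
        self.dispatch = dispatch          # called once per coherent group
        self.bins: Dict[Hashable, List[Any]] = defaultdict(list)

    def submit(self, key: Hashable, ray: Any) -> None:
        """Defer a ray; dispatch its bin once it holds a full batch."""
        bin_ = self.bins[key]
        bin_.append(ray)
        if len(bin_) >= self.batch_size:
            self.dispatch(key, self.bins.pop(key))

    def flush(self) -> None:
        """Dispatch every partially filled bin."""
        for key in list(self.bins):
            self.dispatch(key, self.bins.pop(key))

def demo() -> list:
    batches = []
    gatherer = CoherenceGatherer(2, lambda key, rays: batches.append((key, rays)))
    # Rays arrive interleaved, but leave grouped by the node they touch.
    for ray_id, node in enumerate(["node_A", "node_B", "node_A", "node_B", "node_C"]):
        gatherer.submit(node, ray_id)
    gatherer.flush()
    return batches
```

In this model each dispatched group touches its resource once on behalf of every ray in the group, which is the saving that coherence gathering aims for.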

Streaming bounding volume hierarchy generator
Caustic developed a hardware pipeline that was capable of building a bounding volume hierarchy bottom-up by directly consuming assembled primitives after vertex shading and gradually streaming out a graph of AABB nodes to DRAM in a single pass. It worked by implementing a conservative triangle voxeliser, which would produce spatial addresses into a cache-like structure to group triangles and AABBs within common parts of 3D space. This relied on the application submitting triangles in a generally spatially coherent order. Failure to do so would result in premature evictions from the spatial cache, which would generate overlapping bounding volumes and incur a higher ray traversal overhead. The hardware included a feature to increase the voxelisation rate for long, narrow triangles to permit more efficient bounding and higher rendering performance for off-axis geometry.

This hardware unit, known as the scene hierarchy generator (SHG), was implemented within the PowerVR 6XT GR6500 from Imagination. In a demonstration, the company claimed that in its 600 MHz test silicon, the SHG had a throughput of 100 million dynamic triangles per second.
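
The effect of submission order on the spatial cache can be illustrated with a small software model. This is a heavily simplified sketch, not the SHG design: it assumes a uniform grid, assigns each triangle to the cell containing its centroid instead of using a conservative voxeliser, uses least-recently-used eviction, and produces only leaf boxes, not the full hierarchy. All names and parameters are hypothetical.

```python
from collections import OrderedDict

def tri_bounds(tri):
    """Axis-aligned bounds (lo, hi) of a triangle given as three (x, y, z) tuples."""
    xs, ys, zs = zip(*tri)
    return (min(xs), min(ys), min(zs)), (max(xs), max(ys), max(zs))

def merge(a, b):
    """Smallest box enclosing boxes a and b."""
    (alo, ahi), (blo, bhi) = a, b
    return tuple(map(min, alo, blo)), tuple(map(max, ahi, bhi))

def cell_of(tri, cell_size):
    """Grid cell containing the triangle centroid (a simplification)."""
    centroid = [sum(v[i] for v in tri) / 3.0 for i in range(3)]
    return tuple(int(c // cell_size) for c in centroid)

def stream_leaves(triangles, cell_size=1.0, cache_slots=1):
    """Single pass over the triangle stream. Each cache slot grows one leaf
    box; an evicted slot is streamed out as a finished (bounds, count) node."""
    cache = OrderedDict()   # cell -> (bounds, triangle count), oldest first
    emitted = []
    for tri in triangles:
        key, box = cell_of(tri, cell_size), tri_bounds(tri)
        if key in cache:
            old_box, count = cache.pop(key)
            cache[key] = (merge(old_box, box), count + 1)   # refresh recency
        else:
            if len(cache) >= cache_slots:
                emitted.append(cache.popitem(last=False)[1])  # evict oldest
            cache[key] = (box, 1)
    emitted.extend(cache.values())   # flush whatever is left at end of stream
    return emitted

def make_tri(x):
    return ((x, 0.0, 0.0), (x + 0.5, 0.0, 0.0), (x, 0.5, 0.0))

a1, a2, b1, b2 = make_tri(0.0), make_tri(0.2), make_tri(5.0), make_tri(5.2)
coherent = stream_leaves([a1, a2, b1, b2])      # neighbours submitted together
interleaved = stream_leaves([a1, b1, a2, b2])   # spatially incoherent order
```

With a single cache slot, the coherent order yields two leaf nodes holding two triangles each, while the interleaved order forces an eviction on every triangle and yields four single-triangle nodes, two of which overlap in each region of space.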

CausticGL/OpenRL API
The OpenRL API (previously called CausticGL) was derived from OpenGL ES 2.0 and added a series of features to both the host API and GLSL that were required to support ray tracing. Caustic shipped high-performance implementations of the API for SSE- and AVX-capable Intel CPUs, OpenCL-capable GPUs and CUDA-capable NVIDIA GPUs.

OpenRL SDK
The OpenRL API was shipped in a free SDK with implementations for Intel CPUs, OpenCL- and CUDA-compatible GPUs and the Caustic hardware. Applications built on the OpenRL SDK would automatically detect supporting hardware and use the highest-performing available option.
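
Automatic selection of this kind is commonly implemented as a ranked list of probes. The sketch below is a generic illustration and is not taken from the OpenRL SDK: the backend names, the ranking order and the probe functions are all assumptions.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical (name, probe) pairs, ordered from most to least preferred.
Backend = Tuple[str, Callable[[], bool]]

def select_backend(candidates: List[Backend]) -> Optional[str]:
    """Return the name of the first backend whose probe succeeds, or None."""
    for name, probe in candidates:
        try:
            if probe():
                return name
        except Exception:
            # A probe that raises is treated as "backend not available".
            continue
    return None

# Example machine: no dedicated card and no CUDA device, but OpenCL works.
example = [
    ("caustic_hardware", lambda: False),
    ("cuda", lambda: False),
    ("opencl", lambda: True),
    ("cpu", lambda: True),
]
chosen = select_backend(example)
```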

The SDK included a series of example programs; a tool called "Profiler", which helped debug applications by allowing full inspection of the ray tracing "tree" within a pixel; and a tool called "StatsPlotter", which exposed internal performance counters.

PowerVR Brazil SDK
PowerVR Brazil SDK (also known as Resin) was a software layer built on the OpenRL API that enabled physically based rendering implementations, programmable material shaders, dynamic geometry and hardware platform independence. The Resin technology grew out of the expertise of the Brazil R/S team, which Caustic Graphics had acquired.

PowerVR Wizard / PLATO
After the acquisition by Imagination Technologies, an initiative began to integrate the Caustic ray tracing coherence gathering, fixed-function geometry testers and a new streaming BVH generator into an existing Series 6 PowerVR GPU architecture. This allowed GLSL shaders, which previously had to run on a host CPU, to be executed on-chip using the existing PowerVR shading clusters while still benefiting from the ray reordering performed by the Caustic traversal unit.

Additionally, the GPU was still a tile-based deferred rasteriser (TBDR) and could perform hybrid rasterisation-raytracing by allowing triangle fragment shaders to emit rays directly into the ray tracing pipeline.
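
The hybrid flow can be modelled as a raster pass whose fragment shaders write a direct contribution and emit secondary rays, followed by ray shading that adds weighted colour to the originating pixel through a framebuffer accumulator. The sketch below is a toy model under stated assumptions: a single scalar colour channel, a constant environment value, and hypothetical function names and constants.

```python
from collections import deque
from dataclasses import dataclass
from typing import Deque, Dict, Tuple

Pixel = Tuple[int, int]

@dataclass
class Ray:
    pixel: Pixel     # framebuffer location this ray contributes to
    weight: float    # how much of the ray's result reaches that pixel

class FrameBuffer:
    def __init__(self, width: int, height: int) -> None:
        self.pixels: Dict[Pixel, float] = {
            (x, y): 0.0 for y in range(height) for x in range(width)
        }

    def accumulate(self, pixel: Pixel, value: float) -> None:
        """Add colour to a pixel; additions may arrive in any order."""
        self.pixels[pixel] += value

def shade_fragment(pixel: Pixel, fb: FrameBuffer, rays: Deque[Ray]) -> None:
    fb.accumulate(pixel, 0.25)       # assumed direct lighting term
    rays.append(Ray(pixel, 0.5))     # emit one secondary ray into the pipeline

def shade_ray(ray: Ray, fb: FrameBuffer) -> None:
    environment = 1.0                # assumed constant result for a ray that misses
    fb.accumulate(ray.pixel, ray.weight * environment)

def render(width: int, height: int) -> FrameBuffer:
    fb = FrameBuffer(width, height)
    rays: Deque[Ray] = deque()
    for pixel in list(fb.pixels):    # raster pass, one fragment per pixel
        shade_fragment(pixel, fb, rays)
    while rays:                      # ray pass; order is unrelated to pixel order
        shade_ray(rays.pop(), fb)
    return fb
```

Because each ray carries its pixel and weight, rays can be processed in whatever order the hardware finds most coherent without changing the final image.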

Imagination Technologies taped out test silicon containing the complete PowerVR 6XT GR6500 ray tracing GPU and mounted it on a PCI Express card called PLATO, which was demonstrated publicly at various events. The company claimed that it had memory bandwidth and power consumption characteristics similar to those of a mid-range consumer smartphone in 2016.

PCIe cards
A pair of PCIe card products were launched incorporating the Caustic RT2, an implementation of the ray tracing unit (RTU). The RT2 was a custom ASIC built on a 90 nm TSMC process. The R2100 card had a single RT2 chip, 4 GB of onboard DDR2 memory and a claimed peak power consumption of 40 watts. The R2500 incorporated a pair of RT2 chips and a total of 16 GB of memory, and had a claimed peak power consumption of 60 watts. The cards were released for sale in 2013.

The CausticOne PCIe card had two FPGA implementations of the RTU, each running at 100 MHz, with a single channel of DDR2 memory for scene storage and a 4-lane PCI Express 1.0 connection to the host. It was demonstrated in 2009 and sold in limited volume later that year.