Introducing Animation Fractal

This post introduces animation-fractal, an app to create live visuals. In two parts, I present:

  • How the project works, and,
  • What the implementation looks like.

But first, here is a video where I demonstrate the app along with a modular synthesizer:


Generative Art

The basic principle of generative art is to create images using a computer system. I am mostly interested in two of its techniques, IFS and SDF ray marching:

  • IFS stands for Iterated Function System, where we recursively evaluate a function and observe its behavior. For example, iterating \(z_{n+1} = z_n^2 + c\) produces the Mandelbrot set.

  • Ray marching is a form of ray casting where we create objects with a signed distance function (SDF).

We implement a function that produces a color for each pixel. This kind of work is best performed on the graphics card using a fragment shader.

Fragment Shader

The shadertoy website lets you interact with fragment shaders directly in your web browser. Given a pixel coordinate named fragCoord.xy, the shader code outputs the pixel color named fragColor to render a fullscreen image.

The shadertoy environment provides additional variables, such as the window resolution iResolution and the elapsed time iTime. The shader code is written in the GLSL ES language; here is an example:

shadertoy-mandelbrot
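
The figure above shows Shadertoy's Mandelbrot example. Here is a rough sketch of that kind of shader, my own reconstruction rather than the exact code from the screenshot:

void mainImage(out vec4 fragColor, in vec2 fragCoord) {
    // map the pixel to a point c on the complex plane
    vec2 c = 2.5 * (fragCoord - 0.5 * iResolution.xy) / iResolution.y - vec2(0.5, 0.0);
    vec2 z = vec2(0.0);
    float escape = 0.0;
    for (int n = 0; n < 100; n++) {
        // z = z^2 + c, using complex multiplication
        z = vec2(z.x * z.x - z.y * z.y, 2.0 * z.x * z.y) + c;
        if (dot(z, z) > 4.0) break;
        escape += 1.0;
    }
    // shade by normalized escape time
    fragColor = vec4(vec3(escape / 100.0), 1.0);
}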

The shader function mainImage is executed very quickly by the graphics card for each individual pixel. This execution is highly concurrent, which makes it suitable for realtime rendering.

Shadertoy is truly fascinating. Inspired by the possibilities, I wanted more:

  • Variables with controllers, and,
  • Offscreen rendering, using scripted modulations.

Thus, I had to build my own thing, and the next sections describe how I went about it.

Haskell GameDev

I decided to use the Haskell language to write animation-fractal. I chose Haskell because:

  • The expressive type system lets me define and modify every part of the system with the invaluable help of the compiler.
  • The tooling provides a great developer experience: a REPL with GHCi, editor integration with the haskell-language-server, and ghcid for reloading the code. Static analysis tools such as hlint, weeder, and calligraphy are also very useful.
  • The Haskell community is constantly producing interesting work and I find it fascinating to see such progress in the development of the language.

I previously used Haskell for writing a web service application named monocle, and I wanted to apply Haskell to a different task.

If you are interested in doing Haskell GameDev, check out haskell-game.dev, and I recommend this blog post: Text-Mode Games as First Haskell Projects.

Vulkan

To execute shader code, you can use either OpenGL or Vulkan. I picked Vulkan because it is the new standard, and it enables using different shader languages thanks to the new intermediate representation named SPIR-V.

Vulkan is notoriously difficult to use, and writing a bare-bones application is overly complicated. Thus I’m relying on the Keid engine to deal with most of the lower-level details, such as:

  • Context initialization.
  • Framebuffer creation and presentation.
  • Ready-to-use render pass and pipelines.

Keid introduces a concept of stages to define scene resources, event handlers, and how to draw the final image. Even though that sounds simple, you need a good understanding of computer graphics to use the engine. To get more familiar with this system, I implemented the vkguide using Keid in this project: keid-vkguide.

Here is an overview of my animation-fractal stage:

graph LR
  subgraph "Offscreen Pass"
    Variables --> SP(Fractal Pipeline)
    SP --> TX(Texture)
  end
  subgraph "Presentation"
    FB(Framebuffer)
  end
  subgraph "ForwardMSAA Pass"
    TX --> FP(Fullscreen Pipeline) --> FB
    DearImGui --> FB
  end

That is a fairly simple setup where I didn’t bother with 3D models or lighting, because everything is done by the shader code. The next section describes how the variables are provided to the render pipeline.

Shader input

The shader code is executed on a distinct processing unit, the GPU. After uploading the code, you have multiple options for passing in your application’s data.

From most static to most dynamic:

  • Shader specialization constants. Those are compiled-in and thus fastest. They even turn switches and ifs into no-ops!
  • Descriptor sets. Stuff like uniform buffer data, textures and samplers. Shared across all the shader “invocations”.
  • Instance and vertex attributes. Only for graphics pipelines.
  • Push constants. Tiny bits of data embedded right into the command buffer. Reasonably fast, but tiny: only 128 bytes are guaranteed.

The composite signature of the shader inputs is called the pipeline layout in Vulkan.
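
For illustration, here is roughly how each kind of input can be declared in Vulkan GLSL (the names are hypothetical, not taken from animation-fractal):

layout(constant_id = 0) const int MAX_STEPS = 100;                  // specialization constant
layout(set = 0, binding = 0) uniform Params { float zoom; } params; // descriptor set
layout(location = 0) in vec2 uv;                                    // vertex attribute
layout(push_constant) uniform Push { float time; } push;            // push constant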

Descriptor Set

I don’t fully understand this part (any suggestions for improvement would be appreciated), and to keep things simple, I used a static descriptor set containing a single structure declared as uniform.

Here’s how it is defined in GLSL:

layout(set=0, binding=1, std140) uniform Globals {
  float screenRatio;
  uvec2  screenResolution;
  vec2  origin;
  float zoom;
  float var1;
  float var2;
} scene;

And here’s its CPU counterpart:

data Scene = Scene
    { screenRatio :: Float
    , screenResolution :: UVec2
    , origin :: Vec2
    , zoom :: Float
    , var1 :: Float
    , var2 :: Float
    }
    deriving (Generic)

instance GStorable Scene

GStorable is a helper class from the derive-storable library that removes some of the boilerplate and, most of the time, does the right thing.

In a future version I will look into automatically deriving one definition from the other.

Next, I wanted to define custom scene variables that can be adjusted while the application is running.

GPU Buffer Lens

Thanks to the StateVar and generic-lens libraries, I created a couple of helper modules.

For example, a 2d vector is defined with this function:

-- | Create a named 2d variable: one controller state plus an
-- output var for each component.
newVec2Var :: Text -> Lens' Scene Vec2 -> Worker.Var Scene -> STM Variable
newVec2Var name sceneLens sceneState = do
    current <- newTVar 0
    mx <- sceneOutVar (name <> ".x") sceneLens v2x sceneState current
    my <- sceneOutVar (name <> ".y") sceneLens v2y sceneState current
    pure $ Variable name (ControllerVec2 (tvar2StateVar current) mx.current my.current) [mx, my]

… which uses these lens composition helpers:

-- | Connect one component of a controller value to its Scene field
-- by composing the two lenses.
sceneOutVar :: Text -> Lens' Scene a -> Lens' a Float -> Worker.Var Scene -> TVar a -> STM OutVar
sceneOutVar name sceneLens valueLens sceneState controllerState =
    newOutVar name controllerVar sceneVar
  where
    controllerVar = tvarStateLens valueLens controllerState
    sceneVar = makeSceneStateVar (sceneLens . valueLens) sceneState

-- | Expose a Scene field stored in a Worker.Var as a StateVar.
makeSceneStateVar :: Lens' Scene a -> Worker.Var Scene -> StateVar a
makeSceneStateVar sceneLens sceneState = makeStateVar getv setv
  where
    getv = view sceneLens <$> Worker.readVar sceneState
    setv v = Worker.pushInput sceneState $ sceneLens .~ v
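
The snippets above rely on tvar2StateVar and tvarStateLens, which are not shown in this post. Here is a minimal sketch of what they could look like, assuming Data.StateVar and plain STM (my guess, not the actual implementation):

import Control.Concurrent.STM
import Control.Lens (Lens', view, (.~))
import Data.StateVar (StateVar, makeStateVar)

-- Expose a whole TVar as a StateVar.
tvar2StateVar :: TVar a -> StateVar a
tvar2StateVar tv = makeStateVar (readTVarIO tv) (atomically . writeTVar tv)

-- Expose a single Float component of a TVar through a lens.
tvarStateLens :: Lens' a Float -> TVar a -> StateVar Float
tvarStateLens valueLens tv = makeStateVar
    (view valueLens <$> readTVarIO tv)
    (\v -> atomically (modifyTVar' tv (valueLens .~ v)))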

This system makes it possible to expose arbitrary scene values as generic variables. For example, a scene declares its variables like this:

variables = [
    newVec2Var "origin" #origin
  , newFloatVar "zoom" #zoom
  , newSciVar "pow" #var1
]

… which results in this representation:

af-variables

Check out the Arbitrary precision float controller post for more details.

This implementation enables the scene variables to be defined through the static uniform descriptor set. When any value changes, the full scene data is updated and uploaded to the GPU to trigger a new render. This abstraction is useful for implementing generic modulations.
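
As an illustration of that upload step, here is a sketch assuming the engine hands us a persistently mapped pointer into the uniform buffer (scenePtr and uploadScene are hypothetical, not the actual animation-fractal code):

import Foreign.Ptr (Ptr)
import Foreign.Storable (poke)

-- Write the whole Scene into the mapped uniform buffer; the
-- Storable instance comes from the GStorable deriving above.
uploadScene :: Ptr Scene -> Worker.Var Scene -> IO ()
uploadScene scenePtr sceneState = do
    scene <- Worker.readVar sceneState
    poke scenePtr scene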

Modulation Input

To make the visuals more interesting, I wanted to modulate the variables based on external events. As presented in the Varying Modulation post, I created a couple of helper modules.

The inputs, such as a pulseaudio line-in or PortMidi, run in their own threads and trigger modulations asynchronously. Then, for each frame, the process works as follows:

  • Copy the current variable value from the controller, as defined by the user.
  • Update the modulation and apply it to each target.
  • Push the final value to the scene data type (see the sketch below).
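
To make these three steps concrete, here is a self-contained sketch of one frame of modulation; Modulation and Target are illustrative stand-ins, not the actual animation-fractal types:

{-# LANGUAGE OverloadedRecordDot #-}
import Data.IORef

data Target = Target
    { controllerValue :: IORef Float -- value set by the user
    , sceneValue :: IORef Float      -- final value pushed to the scene
    }

data Modulation = Modulation
    { envelope :: IORef Float -- amount, raised asynchronously by the inputs
    , targets :: [Target]
    }

-- One frame: update the modulation envelope, then apply it to each target.
stepFrame :: Float -> Modulation -> IO ()
stepFrame decay m = do
    amount <- readIORef m.envelope
    writeIORef m.envelope (amount * decay) -- simple exponential decay
    mapM_ (applyTo amount) m.targets
  where
    applyTo amount target = do
        -- start from the controller value set by the user...
        base <- readIORef target.controllerValue
        -- ...and push the modulated value to the scene
        writeIORef target.sceneValue (base + amount)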

To make this interactive, I implemented an interface to configure the modulation at runtime:

af-modulations-gui

This part is still under construction, and I am looking for new algorithms to provide better modulations. For example, I would like to implement second-order dynamics as presented in this video: Giving Personality to Procedural Animations using Math.
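
For reference, here is a sketch of such second-order dynamics using the semi-implicit Euler formulation from that video (f is the frequency, zeta the damping ratio, r the initial response; this is not part of animation-fractal yet):

{-# LANGUAGE OverloadedRecordDot #-}

data SecondOrder = SecondOrder
    { k1, k2, k3 :: Float -- constants derived from f, zeta, and r
    , xp :: Float         -- previous input
    , y, yd :: Float      -- output position and velocity
    }

mkSecondOrder :: Float -> Float -> Float -> Float -> SecondOrder
mkSecondOrder f zeta r x0 = SecondOrder
    { k1 = zeta / (pi * f)
    , k2 = 1 / ((2 * pi * f) ** 2)
    , k3 = r * zeta / (2 * pi * f)
    , xp = x0, y = x0, yd = 0
    }

-- Advance the system by dt toward the target value x.
update :: Float -> Float -> SecondOrder -> SecondOrder
update dt x s = s { xp = x, y = y', yd = yd' }
  where
    xd = (x - s.xp) / dt -- estimate the input velocity
    y' = s.y + dt * s.yd -- integrate the position first
    yd' = s.yd + dt * (x + s.k3 * xd - y' - s.k1 * s.yd) / s.k2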

Conclusion

I faced quite a few grey-hair-inducing issues caused by Vulkan and Wayland while working on this project. I questioned my technical decisions multiple times, and I was tempted to start over using a more popular stack, such as WebGL or wgpu.rs. Although it felt like I kept banging my head against the wall, I eventually found the causes and fixed them. I’m glad I did, because I gained invaluable knowledge of computer graphics I wouldn’t have learnt otherwise.

I was also worried about space leaks or GC pauses, but they have not been a problem so far. Even though I skipped premature optimizations, the application already runs at a steady 60 frames per second using less than 10% of the available CPU.

I am quite happy with the end result, and I’m looking forward to continuing the development of animation-fractal.

Cheers!

#blog #haskell #vulkan #fractal