Neshny

Neshny is an C++/WebGPU/OpenGL library for games and simulations. It is designed to be unobtrusive - use as little or as much of it as you wish. It is not an engine, rather a tool to help make writing your own engine easier. The core feature is an entity system that runs mostly on the graphics card. There is an optional UI that can overlay your window and display valuable debugging information.

There are two main ways to use it - with OpenGL or with WebGPU. The two are mutually exclusive. The recommendation is to pick WebGPU, as it is set up to cross compile to both the desktop and the browser. OpenGL support will continue for some time but may be removed in the distant future, depending on how well WebGPU does.

Dependencies

Qt >= v6
Dear ImGui >= v1.89 (included in starter)
Metastuff (included in starter)
SDL >= 2.26
CMake >= 3.18
NodeJS >= 16.0
Python >= 3.0 (for now)

Installation

Clone this repo to a location of your choice:

git clone https://github.com/GrigoryGraborenko/neshny.git

Using CMake

Pick one of the starter projects in the Examples directory - for example, the EmptyWebGPU_Jumbo will create a near-empty cmake based project optimized for Visual Studio. It also sets up a pattern for Jumbo builds - this is highly recommended, as this reduces all compilation speeds down to seconds.

Make a copy of the entire directory and place it wherever you like. Ensure you have cmake 3.18 or greater installed.

Rename the UserSettings.cmake.template to UserSettings.cmake and edit it to point to the correct directories where libraries are installed:

set(NESHNY_DIR "C:/Code/Neshny") # change all these to your local
set(QT_DIR "C:/Qt/6.5.0/msvc2019_64")
set(QT_WASM_DIR "C:/Qt/6.5.0/wasm_singlethread")
set(SDL2_DIR "C:/SDL/SDL2-2.0.14")
set(SHADER_PATH "src/shaders")
set(DAWN_PATH "C:/CodeLib/Dawn") # where you would like it to be installed - see below

Also make sure to rename ProjectSettings.cmake.template to ProjectSettings.cmake and change the name of the project:

set(PROJECT_NAME "EmptyQTJumboBuild") # change to your project name

Then navigate to the root of the copied project and run this command in the terminal:

cmake . -B build -G "Visual Studio 17 2022" -A x64

Replace "Visual Studio 17 2022" with whatever compiler you prefer. This will generate all the project files needed.

WebGPU - installing dawn

If you're using WebGPU, you will need to install dawn for desktop builds. Once cmake is run, you should have a file in the root directory called dawn_setup.js. Make sure node and python are both installed (apologies, google still needs python - will see if I can remove this dependency later) and run:

node dawn_setup.js

This will download and build dawn in your DAWN_PATH and copy all required files to your external folder.

Then open up the solution/project files and run it. You should see this:

If you are using WebGPU, to make a browser build, run this:

emcmake cmake -B build-web -G "Ninja" -DCMAKE_BUILD_TYPE=Release

Then when you need to compile and host on port 8000, run:

cmake --build build-web
node server

Manually

Clone this repo and point to the base dir in your include directories list. Then add

// put this in your headers
#include <IncludeAll.h> // Neshny
// put this in your main.cpp or similar
#include <IncludeAll.cpp> // Neshny
using namespace Neshny;

This assumes you already have QT, SDL, ImGui and Metastuff installed.

WebGPU usage

TODO

OpenGL usage

Shader and buffer convenience classes

GLShader shader;
QString err_msg;
// takes error string ref, lambda that loads the actual file given a path,
// and three paths for vertex, fragment and geometry shaders
// geometry shader is optional and can be left empty
shader.Init(err_msg, [] (QString path, QString& err_msg) -> QByteArray {
    QFile file(QString("shader_prefix\\%1").arg(path));
    if (!file.isOpen()) {
        err_msg = "File error - " + file.errorString();
        return QByteArray();
    }
    return file.readAll();
}, "shader.vert", "shader.frag", "shader.geom", "uniform int extra_uniform;");
shader.UseProgram();
glUniform1i(shader.GetUniform("extra_uniform"), 123);

Shader loading and viewing

Here's an example of using a rendering shader:

// dynamically load the shader that corresponds to Fullscreen.vert and Fullscreen.frag
// loads from "../src/Shaders/", set by cmake var SHADER_PATH in UserSettings.cmake
GLShader* prog = Core::GetShader("Fullscreen");
prog->UseProgram();
glUniform1i(prog->GetUniform("uniform_name"), 123);

GLBuffer* buff = Core::GetBuffer("Square"); // get a built-in model
buff->UseBuffer(prog); // attach the program to the buffer
buff->Draw(); // executes the draw call

Here's how you would run a compute shader:

// dynamically load the shader that corresponds to Compute.comp
// loads from "../src/Shaders/", set by cmake var SHADER_PATH in UserSettings.cmake
GLShader* compute_prog = Core::GetComputeShader("Compute");
compute_prog->UseProgram();

// runs 64 instances
// second var is result of multiplying local sizes together
// usually set by layout(local_size_x = 8, local_size_y = 8, local_size_z = 8) in;
// will call glDispatchCompute multiple times if it exceeds 64x512 instances
// automatically populates uCount and uOffset integer uniform
Core::DispatchMultiple(compute_prog, 64, 512);

// or you could manage it yourself:
// glDispatchCompute(4, 4, 4);

GPU entities and pipelines

A GPUEntity manages the concept of an entity that lives and dies entirely in GPU memory. It is best used with a Pipeline, which is a wrapper around a call to a compute shader that modifies or renders the entity, and handles all the setup and initialization. You start by defining a plain old data data type with packing alignment of 1. Then you add each member to a metastuff definition. Then each tick you call PipelineStage::ModifyEntity with a compute shader that runs for each entity, modifies it, and can delete it as well.

#pragma pack(push)
#pragma pack (1) // padding will break the GPU transfer if you don't have this

struct GPUProjectile {
    int     p_Id; // all entities should have an integer ID
    fVec3   p_Pos;
    fVec3   p_Vel;
    int     p_Type;
    int     p_State;
};

#pragma pack(pop)

// gotta define all struct members using metastuff to ensure correct storage
namespace meta {
    template<>
    inline auto registerMembers<GPUProjectile>() {
        return members(
            member("Id", &GPUBoid::p_Id),
            member("Pos", &GPUBoid::p_Pos),
            member("Vel", &GPUBoid::p_Vel),
            member("Type", &GPUBoid::p_Type),
            member("State", &GPUBoid::p_State)
        );
    }
}

// store this somewhere permanent so it persists across frames
GPUEntity projectiles(
    "Projectile", // name used for debugging
    GPUEntity::DeleteMode::MOVING_COMPACT, // no gaps but cannot trust an index to it, may be moved
    // DeleteMode::STABLE_WITH_GAPS will stay put so you can reference it
    // will have holes in the data structure though
    &GPUProjectile::p_Id, // pointer to ID member
    "Id" // name of ID member
);
// should be done when you have an active OpenGL context
// add in the max number of these entities you will EVER have
// for performance reasons this GPU memory will be pre-allocated
projectiles.Init(100000);

for (int i = 0; i < 10; i++) {
    GPUProjectile proj{
        0, // id doesn't matter, gets assigned by system
        fVec3(100.0, 110.0, 120.0),
        fVec3(1.0, 0.0, 0.0),
        1,
        123 // p_State
    };
    m_Projectiles.AddInstance((float*)&proj);
}

PipelineStage::ModifyEntity(
    projectiles // GPU entity
    ,"ProjectileMove" // name of compute shader
    ,true // do you want the main function to be written for you?
    ,{ "MACRO_VALUE 123" } // will set up a #define of these items
)
.AddCreatableEntity(some_other_entity)
.AddEntity(read_only_entity)
.AddSSBO("buffer_name", data_buffer, MemberSpec::T_INT)
.Run([delta_seconds](GLShader* prog) { // optional lambda that gets called right before dispatch
    glUniform1f(prog->GetUniform("uDeltaSeconds"), delta_seconds);
});

// when you wanna render the item
PipelineStage::RenderEntity(
    projectiles // GPU entity
    ,"ProjectileRender" // name of .frag, .vert and optionally .geom shader
    ,false, // do you want the main vertex function to be written for you?
    ,Core::GetBuffer("Square") // buffer to render with
    { "DEFINITION 1234" }
)
.Run([&vp](GLShader* prog) {
    glUniformMatrix4fv(prog->GetUniform("uWorldViewPerspective"), 1, GL_FALSE, vp.data());
});

This is what the ProjectileMove compute shader might look like:

uniform float uDeltaSeconds;

// if you said true to the third arg, use this pattern:
bool ProjectileMain(int item_index, Projectile proj, inout Projectile new_proj) {
    // proj is the input entity
    // new_proj is a copy of the entity - changes you make to it will be saved

    new_proj.State = proj.State - 1;
    new_proj.Pos = proj.Pos + proj.Vel * uDeltaSeconds;
    new_proj.Vel.y -= 0.98; // add gravity

    if(new_proj.State < 0) {
        return true; // deletes the entity
    }
    return false; // it lives to fight another iteration
}

Features

Viewing the editor overlay

Assuming you already have ImGui installed, call these functions each render cycle, preferrably near the end of the cycle:

if (show_editor) {
    // optional rendering of debug visuals
    DebugRender::Render3DDebug(view_perspective_matrix, width, height);
    // reset for per-cycle debugging visuals
    DebugRender::Clear();
    // this will do all UI/UX for the editor
    Core::RenderEditor();
}

Entity data viewer with time-travel

Debugging GPU entities can be difficult, since you can't just inspect CPU memory at a breakpoint like a regular C++ object. Thus Neshny provides a buffer viewer which can capture the internal structure of a GPU entity each frame and display it in a human readable format. Furthermore, you can store these captures for a customizable number of frames and rewind, or step frame by frame until you figure out what's going on. Be mindful that this is for debug purposes only and will slow down your framerate and consume a lot of memory.

The way this works is that at every checkpoint where you care about the internals of an entity or SSBO, you add this:

// this checkpoint is called tick because it's called once per tick
// call it what you like, as often as you like - each call creates a temp copy
BufferViewer::Checkpoint("Tick", entity);

If you open up the buffer viewer, you should see something like this once you tick the "All" checkbox, or the checkbox next to the specific buffer you are interested in. All the entities will be displayed in rows, and each column shows one of the previous frame's data for that entity.

There is also a scrollbar up the top where you can rewind to a point in time some frames ago. The newest are on the left, and get older as you go right. Time travelling will also render that old data within your game, but only if you use the PipelineStage::RenderEntity system. It works by substituting the chunk of memory saved previously instead of the data that is currently stored for that entity when rendering.

Cameras

There are currently three convenience camera classes provided: Camera2D, Camera3DOrbit and Camera3DFPS. They only exist to provide a useful Matrix4 view perspective matrix. There are some convenience functions to move them around, but currently the recommended usage is to modify their internals directly, such as p_Pos for position or p_Zoom in Camera2D.

Resource system

The resource system uses threads to load and initialize resources in the background. Calling a resource via Core::GetResource the first time will initiate the loading process - every subsequent call will return either PENDING, IN_ERROR or 'DONE' for m_State [TODO change m_ to p_]. Once it is in the DONE state it will be cached and immediately return a pointer to the resource in question. This is designed to be used with a functional-style loop, much like ImGui. The call itself should be lightweight and block the executing thread for near-zero overhead. Each tick you get the same resource for as long as you require it, and it may be several or even hundreds of ticks later that the resource is resolved.

auto tex = Core::GetResource<Texture2D>("../images/example.png");
if(tex.IsValid()) {
    glBindTexture(GL_TEXTURE_2D, tex->Get().GetTexture());
} else if(tex.m_State == ResourceState::PENDING) {
    // show some loading info
} else if(tex.m_State == ResourceState::IN_ERROR) {
    // show info about error using tex.m_Error
}

Currently there are resources for SoundFile (using SDL), Texture2D, TextureTileset and TextureSkybox. Creating your own resource type is easy - simply subclass from Resource and implement the abstract function virtual bool Init(QString path, QString& err) and the two memory estimate functions like so:

class CustomResource : public Resource {
public:
    // no need for constructor, all the work is done in Init
    virtual ~CustomResource(void) {}

    virtual bool Init(QString path, QString& err) {
        // anything here will be run in an offline thread
        // opengl resource creation here will be shared
        for(int i = 0; i < 1000000; i++) {
            // some slow process
            cached_value++;
        }
        if(!cached_value) {
            err = "Reason for failure";
            return false; // resource will be "IN_ERROR"
        }
        return true; // resource will be "DONE"
    };
    virtual qint64 GetMemoryEstimate    ( void ) const { return 0; }
    virtual qint64 GetGPUMemoryEstimate ( void ) const { return 0; }

private:
    int cached_value = 0; // whatever payloads/objects you like
};

If you want to create a resource based off data in a file, you have the convenience base class FileResource which requires you to implement the virtual bool FileInit(QString path, unsigned char* data, int length, QString& err) function that is called after the file in the path is loaded. data and length contain the contents of the file if it has been successfully loaded.

The default resource management is for older, larger resources to be deallocated once the memory limit is reached. You can change the memory limit for both regular memory and GPU memory in your IEngine subclass by setting m_MaxMemory and m_MaxGPUMemory as bytes. They are set to 2 gig and 1 gig respectively. If you would like to take over memory management yourself, override the function virtual void ManageResources(ResourceManagementToken token, qint64 allocated_ram, qint64 allocated_gpu_ram). Have a look inside the default implementation for a guide on how to do that.

3D Debug Visualizer

An easy way to get started on a 3D project is to chuck in some debug lines so you can get some basic context. Here's an example of origin axis lines:

// 10 units. What's a unit? How long's a piece of string?
const double size_grid = 10.0; 
// red X
DebugRender::Line(Vec3(0, 0, 0), Vec3(size_grid, 0, 0), Vec4(1, 0, 0, 1));
// green Y
DebugRender::Line(Vec3(0, 0, 0), Vec3(0, size_grid, 0), Vec4(0, 1, 0, 1));
// blue Z
DebugRender::Line(Vec3(0, 0, 0), Vec3(0, 0, size_grid), Vec4(0, 0, 1, 1));

There is also DebugRender::Point, DebugRender::Triangle, DebugRender::Circle and DebugRender::Square. The last two are 2D. Colors are set via a Vec4 that specifies red, green, blue and alpha where 0 is none and 1 is max.

Then make sure you at some point in your render cycle call

// use the same view_perspective_matrix from your camera as your other objects
// width and height are the pixel size of your viewport
DebugRender::Render3DDebug(view_perspective_matrix, width, height);
DebugRender::Clear();

Scrapbook

It may be useful from time to time to test out or visualize simple concepts without messing with your core render loop. Neshny provides a 2D and a 3D scrapbook which you can add debug render visuals to, provide simple ImGui controls, or do custom rendering inside the context of the scrapbook window.

You can add visuals much like with the DebugRender (it uses the same underlying code) with functions such as Scrapbook3D::Point and Scrapbook3D::Line.

Here is an example of adding custom controls:

// width and height are passed in by Scrapbook3D to your lambda
Scrapbook3D::Controls([this, &stage](int width, int height) {
    if (ImGui::Button("Do something")) {
        RunFunction();
    }
    ImGui::SetNextItemWidth(width * 0.75);
    ImGui::SliderInt("Variable", &stage, 0, 100);
    ImGui::Text("Test info %i", 123);
});

Here is an example of running custom render code:

{ // create a local scope
    // get a render-to-texture token
    auto token = Scrapbook3D::ActivateRTT();

    // do graphics stuff here
    // Scrapbook3D's render to texture is active
    // any rendering will end up in the scrapbook window not your window

} // on exit, token is deconstructed and RTT is switched off
// important - make sure to only use tokens within a temporary/function scope!
// do not let them leak into the rest of your regular rendering code

GrigoryGraborenko / neshny