Optimizing Performance¶
Glaze is focused on delivering peak performance through compile time evaluation, reducing dynamic memory allocations, and giving the developer deep control over serialization/deserialization. Simultaneously, Glaze tries to provide a simpler interface by default than many libraries, which makes it easy to get excellent performance with little effort.
Compiler Flags¶
Building with optimizations is the most important for performance: -O2 or -O3
Use -march=native if you will be running your executable on the build platform.
SIMD Architecture Flags¶
Glaze automatically detects the target architecture using compiler-predefined macros and defines the appropriate SIMD flags:
| Flag | Detected When | Architecture |
|---|---|---|
GLZ_USE_SSE2 |
__x86_64__ or _M_X64 |
x86-64 (always has SSE2) |
GLZ_USE_AVX2 |
__AVX2__ (in addition to x86-64) |
x86-64 with AVX2 |
GLZ_USE_NEON |
__aarch64__, _M_ARM64, or __ARM_NEON |
ARM64 / AArch64 |
These macros are set by the compiler based on the target architecture, so they work correctly when cross-compiling (e.g., an x86 host building for ARM will not define __x86_64__).
When AVX2 is available, both GLZ_USE_SSE2 and GLZ_USE_AVX2 are defined. The AVX2 path handles 32-byte chunks, then the SSE2 path handles 16-byte remainders.
Disabling SIMD¶
To disable all SIMD intrinsics, use the CMake option:
This sets the GLZ_DISABLE_SIMD compile definition as an INTERFACE property, so it automatically propagates to all targets that link against glaze::glaze. If you aren't using CMake, define GLZ_DISABLE_SIMD before including Glaze headers.
[!NOTE]
Glaze uses SIMD Within A Register (SWAR) for most optimizations, which are fully cross-platform and do not require any SIMD intrinsics or special compiler flags. SIMD intrinsics are used for specific hot paths (e.g., JSON string escaping), so disabling them typically has a modest impact on overall performance.
Reducing Compilation Time¶
Glaze aggressively uses forced inlining (GLZ_ALWAYS_INLINE) for peak runtime performance. If compilation time is more important than peak performance, you can disable forced inlining:
Or define GLZ_DISABLE_ALWAYS_INLINE before including Glaze headers.
This causes GLZ_ALWAYS_INLINE and GLZ_FLATTEN to fall back to regular inline hints, letting the compiler decide what to inline.
[!NOTE]
This option primarily reduces compilation time, not binary size. Modern compilers typically inline hot paths anyway using their own heuristics, so binary size reduction is often minimal. Forced inlining is automatically disabled in debug builds (when
NDEBUGis not defined).
Reducing Binary Size¶
Optimization Levels¶
Glaze provides an optimization_level option for controlling the trade-off between binary size and performance. For embedded systems, WebAssembly, or mobile apps where binary size matters:
// Use the size-optimized preset
auto ec = glz::read<glz::opts_size{}>(value, buffer);
auto json = glz::write<glz::opts_size{}>(obj);
The size optimization level:
- Uses
std::to_charsinstead of lookup tables for number serialization - Defaults to linear search for key matching
- Can reduce binary size by ~280KB or more
See Optimization Levels for full details on available levels and their characteristics.
Linear Search Option¶
For finer control, you can enable linear search independently:
struct small_binary_opts : glz::opts {
bool linear_search = true; // Use linear key search instead of hash tables
};
auto ec = glz::read<small_binary_opts{}>(value, buffer);
This provides significant binary size reduction (typically 40-50% smaller than default) by: - Using fold-expression dispatch instead of jump tables - Eliminating hash table generation for each struct type - Reducing template instantiation bloat
The trade-off is slightly slower parsing for objects with many fields, though performance remains competitive with other JSON libraries. For objects with few fields (< 10), the difference is negligible.
Best Data Structures/Layout¶
It is typically best to match the JSON and C++ object layout. C++ structs should mimic JSON objects, and vice versa.
What you know at compile time should be represented in your C++ data types. So, if you know names at compile time then use a C++ struct. If you know an array is a fixed size that can exist on the stack then use a std::array. Normal C++ tips for performance apply as Glaze is just an interface layer over your C++ data structures. Prefer std::unordered_map over std::map for performance if ordering isn't important.
Only use glz::generic if you know nothing about the JSON you are ingesting.
[!NOTE]
Note that structs with more fields often require more complex hash algorithms, so if you can reduce the number of fields in a struct then you'll typically get better hash performance.
Reducing Memory Allocations¶
Use the read/write APIs that do not internally allocate the object and buffer. Instead, reuse previous buffers and objects if you are making repeated calls to serialize or parse.
For writing:
auto ec = glz::write_json(value, buffer); // Prefer this API if calls happen more than once
auto result = glz::write_json(value); // Allocates the buffer within the call
For reading:
auto ec = glz::read_json(value, buffer); // Prefer this API if calls happen more than once
auto result = glz::read_json<Value>(buffer); // Allocates the Value type within the call
Reusing Context Across Calls¶
Glaze uses a glz::context object internally during parsing and serialization. This context contains a scratch buffer that is used as temporary storage in specific cases — such as parsing object keys that require unescaping, reading quoted wrapped values, escape_bytes decoding, and filesystem path deserialization. Most parsing and serialization does not use the scratch buffer, so for simple structs with plain string or numeric fields, reusing a context provides little benefit. By default, a fresh context is created for each call:
// Each call creates and destroys its own context (and scratch buffer)
for (auto& item : items) {
auto ec = glz::read_json(item, buffers[i]);
}
For hot loops, you can create a context once and pass it to every call. The scratch buffer grows as needed and is reused across calls, avoiding repeated allocations:
glz::context ctx{};
for (auto& item : items) {
auto ec = glz::read<glz::opts{}>(item, buffers[i], ctx);
}
This is most beneficial when parsing data that exercises the scratch buffer (e.g., objects with escaped keys, quoted fields, or filesystem paths) in a tight loop.
[!NOTE]
The context stores error state, so if you reuse a context you should check for errors after each call. The error state is overwritten by each subsequent call.
Custom contexts that inherit from glz::context also benefit from scratch buffer reuse:
struct my_context : glz::context {
size_t max_string_length = 1024;
};
my_context ctx{};
for (auto& msg : messages) {
auto ec = glz::read<glz::opts{}>(msg.value, msg.buffer, ctx);
if (ec) { /* handle error */ }
}
Buffers¶
It is recommended to use a non-const std::string as your input and output buffer. When reading, Glaze will automatically pad the std::string for more efficient SIMD/SWAR and resize back to the original once parsing is finished.
Compile Time Options¶
Glaze provides options that are handled at compile time and directly affect the produced assembly. The default options for Glaze tend to favor performance.
Don't switch these from their defaults unless needed.
null_terminated = true: Using null terminated buffers allows the code to have fewer end checks and therefore branches.error_on_unknown_keys = true: Erroring on unknown keys allows more efficient hashing, because it can reject all unknown keys and doesn't require additional handling.error_on_missing_keys = false: Erroring on missing keys requires looking up whether all keys were read from a bitmap, this adds overhead.
std::string_view Fields¶
You can use std::string_view as your JSON fields if you do not want to copy data from the buffer or allocate new memory.
[!IMPORTANT]
std::string_viewwill point to the JSON string in your original input buffer. You MUST keep your input buffer alive as long as you are reading from these values.
When you parse into a std::string_view Glaze does not unescape JSON strings with escapes (e.g. escaped unicode). Instead the JSON string is left in its escaped form and no allocations are made (the original buffer must be kept alive). You can always read a std::string_view into a std::string if you want to unescape. Typically users should use a std::string, as Glaze very efficiently unescapes JSON and allocates more memory as needed. But, if you need peak performance and know that your buffer will stay alive for as long as you access your std::string_view, or want to avoid heap allocations, the option is there to use a std::string_view.
Performance FAQ¶
Does the order of members in a struct matter?
No, Glaze uses compile time generated hash maps for C++ structs. Accessing fields out of order has little to no performance effect on Glaze.