ClassiCube/third_party/tinyalloc/README.md

116 lines
5.4 KiB
Markdown
Raw Normal View History

2024-09-10 04:39:20 -04:00
# thi.ng/tinyalloc
Tiny replacement for `malloc` / `free` in unmanaged, linear memory situations, e.g. [WASM](http://webassembly.org) and [embedded devices](https://github.com/thi-ng/ws-ldn-12).
## Updates
For an updated version (written in TypeScript, but still targetting the same linear
memory setup) with more features and improved block splitting/coalescing, please visit:
[thi.ng/malloc](https://github.com/thi-ng/umbrella/tree/develop/packages/malloc).
For an in-depth discussion and comparison with other allocators, please see:
- [Toward Specialization of Memory
Management in Unikernels (Hugo Lefeuvre)](https://os.itec.kit.edu/downloads/2020_BA_Lefeuvre_Toward_Specialization_of_Memory_Management_in_Unikernels.pdf)
## Features
- written in standalone C11, no dependencies, C runtime or syscalls used
- configurable address region (and max. block count) for heap space
- configurable pointer alignment in heap space
- optional compaction of consecutive free blocks
- optional block splitting during alloc (if re-using larger free'd blocks)
- tiny, the WASM binary is 1.5KB (1.1KB w/ compaction disabled)
## Details
**tinyalloc** maintains 3 linked lists: fresh blocks, used blocks, free blocks. All lists are stored in the same fixed sized array so the memory overhead can be controlled at compile time via the [configuration vars](#configuration) listed below. During initialization all blocks are added to the list of fresh blocks.
The difference between free & fresh blocks is the former already have an associated heap address and size from previous usage. OTOH fresh blocks are uninitialized and are only used if no existing free blocks satisfy an allocation request.
The diagram illustrates the state of having 1 freed block (green), 2 used blocks (red) and the beginning of the fresh (unused) block list:
![memory layout](tinyalloc.png)
### Allocation
When a new chunk of memory is requested, all previously freed blocks are checked for potential re-use. If a block is found, is larger than the requested size and the size difference is greater than the configured threshold (`TA_SPLIT_THRESH`), then the block is first split, the chunks added to the used & free lists respectively and the pointer to the first chunk returned to the user. If no blocks in the free list qualify, a new block is allocated at the current heap top address, moved from the "fresh" to the "used" block list and the pointer returned to the caller.
Note: All returned pointers are aligned to `TA_ALIGN` word boundaries. Same goes for allocated block sizes. Also, allocation will fail when all blocks in the fixed size block array are used, even though there might still be ample space in the heap memory region...
### Freeing & compaction
The list of freed blocks is sorted by block start address. When a block is being freed, **tinyalloc** uses insertion sort to add the block at the right list position and then runs a compaction procedure, merging blocks as long as they form consecutive chunks of memory (with no gaps in between them). The resulting obsolete blocks are re-added to the list of available blocks.
## API
### ta\_init(void \*base, void \*limit, size_t heap_blocks, size_t split_thresh, size_t alignment)
Initializes the control datastructure. MUST be called prior to any other **tinyalloc** function.
| Argument | Description |
|----------|-------------|
| `base` | Address of **tinyalloc** control structure, typically at the beginning of your heap |
| `limit` | Heap space end address |
| `heap_blocks` | Max. number of memory chunks (e.g. 256) |
| `split_thresh` | Size threshold for splitting chunks (a good default is 16) |
| `alignment` | Word size for pointer alignment (e.g. 8) |
- `alignment` is assumed to be >= native word size
- `base` must be an address in RAM (on embedded devices)
### void* ta\_alloc(size\_t num)
Like standard `malloc`, returns aligned pointer to address in heap space, or `NULL` if allocation failed.
### void* ta\_calloc(size\_t num, size\_t t)
Like standard `calloc`, returns aligned pointer to zeroed memory in heap space, or `NULL` if allocation failed.
### bool ta\_free(void \*ptr)
Like `free`, but returns boolean result (true, if freeing succeeded). By default, any consecutive memory blocks are being merged during the freeing operation.
### bool ta\_check()
Structural validation. Returns `true` if internal heap structure is ok.
## Building
### Configuration
| Define | Default | Comment |
|--------|---------|---------|
| `TA_DEBUG` | undefined | Trace debug information |
| `TA_DISABLE_COMPACT` | undefined | Disable free block compaction |
| `TA_DISABLE_SPLIT` | undefined | Disable free block splitting during re-alloc |
On a 32bit system, the default configuration causes an overhead of 3088 bytes in RAM, but can be reduced if fewer memory blocks are needed.
**Notes:**
If building in debug mode (if `TA_DEBUG` symbol is defined), two externally defined functions are required:
- `print_s(char *)` - to print a single string
- `print_i(size_t)` - to print a single unsigned int
### Building for WASM
(Requires [emscripten](http://emscripten.org))
```sh
emcc -Oz -s WASM=1 -s SIDE_MODULE=1 -o tinyalloc.wasm tinyalloc.c
```
#### Disassemble to WAST
(Requires [WABT](https://github.com/WebAssembly/wabt))
```sh
wasm2wast --generate-names tinyalloc.wasm > tinyalloc.wast
```
## License
© 2016 - 2017 Karsten Schmidt - Apache Software License 2.0 (see [LICENSE](./LICENSE))