RaftLib[1] is a portable parallel processing system that aims to provide extreme performance while increasing programmer productivity.
It enables a programmer to assemble a massively parallel program (both local and distributed) using simple iostream-like operators.
RaftLib handles threading, memory allocation, memory placement, and auto-parallelization of compute kernels.
[2] It enables applications to be constructed from chains of compute kernels forming a task and pipeline parallel compute graph.
Programs are authored in C++ (although other language bindings are planned).