RaftLib

RaftLib[1] is a portable parallel processing system that aims to deliver high performance while improving programmer productivity.

It enables a programmer to assemble a massively parallel program (both local and distributed) using simple iostream-like operators.
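To illustrate what "iostream-like operators" means in practice, the following is a minimal sketch of chaining processing stages with `operator>>`. It is not RaftLib's API: `Stage`, `Pipeline`, and `run_demo` are hypothetical names introduced only for this example, and the stages run sequentially on one thread.

```cpp
#include <functional>
#include <utility>
#include <vector>

// Hypothetical stand-in for a compute kernel: an int -> int transform.
// This is NOT RaftLib's kernel type.
struct Stage {
    std::function<int(int)> fn;
};

// Hypothetical pipeline: an ordered list of stages.
struct Pipeline {
    std::vector<Stage> stages;

    // Feed one value through every stage in order.
    int run(int x) const {
        for (const auto& s : stages) {
            x = s.fn(x);
        }
        return x;
    }
};

// `a >> b` starts a pipeline from two stages.
Pipeline operator>>(Stage a, Stage b) {
    Pipeline p;
    p.stages.push_back(std::move(a));
    p.stages.push_back(std::move(b));
    return p;
}

// `p >> c` appends a stage to an existing pipeline.
Pipeline operator>>(Pipeline p, Stage s) {
    p.stages.push_back(std::move(s));
    return p;
}

// Builds add_one >> times_two >> square and runs it on the input 3.
int run_demo() {
    Stage add_one{[](int x) { return x + 1; }};
    Stage times_two{[](int x) { return x * 2; }};
    Stage square{[](int x) { return x * x; }};

    Pipeline p = add_one >> times_two >> square;
    return p.run(3);  // ((3 + 1) * 2)^2 = 64
}
```

Because `>>` is left-associative, the expression reads in the same order as the data flows, which is the property the stream-style syntax is meant to provide.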

RaftLib handles threading, memory allocation, memory placement, and auto-parallelization of compute kernels.[2]

It enables applications to be constructed from chains of compute kernels forming a task- and pipeline-parallel compute graph.
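The idea of a pipeline-parallel compute graph can be sketched with standard C++17 threads: each kernel runs on its own thread and passes items to the next through a queue, so different stages work on different items at the same time. This is an illustrative sketch only, not how RaftLib is implemented; `Channel` and `sum_of_squares` are hypothetical names, and the queue here is a simple unbounded, mutex-protected FIFO.

```cpp
#include <condition_variable>
#include <mutex>
#include <optional>
#include <queue>
#include <thread>
#include <utility>

// Hypothetical FIFO connecting two stages; not a RaftLib type.
template <typename T>
class Channel {
public:
    void push(T v) {
        {
            std::lock_guard<std::mutex> lk(m_);
            q_.push(std::move(v));
        }
        cv_.notify_one();
    }

    // Signals that no more items will be pushed.
    void close() {
        {
            std::lock_guard<std::mutex> lk(m_);
            closed_ = true;
        }
        cv_.notify_all();
    }

    // Blocks until an item arrives; returns nullopt once closed and drained.
    std::optional<T> pop() {
        std::unique_lock<std::mutex> lk(m_);
        cv_.wait(lk, [this] { return !q_.empty() || closed_; });
        if (q_.empty()) {
            return std::nullopt;
        }
        T v = std::move(q_.front());
        q_.pop();
        return v;
    }

private:
    std::mutex m_;
    std::condition_variable cv_;
    std::queue<T> q_;
    bool closed_ = false;
};

// Three-stage graph: source -> square -> sum, one thread per stage.
long long sum_of_squares(int n) {
    Channel<int> raw;
    Channel<long long> squared;
    long long total = 0;

    std::thread source([&] {
        for (int i = 1; i <= n; ++i) {
            raw.push(i);
        }
        raw.close();
    });
    std::thread square([&] {
        while (auto v = raw.pop()) {
            squared.push(static_cast<long long>(*v) * *v);
        }
        squared.close();
    });
    std::thread sink([&] {
        while (auto v = squared.pop()) {
            total += *v;
        }
    });

    source.join();
    square.join();
    sink.join();
    return total;  // safe to read: all threads have been joined
}
```

Calling `sum_of_squares(10)` returns 385. In this sketch the programmer manages threads and queues by hand; a framework of the kind described above takes over that bookkeeping.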

Programs are authored in C++ (although other language bindings are planned).