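@neauoire OpenMP calls this a parallel-for: you mark a loop whose iterations don't depend on each other, and it splits them across threads. It doesn't detect the independence for you; the pragma is your promise, and a loop that breaks it will race.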
Usual advice applies: use a library first, unless you want the rabbit hole, in which case measure your single-thread perf and thread launch cost, work out the back-of-the-envelope tradeoffs, then measure again. And again.
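
For the curious, a minimal sketch of a parallel-for in C, assuming a compiler with OpenMP support (e.g. `gcc -O2 -fopenmp`). The names `scale_serial`, `scale_parallel`, and `now_seconds` are made up for illustration, and the kernel is just a stand-in for whatever per-element work you actually have.

```c
#include <stddef.h>
#include <time.h>

/* Hypothetical kernel: out[i] depends only on in[i], so every
 * iteration is independent of the others. */
void scale_serial(const float *in, float *out, size_t n, float k) {
    for (size_t i = 0; i < n; i++) {
        out[i] = in[i] * k;
    }
}

/* Same loop with the OpenMP pragma. OpenMP does not check that the
 * iterations are independent; the pragma is the programmer's claim.
 * Without -fopenmp the pragma is ignored and this runs serially.
 * A signed loop index is used for portability to older OpenMP
 * implementations. */
void scale_parallel(const float *in, float *out, size_t n, float k) {
    #pragma omp parallel for
    for (long i = 0; i < (long)n; i++) {
        out[i] = in[i] * k;
    }
}

/* Wall-clock time in seconds (C11 timespec_get). Note that clock()
 * would report CPU time summed over threads, which hides any speedup. */
double now_seconds(void) {
    struct timespec ts;
    timespec_get(&ts, TIME_UTC);
    return (double)ts.tv_sec + (double)ts.tv_nsec * 1e-9;
}
```

To follow the measuring advice, wrap each version between two `now_seconds()` calls on realistic input sizes and compare. For small `n` the thread launch cost can easily outweigh the work, so the serial loop may win.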