This is a variation on the "hello world" (vecAddKernel) example given in Chapter 2 of Programming Massively Parallel Processors. This example does a reduction over 1's populated in blocks up to a ...