Jacket: Output

type='double'
Speed-up: gpu,gfor vs gpu,for = 4.10398
Speed-up: gpu,gfor vs cpu,for = 6.22496
 
type='single'
Speed-up: gpu,gfor vs gpu,for = 8.66214
Speed-up: gpu,gfor vs cpu,for = 8.3635