I have been able to make the GR32-resampling as fast as mine unthreaded, by making some simple changes to the procedure GR32_Resamplers.Resample (in Implementation-part):
changing the order of X- and Y- loop in the filling of the horizontal buffer, avoiding jumps in the bitmap-memory,
using pointers to walk along the arrays,
turning on compiler-optimization for the procedure (biggest improvement)
If you want to see for yourself, in the attachment are 3 changed .pas-files that need to overwrite the corresponding ones in the Algorithms-folder under Bitmap Scaling.
Renate
Bitmap Scaling-Diff.zip