Vector scatter/gather support

changed title from Scattergather to Vector scatter/gather support

changed the description

added 1 commit

a946d58e - gather/scatter vector instructions

Compare with previous version

changed the description

approved this merge request

enabled an automatic merge when the pipeline for a946d58e succeeds

canceled the automatic merge

I came up with a nice little visualization to show the performance gains for different vectorization parameter choices. When you hover over one of the data points, it will show you the parameters. Unfortunately, the cluster I ran this on is a bit noisy (some measurements have big error bars of multiple MLUPS) and won't let me manually set CPU frequencies. You can still see though that simulations with assume_inner_stride_one=False or layout="zyxf" benefit significantly. Note that none of the sub-100% outliers even had their code changed by my merge request, it's just pure noise.

Ok looks fine. I will merge now.

mentioned in commit 8f72741d

merged

Here is another plot from a Core i7-7820X, on which I disabled turbo boost and set to a fixed 3.5 GHz (the maximum AVX512 frequency). Again, there is nothing systematic to the data points below 100%. Most error bars are smaller now.

mentioned in merge request !345 (merged)

Vector scatter/gather support

Merge request reports

Activity