Reduction Support
This MR introduces reductions to pystencils for scalar data types and thus covers #55.
User interface
- Adds reduction assignment classes to
sympyextensions
module: AddReductionAssignment, SubReductionAssignment, MulReductionAssignment, MinReductionAssignment, MaxReductionAssignment
These can be used as follows:
import pystencils as ps
r = ps.TypedSymbol("r", "double")
x, y = ps.fields(f"x, y: double[3D]", layout="fzyx")
assign_dot_prod = ps.AddReductionAssignment(r, x.center() * y.center())
- Alternatívely, you can also make use of the
reduction_assignment
orreduction_assignment_from_str
functions:
from pystencils.sympyextensions import reduction_assignment, reduction_assignment_from_str
from pystencils.sympyextensions.reduction import ReductionOp
assign_dot_prod = reduction_assignment(r, ReductionOp.Add, x.center() * y.center())
assign_dot_prod = reduction_assignment_from_str(r, "+", x.center() * y.center())
Supported Backends
Generic CPUs
- Add reduction support for OpenMP
SIMD: SSE3, AVX2, AVX512
- Include a generated header file with horizontal operations performing a binary operation between a scalar variable and a SIMD vector. The SIMD vector is transformed to a scalar variable via reduction, and then the binary operation is applied to the other operand
CUDA
- Employ atomic reduction operations in all threads when the block size does not align with the warp size
- Optimization for alignment with warp size: perform warp-level reductions and only perform atomic operation on first thread of warp
- Include a header file with manual implementations for atomic operations that are not directly supported for floating point numbers: atomicMul, atomicMax, and atomicMin. These functions make use of a CAS mechanism.
Internal Changes
- Freeze handling for newly introduced
ReductionAssignment
nodes - Add
PsVecHorizontal
vectorization node for conducting a binary operation between a scalar symbol and an extraction of a vector value (obtained by performing a reduction within a vector lane) - Add dataclass
ReductionInfo
holding essential information about a reduction (i.e. reduction operation, initial value and the write-back pointer for exporting the reduction result) and create corresponding lookup table for symbols inKernelCreationContext
- Introduce
NumericLimitsFunctions
for initializing neutral elements for reductions making use of min/max operations - Adapt
Platform.select_function
such that it either returns anPsExpression
that replaces the function call or returns aPsExpression | tuple[tuple[PsStructuralNode, ...], PsAstNode]
holding aPsAstNode
that replaces the function call and tuple of structural nodes that are added before the replacement. The structural nodes allow adding preparatory code for the replacement, as needed for the warp-level reductions for GPU platforms - Add
ReductionFunctions.WriteBackToPtr
function that is replaced with platform-dependent code inPlatform.select_function
- Slightly adapt CPU/GPU Jit modules to support the handling of write-back pointers used for reductions
Edited by Richard Angersbach
Merge request reports
Activity
Filter activity
added 4 commits
Toggle commit listadded 9 commits
- af855492 - Minor adaptation on how symbols are given reduction property
- 4ae330dc - Add C function selection for numeric limits functions
- a16969bf - Add omp reduction clauses for reduced symbols
- 555a6a83 - Reformat reduction.py
- ef9239ed - Add back reduced_assign to sympyextensions interface
- cf2ec066 - Fix inheritance of special math function enum classes
- 9741c024 - Fix header include of limits.h
- 9a8e6f9b - Omit distinction between normal and reduced assignments in AssignmentCollection
- e9ee769d - Adaptations to reduction test
Toggle commit listadded 5 commits
- 1a1c23b5 - Adapt comment of ReductionSymbolProperty
- fff5a079 - Fix removal of function parameters for lhs symbols that are not declared in the kernel
- bb984679 - Fix usage of numerical limits for init value of reduction
- a3025645 - Fix min/max reductions
- 9bbb8181 - Parameterize test_reduction.py for different reduction operations
Toggle commit listadded 1 commit
- 3c5a93b4 - Define type of init_val for reduction as Any
added 1 commit
- 3fc9a049 - Swap out neutral init values for reduced assignments with min/max op
added 1 commit
- 9fd1c2ad - Fix declaration of local reduction var and write back to original variable
added 1 commit
- 6bc3cf3f - Set type of reduced variable to pointer and write back via PsMemAcc
added 4 commits
Toggle commit listadded 43 commits
-
72fa8672...4f8e42e6 - 6 commits from branch
v2.0-dev
- 4f8e42e6...45ab4e86 - 27 earlier commits
- 0a9abc2a - Swap out neutral init values for reduced assignments with min/max op
- 3c276118 - Fix declaration of local reduction var and write back to original variable
- c51ae2b4 - Set type of reduced variable to pointer and write back via PsMemAcc
- c6eedfcd - Split reduction var property into local and pointer-based reduction var properties
- 3e0daa67 - Propagate properties of reduction pointer symbols to kernel parameters
- 777ab888 - Use literals for C macros used for the numeric limits
- f1c556e6 - Integrate reduction pointers to parameters.py
- ba697180 - Rewire existing code extraction of fields to support reduction pointer extraction
- 3e595df6 - Refine test_reduction.py to check for result correctness
- 5ca40eae - Merge remote-tracking branch 'origin/rangersbach/reductions' into rangersbach/reductions
Toggle commit list-
72fa8672...4f8e42e6 - 6 commits from branch
added 1 commit
- f0d2fde6 - Encapsulate mapping of binop strings to actual operands and now also use for...
Please register or sign in to reply