Frederik Hennig opened merge request !27 "Fix clang-format off directive for older LLVM versions" at pycodegen / pystencils-sfg
Frederik Hennig deleted branch fhennig/postprocessing-fixes at pycodegen / pystencils-sfg
Christoph Alt accepted merge request !26 "Fixes to postprocessing: Remove unused code, test vector extraction, unify treatment of scalar fields" at pycodegen / pystencils-sfg
Christoph Alt pushed to branch master at pycodegen / pystencils-sfg
Frederik Hennig pushed to branch fhennig/postprocessing-fixes at pycodegen / pystencils-sfg
  • b23716bd · make SupportsFieldExtraction and SupportsVectorExtraction runtime-c...
Frederik Hennig opened merge request !26 "Fixes to postprocessing: Remove unused code, test vector extraction, unify treatment of scalar fields" at pycodegen / pystencils-sfg
Frederik Hennig pushed new branch fhennig/postprocessing-fixes at pycodegen / pystencils-sfg
Frederik Hennig deleted branch fhennig/cuda-invoke at pycodegen / pystencils-sfg
Christoph Alt accepted merge request !24 "Extend Support for CUDA and HIP kernel invocations" at pycodegen / pystencils-sfg
Christoph Alt pushed to branch master at pycodegen / pystencils-sfg
Frederik Hennig pushed to branch fhennig/cuda-invoke at pycodegen / pystencils-sfg
  • dc1a3935 · added missing default value
Frederik Hennig commented on merge request !24 "Extend Support for CUDA and HIP kernel invocations" at pycodegen / pystencils-sfg

As far as testing is concerned, the CudaKernels and HipKernels test cases already run all available launch configurations at least once.

Frederik Hennig commented on merge request !24 "Extend Support for CUDA and HIP kernel invocations" at pycodegen / pystencils-sfg

Very good point. I pushed a patch that wraps the entire logic in a Builder class.

Frederik Hennig pushed to branch fhennig/cuda-invoke at pycodegen / pystencils-sfg
  • 8b597b98 · clean up implementation of gpu_invoke using a builder
Christoph Alt commented on merge request !24 "Extend Support for CUDA and HIP kernel invocations" at pycodegen / pystencils-sfg

This gpu_invoke function is currently around 130 lines long and defines 3 local functions. From my perspective this is a bit complex and it took me...

Frederik Hennig pushed to branch fhennig/cuda-invoke at pycodegen / pystencils-sfg
  • cefe0bdd · fix outdated deprecation notice
Frederik Hennig
  • a8403669 · fix default block size for dynamic launch grids
Frederik Hennig pushed to branch fhennig/cuda-invoke at pycodegen / pystencils-sfg
  • 9fda3a06 · fix default block size for dynamic launch grids