Skip to content
Snippets Groups Projects

Various extensions to the vectorizer

Merged Daniel Bauer requested to merge hyteg/pystencils:bauerd/vec-extensions into v2.0-dev

Includes additions to the AstVectorizer:

  • Vectorize assignments to vector symbols
  • Vectorize loops nested within the vectorization axis if all bounds are independent of the axis
  • Vectorize array subscripts if all indices and the array expression are independent of the axis
  • Handle empty nodes (e.g. comments)

Additions to the x86 instruction set:

  • Intrinsics for basic math functions (log, abs, etc.)
  • Type conversions

And to the SelectIntrinsics pass:

  • Add option to implemenet type conversions using __builtin_convertvector which is supported by most compilers instead of inserting intrinsics. Useful because not all conversions are supported natively on all CPUs.

Merge request reports

Loading
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
  • Daniel Bauer added 1 commit

    added 1 commit

    Compare with previous version

  • Daniel Bauer added 1 commit

    added 1 commit

    Compare with previous version

  • Daniel Bauer added 1 commit

    added 1 commit

    Compare with previous version

  • added 1 commit

    • 75210689 - Update test_function; select combinations of functions and targets by pairwise parametrization

    Compare with previous version

  • Daniel Bauer added 1 commit

    added 1 commit

    • 41bc03b4 - fix x86 pre AVX512 unaligned load/store intrinsics

    Compare with previous version

  • Daniel Bauer added 1 commit

    added 1 commit

    • 2dca01bd - fix x86 unaligned load/store intrinsics

    Compare with previous version

  • Daniel Bauer added 1 commit

    added 1 commit

    • ba58e990 - disable int32, CUDA function tests

    Compare with previous version

  • Daniel Bauer added 2 commits

    added 2 commits

    • a4455131 - disable (int64, CUDA) function tests
    • 1e95fdfc - fix type annotations

    Compare with previous version

  • Daniel Bauer added 2 commits

    added 2 commits

    • 48d8dede - ignore vectorized casts on x86 (there are no type conversion intrinsics with...
    • 6a2ea2af - add type cast tests

    Compare with previous version

  • Daniel Bauer requested review from @da15siwa

    requested review from @da15siwa

  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Loading
  • Please register or sign in to reply
    Loading