Skip to content

GitLab

Explore

Sign in

Primary navigation

Project

pystencils
- Activity
- Members
- Labels
- Issues
- Issue boards
- Milestones
- Wiki
- Releases
- Model registry
- Environments
- Incidents

Snippets Groups Projects

!228

Vectorization improvements

Review changes
Download
Patches
Plain diff

Merged Vectorization improvements

ppc into master

Overview 5
Commits 9
Pipelines 20
Changes 1

Merged Michael Kuron requested to merge ppc into master 4 years ago

Overview 5
Commits 9
Pipelines 20
Changes 1

After we cleaned up vectorization support as part of our ARM Neon experiments a few weeks ago (!188 (merged), !220 (merged), !222 (merged)), I did the same thing with AltiVec/VSX intrinsics for POWER processors. Adding a new SIMD instruction set to pystencils really is just a matter of some quick find-and-replace now. I had test access to a POWER8 machine today, ran in both little-endian and big-endian mode, and all tests passed. So pystencils now actually supports all SIMD instruction sets out there (ignoring MIPS and SPARC processors, which are essentially dead).

This pull request also contains some minor unrelated changes:

switches the AES RNG to aligned stores
adds a missing pytest.importorskip
fixes the vec_any/vec_all operations (which used to only work on 256 bit doubles)
removes the q_registers argument from get_vector_instruction_set because there is no point in using half-width vectors
fix the AES-NI RNG on Ice Lake/Tiger Lake processors

Edited 4 years ago by Michael Kuron

Merge request reports

0 Assignees

0 Reviewers

Request review from

Loading

Labels

0

None

0

None

Select labels

Manage project labels

Milestone

None

None

None

Time tracking

No estimate or time spent

0

0 Participants

Loading