This repository was archived by the owner on Dec 22, 2021. It is now read-only.

Description
Packed horizontal arithmetic is reasonably performant on SSE3+ and Neon. Horizontal adds would be useful for complex multiplication; in the absence of the opcodes below, the same result has to be built from a combination of shifts and adds.
f32x4.addHoriz(x: v128, y: v128) -> v128
i32x4.addHoriz(x: v128, y: v128) -> v128
i16x8.addHoriz(x: v128, y: v128) -> v128
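
To make the intended semantics concrete, here is a minimal scalar sketch in Python. The issue does not spell out the lane ordering, so this assumes the SSE3 `HADDPS` convention: adjacent lanes of `x` are summed into the low half of the result, and adjacent lanes of `y` into the high half. The names `add_horiz` and `complex_mul_pair` are hypothetical helpers for illustration only, and integer wraparound for the `i32x4` and `i16x8` variants is not modeled.

```python
def add_horiz(x, y):
    """Scalar model of a horizontal add (assumed HADDPS-style lane ordering).

    x and y are equal-length lists of lanes, e.g. 4 lanes for f32x4.
    Result: pairwise sums of x's lanes, followed by pairwise sums of y's lanes.
    """
    if len(x) != len(y) or len(x) % 2 != 0:
        raise ValueError("operands must have the same even number of lanes")

    def pairwise(v):
        # Sum each adjacent pair of lanes: (v0 + v1), (v2 + v3), ...
        return [v[i] + v[i + 1] for i in range(0, len(v), 2)]

    return pairwise(x) + pairwise(y)


def complex_mul_pair(p, q):
    """Multiply two pairs of complex numbers using one horizontal add.

    p = [a0, b0, a1, b1] holds a0 + b0*i and a1 + b1*i (interleaved).
    q = [c0, d0, c1, d1] holds c0 + d0*i and c1 + d1*i.
    Returns [re0, re1, im0, im1] (a hypothetical output layout).
    """
    a0, b0, a1, b1 = p
    c0, d0, c1, d1 = q
    # Lane-wise products a real SIMD version would get from multiplies and shuffles.
    re_terms = [a0 * c0, -b0 * d0, a1 * c1, -b1 * d1]
    im_terms = [a0 * d0, b0 * c0, a1 * d1, b1 * c1]
    # A single horizontal add collapses each pair of terms into one component.
    return add_horiz(re_terms, im_terms)
```

For example, `add_horiz([1, 2, 3, 4], [10, 20, 30, 40])` gives `[3, 7, 30, 70]` under this assumed ordering, and `complex_mul_pair([1, 2, 3, 4], [5, 6, 7, 8])` gives `[-7, -11, 16, 52]`, that is, (1+2i)(5+6i) = -7+16i and (3+4i)(7+8i) = -11+52i.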
Thoughts on whether horizontal add instructions would be useful to include in the current SIMD spec?