Permutes 256-bit or 128-bit float32 values into a 256-bit or 128-bit destination vector. The corresponding Intel® AVX instruction is VPERMILPS.
extern __m256 _mm256_permute_ps(__m256 m1, int control); |
extern __m128 _mm_permute_ps(__m128 m1, int control); |
The _mm256_permute_ps intrinsic permutes single-precision floating point elements (float32 elements) in the 256-bit source vector, m1, according to a specified 2-bit control field, control, and stores the result in a destination vector.
The _mm_permute_pd intrinsic permutes single-precision floating point elements (float32 elements) in the 128-bit source vector, m1, according to a specified 2-bit control field, control, and stores the result in a destination vector.
Copyright © 1996-2010, Intel Corporation. All rights reserved.