HPTT is a high-performance C++ library for out-of-place tensor
transpositions.
Key Features:
* Multi-threading support
* Explicit vectorization
* Auto-tuning (akin to FFTW)
    * Loop order
    * Parallelization
* Multi-architecture support
    * Explicitly vectorized kernels for AVX and ARM
* Supports float, double, complex and double complex data types
* Supports both column-major and row-major data layouts
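
To make "out-of-place tensor transposition" concrete, here is a naive reference sketch in plain C++. It is not HPTT's API or implementation: the function name `transpose_reference` and its parameters are hypothetical, it assumes a column-major layout, and it has none of the threading, vectorization or auto-tuning listed above. It only shows the index mapping: index `d` of the output `B` is index `perm[d]` of the input `A`.

```cpp
#include <cstddef>
#include <vector>

// Naive out-of-place transpose (illustrative only, not HPTT code).
// Column-major layout is assumed: the first index varies fastest.
// perm[d] names the index of A that becomes index d of B.
template <typename T>
std::vector<T> transpose_reference(const std::vector<T>& A,
                                   const std::vector<std::size_t>& sizeA,
                                   const std::vector<std::size_t>& perm) {
    const std::size_t dim = sizeA.size();

    // Extents and column-major strides of the output tensor B.
    std::vector<std::size_t> sizeB(dim), strideB(dim, 1);
    for (std::size_t d = 0; d < dim; ++d) sizeB[d] = sizeA[perm[d]];
    for (std::size_t d = 1; d < dim; ++d) strideB[d] = strideB[d - 1] * sizeB[d - 1];

    std::vector<T> B(A.size());          // separate output buffer (out-of-place)
    std::vector<std::size_t> idx(dim, 0); // current multi-index into A

    // Walk A linearly; idx is kept in sync like an odometer.
    for (std::size_t offA = 0; offA < A.size(); ++offA) {
        std::size_t offB = 0;
        for (std::size_t d = 0; d < dim; ++d) offB += idx[perm[d]] * strideB[d];
        B[offB] = A[offA];

        // Advance the multi-index, fastest-varying index first.
        for (std::size_t d = 0; d < dim; ++d) {
            if (++idx[d] < sizeA[d]) break;
            idx[d] = 0;
        }
    }
    return B;
}
```

For a 2x3 column-major matrix stored as `{1, 2, 3, 4, 5, 6}`, calling `transpose_reference(A, {2, 3}, {1, 0})` yields the 3x2 matrix stored as `{1, 3, 5, 2, 4, 6}`. The writes to `B` in this loop are strided and cache-unfriendly, which is the kind of cost that choices such as loop order and vectorized kernels are meant to reduce.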