Name | Modified | Size | Downloads / Week |
---|---|---|---|
OpenBLAS 0.3.12 version.tar.gz | 2020-10-24 | 12.3 MB | |
OpenBLAS 0.3.12 version.zip | 2020-10-24 | 25.1 MB | |
README.md | 2020-10-24 | 952 Bytes | |
Totals: 3 Items | | 37.4 MB | 0 |
common:
- Fixed missing BLAS/LAPACK functions (inadvertently dropped during the build system restructuring to support selective compilation)
- Fixed argument conversion macro in LAPACKE_zgesvdq (LAPACK [#458])
POWER:
- Added optimized SCOPY/CCOPY kernels for POWER10
- Increased and unified the default size of the GEMM buffer
- Fixed building for POWER10 in DYNAMIC_ARCH mode
- POWER10 compatibility test now checks binutils version as well
- Cleaned up compiler warnings
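
For readers unfamiliar with the routine, the SCOPY operation named in the POWER notes above copies one single-precision vector into another (CCOPY does the same for single-precision complex data). The following is a minimal sketch of those reference semantics only, not the optimized POWER10 kernel; `ref_scopy` is a hypothetical helper name, and the sketch assumes non-negative strides, whereas the real BLAS interface also accepts negative increments.

```c
#include <stddef.h>

/* Reference semantics of a single-precision BLAS copy: y := x.
 * ref_scopy is a hypothetical name for illustration, not an OpenBLAS symbol.
 * Assumption: strides are non-negative (real BLAS also allows negative ones). */
void ref_scopy(size_t n, const float *x, size_t incx, float *y, size_t incy)
{
    for (size_t i = 0; i < n; i++) {
        /* Element i of the logical vector lives at offset i * stride. */
        y[i * incy] = x[i * incx];
    }
}
```

A plain loop like this can serve as a baseline when checking that an optimized kernel produces identical output for a given length and stride.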
x86_64:
- Corrected compiler version checks for AVX2 compatibility
- Added compiler option -mavx2 for building with flang
- Fixed direct SGEMM pathway for small matrix sizes (broken by the code refactoring in 0.3.11)
- Fixed unhandled partial register clobbers in several kernels for AXPY, DOT, GEMV_N and GEMV_T, flagged by the gcc 10 tree-vectorizer
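
Since the SGEMM fix above concerns small matrix sizes, one way to sanity-check a build is to compare library results against a naive reference on a few small inputs. The sketch below is an assumption-laden illustration, not OpenBLAS code: `ref_sgemm` is a hypothetical helper that computes C := alpha*A*B + beta*C for column-major, non-transposed operands only.

```c
#include <stddef.h>

/* Naive reference for C := alpha * A * B + beta * C.
 * ref_sgemm is a hypothetical name for illustration, not an OpenBLAS symbol.
 * Assumptions: column-major storage, no transposition.
 * A is m x k (leading dimension lda), B is k x n (ldb), C is m x n (ldc). */
void ref_sgemm(size_t m, size_t n, size_t k, float alpha,
               const float *a, size_t lda,
               const float *b, size_t ldb,
               float beta, float *c, size_t ldc)
{
    for (size_t j = 0; j < n; j++) {
        for (size_t i = 0; i < m; i++) {
            float acc = 0.0f;
            for (size_t p = 0; p < k; p++) {
                acc += a[i + p * lda] * b[p + j * ldb];
            }
            /* When beta is zero, C is overwritten without being read,
             * so stale NaN or Inf values in C do not propagate. */
            if (beta == 0.0f) {
                c[i + j * ldc] = alpha * acc;
            } else {
                c[i + j * ldc] = alpha * acc + beta * c[i + j * ldc];
            }
        }
    }
}
```

Comparing this against the library's SGEMM for a range of small m, n and k (within a floating-point tolerance, since summation order may differ) is a simple regression check for this kind of breakage.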
ARMV8:
- Improved Apple Vortex support to include cross-compiling