Add AVX512 s/dgemm optimizations for compute kernel (2nd try)
9 files changed