Add a yield instruction in the two spinloops of the threaded matmul implementation.
1 file changed