ggml : x2 speed for WASM by optimizing SIMD.
PR by Xuan-Son Nguyen for llama.cpp:
> This PR provides a big jump in speed for WASM by leveraging SIMD instructions for qX_K_q8_K and qX_0_q8_0 dot product functions.
>
> …
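For readers unfamiliar with this kind of kernel, here is a minimal sketch of a block-quantized int8 dot product with an optional WASM SIMD path. It is not the code from the PR. The struct `demo_block_q8`, the functions `demo_dot_i8x32` and `demo_vec_dot_q8`, the `float` scale, and the block size of 32 are all assumptions made for illustration. The SIMD branch only compiles when the toolchain defines `__wasm_simd128__` (for example, Emscripten with `-msimd128`); otherwise the scalar loop is used.

```c
#include <stddef.h>
#include <stdint.h>

#ifdef __wasm_simd128__
#include <wasm_simd128.h>
#endif

#define BLOCK_SIZE 32 /* values per quantized block (assumed for this sketch) */

/* Hypothetical simplified block: one float scale plus 32 signed 8-bit values. */
typedef struct {
    float  scale;
    int8_t qs[BLOCK_SIZE];
} demo_block_q8;

/* Integer dot product of two 32-element int8 arrays. */
static int32_t demo_dot_i8x32(const int8_t *a, const int8_t *b) {
#ifdef __wasm_simd128__
    v128_t acc = wasm_i32x4_splat(0);
    for (int i = 0; i < BLOCK_SIZE; i += 16) {
        /* Load 16 int8 values from each input. */
        v128_t va = wasm_v128_load(a + i);
        v128_t vb = wasm_v128_load(b + i);
        /* Widen each half to int16, then multiply and add adjacent pairs
           into four int32 lanes. */
        acc = wasm_i32x4_add(acc, wasm_i32x4_dot_i16x8(
            wasm_i16x8_extend_low_i8x16(va),
            wasm_i16x8_extend_low_i8x16(vb)));
        acc = wasm_i32x4_add(acc, wasm_i32x4_dot_i16x8(
            wasm_i16x8_extend_high_i8x16(va),
            wasm_i16x8_extend_high_i8x16(vb)));
    }
    /* Horizontal sum of the four int32 lanes. */
    return wasm_i32x4_extract_lane(acc, 0) + wasm_i32x4_extract_lane(acc, 1)
         + wasm_i32x4_extract_lane(acc, 2) + wasm_i32x4_extract_lane(acc, 3);
#else
    /* Scalar fallback: one multiply-add per element. */
    int32_t sum = 0;
    for (int i = 0; i < BLOCK_SIZE; i++) {
        sum += (int32_t)a[i] * (int32_t)b[i];
    }
    return sum;
#endif
}

/* Dot product of two quantized vectors made of nblocks blocks each.
   Each block contributes scale_x * scale_y * (integer dot product). */
float demo_vec_dot_q8(size_t nblocks, const demo_block_q8 *x,
                      const demo_block_q8 *y) {
    float total = 0.0f;
    for (size_t i = 0; i < nblocks; i++) {
        total += x[i].scale * y[i].scale
               * (float)demo_dot_i8x32(x[i].qs, y[i].qs);
    }
    return total;
}
```

The SIMD branch handles 16 elements per load instead of one, which is the general source of speedup in kernels of this kind. The actual gain depends on the quantization format and the WASM runtime.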
Read in full here:
This thread was posted by one of our members via one of our news source trackers.