内容简介:Today we are going to talk about high performance. Not the standard distributed cloud blahblah but high performance on a single core on a single machine. Please welcome SIMD instructions.SIMD (for Single Instruction, Multiple Data) are special instructions
Today we are going to talk about high performance. Not the standard distributed cloud blahblah but high performance on a single core on a single machine. Please welcome SIMD instructions.
Did you know?
What are SIMD instructions?
SIMD (for Single Instruction, Multiple Data) are special instructions on CPUs and GPUs that allow them to perform an operation on multiple bytes with only one instruction.
It is often called vectorization because operations are applied to a vector of data with a single instruction.
It allows to implement very fast algorithms on a single thread on general purpose hardware, as opposed to specific hardware acceleration like AES-NI .
It is particularly used in machine learning, cryptography, databases and content processing (video / image / audio encoding) and is a good alternative to multithreading.
MMX, SSE, AVX, AVX-512 are SIMD instructions for Intel. NEON for ARM.
Projects
simdjson-go (Apache-2.0)
A Go port of simdjson by MinIO which can parse up to gigabytes of JSON per second :rocket:
pikkr (MIT)
“JSON parser which picks up values directly without performing tokenization in Rust”
faster (MPL 2.0)
“Easy, powerful, portable, absurdly fast numerical calculations. Includes static dispatch with inlining based on your platform and vector types, zero-allocation iteration, vectorized loading/storing, and support for uneven collections." For Rust.
ncnn (BSD 3-Clause)
A high-performance neural network inference framework optimized for the mobile platform by Tencent.
mandel-simd (public domain)
Mandelbrot Set in SSE, AVX, and NEON.
fastlwc (MIT)
SIMD-enhanced wc
(Word Counter).
Project of the community
QuestDB (Apache 2.0)
“QuestDB is a NewSQL relational database designed to process time-series data, faster. Our approach comes from low-latency trading. QuestDB’s stack is engineered from scratch, zero-GC Java and dependency-free. The whole database and console fits in a 3.5Mb package." They are using SIMD to achieve extreme performance (see below).
Articles
On the dangers of Intel's frequency scaling
Unfortunately using SIMD is not a silver bullet and may slow down your multithreaded programs due to how some intel’s processors throttle.
Using SIMD to aggregate billions of rows per second
With their new 4.2 release, QuestDB (a time-series database) introduced SIMD which made their (already fast) aggregations faster by 100x.
Parsing gigabytes of JSON per second in Go
This is the detailed explanation of why and how MinIO ported simdjson to Go.
SIMD < SIMT < SMT: parallelism in NVIDIA GPUs
The difference between SIMD, SIMT and SMT and how it works in Nvidia GPUs.
A good write-up about how to transpose the safe and zero-cost abstraction philosophy of Rust to the SIMD world.
An overview of using SIMD instructions with Go for different processor architectures.
Base64 encoding and decoding at almost the speed of a memory copy
This research paper describes how they achieved base64 encoding and decoding at almost the speed of a memcpy
. Really impressive.
Learning SIMD with Rust by finding planets
A concrete guide of using SIMD to speed up an algorithm in a portable way (in Rust).
Security
This week has been particularly bad regarding security. Major vulnerabilities have been patched in Firefox (2 times!), Ubuntu’s Linux Kernel, Red Hat’s Linux Kernel, Suse’s Linux Kernel, Android, and more.
Go update right now (browsers need to be restarted to apply auto-updates).
Stay safe :v:
Sylvain
以上所述就是小编给大家介绍的《List of open source projects using SIMD to achieve extreme performance》,希望对大家有所帮助,如果大家有任何疑问请给我留言,小编会及时回复大家的。在此也非常感谢大家对 码农网 的支持!
猜你喜欢:本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们。