Performance-portable, length-agnostic SIMD with runtime dispatch
-
Updated
Nov 14, 2024 - C++
Performance-portable, length-agnostic SIMD with runtime dispatch
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
TensorFlow binaries supporting AVX, FMA, SSE
SIMD Vector Classes for C++
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
The Vector Optimized Library of Kernels
A simple C library for compressing lists of integers using binary packing
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
Agenium Scale vectorization library for CPUs and GPUs
High performance algorithms in C#: SIMD/SSE, multi-core and faster
TensorFlow binaries supporting AVX, FMA, SSE
Fast decoder for VByte-compressed integers
Fast random number generators: Vectorized (SIMD) version of xorshift128+
High-performance dictionary coding
Faster.Map is a high-performance, thread-safe key-value store designed to outperform the standard Dictionary and ConcurrentDictionary
A fast implementation of single-pattern substring search using SIMD acceleration.
UME::SIMD A library for explicit simd vectorization.
DSP library for signal processing
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
A few classes for extremely fast json parsing/serializing in modern C++.
Add a description, image, and links to the simd-instructions topic page so that developers can more easily learn about it.
To associate your repository with the simd-instructions topic, visit your repo's landing page and select "manage topics."