simd

There are 47 repositories under simd topic.

Tencent / ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
inference high-preformance simd arm-neon deep-learning artificial-intelligence android ios ncnn vulkan neural-network caffe mxnet pytorch onnx darknet tensorflow mlir keras riscv
Language:C++ 20674
simdjson
simdjson / simdjson
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
aarch64 arm arm64 avx2 avx512 c-plus-plus clang clang-cl cpp11 gcc-compiler json json-parser json-pointer loongarch neon simd sse42 vs2019 x64
Language:C++ 19486
questdb
questdb / questdb
QuestDB is a high performance, open-source, time-series database
time-series low-latency database sql grafana simd questdb tsdb java postgresql cpp time-series-database financial-analysis capital-markets market-data olap real-time-analytics sensor-data tick-data
Language:Java 14720
openwall / john
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
assembler c cracker crypt fpga gpgpu gpu hash john jtr mpi opencl openmp password ripper simd
Language:C 10491
g-truc / glm
OpenGL Mathematics (GLM)
glm opengl mathematics vector matrix quaternion simd cpp cpp-library header-only sycl vulkan
Language:C++ 9430
Unity-Technologies / EntityComponentSystemSamples
auto-vectorisation auto-vectorization burst component containers csharp documentation ecs entity high jobs multicore multicore-processors multicore-programming native performance simd system tutorials unity3d
Language:C# 7284
bytedance / sonic
A blazingly fast JSON serializing & deserializing library
high-performance jit json simd
Language:Assembly 7084
google / highway
Performance-portable, length-agnostic SIMD with runtime dispatch
avx avx-512 avx-instructions avx2 avx512 intrinsics neon simd simd-instructions simd-intrinsics simd-library simd-parallelism simd-programming sse42 wasm
Language:C++ 4288
ARM-software / ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
aarch64 android arm armv7 armv8 computer-vision cpp linux machine-learning neon neural-network opencl simd sve
Language:C++ 2893
turbo / js
turbo.js - perform massive parallel computations in your browser with GPGPU.
glsl gpu vector simd calculations shaders gpgpu parallel
Language:JavaScript 2636
hora
hora-search / hora
🚀 efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀 .
search-engine rust approximate-nearest-neighbor-search artificial-intelligence recommender-system image-search vector-search algorithm data-structures simd hnsw similarity-search neural-network high-performance machine-learning k-nearest-neighbors rust-sci numeric
Language:Rust 2600
ispc / ispc
Intel® Implicit SPMD Program Compiler
compiler intel ispc programming-language simd spmd
Language:C++ 2542
guillaumeblanc / ozz-animation
Open source c++ skeletal animation library and toolset
animation game data-oriented mit-license fbx collada sse simd soa
Language:C++ 2482
simd-everywhere / simde
Implementations of SIMD instruction sets for systems which don't natively support them.
simd-intrinsics sse neon arm avx simd sse2 sse3 ssse3 sse41 sse42 avx2 avx512 fma gfni mmx altivec powerpc arm64 vectorization
Language:C 2468
recp / cglm
📽 Highly Optimized 2D / 3D Graphics Math (glm) for C
3d 3d-math affine-transform-matrices avx bezier bounding-boxes c euler frustum marix-inverse math matrix matrix-decompositions neon opengl opengl-math simd sse vector wasm
Language:C 2372
zig-gamedev / zig-gamedev
Dev repo for @zig-gamedev libs and sample applications
gamedev zig libraries demos graphics directx12 math d3d12 simd ziglang game-development cross-platform webgpu physics opengl realtime
Language:Zig 2327
usearch
unum-cloud / usearch
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
kann vector-search approximate-nearest-neighbor-search simd search similarity-search database faiss search-engine webassembly clustering nearest-neighbor-search recommender-system semantic-search full-text-search fuzzy-search text-search image-search
Language:C++ 2322
StringZilla
ashvardanian / StringZilla
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖
beautifulsoup common-crawl csv dataset html information-retrieval json laion ndjson parser pattern-recognition simd sorting-algorithms string string-manipulation string-matching string-parsing string-search substring
Language:C++ 2304
xtensor-stack / xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
simd-intrinsics vectorization simd cpp avx neon sse avx512 simd-instructions mathematical-functions c-plus-plus-11 sve
Language:C++ 2243
tairov / llama2.mojo
Inference Llama 2 in one file of pure 🔥
inference llama llama2 modular mojo performance simd vectorization parallelize tensor transformer-architecture
Language:Mojo 2106
ermig1979 / Simd
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.
amx arm avx avx512 c-plus-plus haar-cascade image-processing lbp machine-learning neon neural-network simd simd-library sse
Language:C++ 2081
google / XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
convolutional-neural-network convolutional-neural-networks cpu inference inference-optimization matrix-multiplication mobile-inference multithreading neural-network neural-networks simd
Language:C 1909
agavrel / 42_CheatSheet
A comprehensive guide to 50 years of evolution of strict C programming, a tribute to Dennis Ritchie's language
42 42born2code 42fremont 42madrid 42paris 42school 42seoul 42tokyo bitwise learning school sdl2 simd
Language:C 1712
kfrlib / kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
avx fft simd cpp14 audio dsp dft header-only avx512 digital-signal-processing audio-processing cpp17 fast-fourier-transform discrete-fourier-transform clang cxx cplusplus cplusplus-14 cplusplus-17
Language:C++ 1682
Maratyszcza / NNPACK
Acceleration package for neural networks on multi-core CPUs
convolutional-layers cpu fast-fourier-transform high-performance high-performance-computing inference matrix-multiplication multithreading neural-network neural-networks simd winograd-transform
Language:C 1678
fastfloat / fast_float
Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, Redis and WebKit/Safari
cpp11 cpp17 cpp-library high-performance freebsd linux macos visual-studio neon simd sse2
Language:C++ 1647
timeplus-io / proton
A stream processing engine and database, and a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse
analytics clickhouse confluent cpp flink-alternative high-performance kakfa ksqldb-alternative redpanda simd single-binary sql stream-processing streaming-sql udf
Language:C++ 1612
DirectXMath
microsoft / DirectXMath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
microsoft directx simd neon cpp-library sse avx avx2 clang msvc xbox directxmath desktop uwp
Language:C++ 1570
AdamNiederer / faster
SIMD for humans
cross-platform intrinsics optimization simd
Language:Rust 1566
bitshifter / glam-rs
A simple and fast linear algebra library for games and graphics
3d-math-libraries rust simd sse2
Language:Rust 1562
VcDevel / Vc
SIMD Vector Classes for C++
vectorization parallel simd-vector simd-instructions simd avx c-plus-plus avx512 sse neon cpp portable cpp11 cpp14 cpp17 avx2 simd-programming data-parallel parallel-computing
Language:C++ 1462
SatDump / SatDump
A generic satellite data processing software.
baseband ccsds digital-signal-processing satellite sdr simd volk
Language:C++ 1402
ada
ada-url / ada
WHATWG-compliant and fast URL parser written in modern C++, part of Node.js, Clickhouse, Redpanda, Kong, Telegram and Cloudflare Workers.
cpp neon parser performance simd sse2 url whatwg-url
Language:C++ 1399
Daniel-Liu-c0deb0t / uwu
fastest text uwuifier in the west
owo simd uwu
Language:Rust 1368
sse2neon
DLTcollab / sse2neon
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
arm aarch64 neon sse x86 simd biilabs armv7l armv8-a intel-sse-intrinsics arm64 neon-intrinsics sse-intrinsics sse2neon armv8 intel-intrinsics apple-silicon
Language:C++ 1319
p12tic / libsimdpp
Portable header-only C++ low level SIMD library
sse avx2 avx512 neon vsx msa altivec simd
Language:C++ 1251