oneAPI Deep Neural Network Library (oneDNN)
-
Updated
Jan 23, 2025 - C++
oneAPI Deep Neural Network Library (oneDNN)
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
Half-precision floating point types f16 and bf16 for Rust.
Round matrix elements to lower precision in MATLAB
Floating-Point Arithmetic Library for Z80
C++ template library for floating point operations
A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTorch with bfloat16.
A JAX implementation of stochastic addition.
IEEE 754-style floating-point converter
CUDA/HIP header-only library to use vector and low-precision floating-point types (16 bit, 8 bit) in GPU code
A Pytorch implementation of stochastic addition.
Comparison of vector element sum using various data types.
Basic linear algebra routines implemented using the chop rounding function
Customizable floating point types, with all standard floating point operations implemented from scratch.
Comparison of PageRank algorithm using various datatypes.
Hybridized On-Premise and Cloud (HOPC) Deployment Experimentation with Bfloat16
Add a description, image, and links to the bfloat16 topic page so that developers can more easily learn about it.
To associate your repository with the bfloat16 topic, visit your repo's landing page and select "manage topics."