I — Foundations Refreshed → Chapter 3
FROM SYSTEMS TO FRONTIER ML

Floating point, integers & quantization error

A refresher on IEEE-754, fixed-point and integer arithmetic, and where quantization error comes from. Kernel: an int8 dot product built on _mm_maddubs_epi16.

§1 IEEE-754 refreshed
§2 Integers and fixed-point
§3 Quantization & the int8 dot product
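
As a rough preview of the kernel named in the summary, the following is a scalar C model of what one 16-bit lane of _mm_maddubs_epi16 computes: it multiplies unsigned 8-bit values from the first operand by signed 8-bit values from the second, then adds each adjacent pair of products with signed saturation to 16 bits. This is only a sketch for building intuition, not the chapter's implementation; the helper names sat16, maddubs_lane, and dot_u8s8 are hypothetical, and the real kernel would process 16 bytes per instruction rather than two.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Clamp a 32-bit value into the int16_t range (signed saturation). */
static int16_t sat16(int32_t v) {
    if (v > INT16_MAX) return INT16_MAX;
    if (v < INT16_MIN) return INT16_MIN;
    return (int16_t)v;
}

/* Scalar model of one output lane (hypothetical helper):
 * unsigned a times signed b for two adjacent bytes, summed,
 * then saturated to 16 bits. */
static int16_t maddubs_lane(uint8_t a0, int8_t b0, uint8_t a1, int8_t b1) {
    int32_t sum = (int32_t)a0 * b0 + (int32_t)a1 * b1;
    return sat16(sum);
}

/* Dot product of n unsigned-8 values with n signed-8 values
 * (hypothetical helper). Pairs go through the lane model and are
 * then accumulated in 32 bits. n must be even in this sketch. */
static int32_t dot_u8s8(const uint8_t *a, const int8_t *b, size_t n) {
    assert(n % 2 == 0);
    int32_t acc = 0;
    for (size_t i = 0; i < n; i += 2) {
        acc += maddubs_lane(a[i], b[i], a[i + 1], b[i + 1]);
    }
    return acc;
}
```

The saturation step is the detail worth noticing: with both unsigned inputs at 255 and both signed inputs at 127, the exact pair sum is 64770, which does not fit in 16 bits, so the lane clamps to 32767. That clamping is an error source separate from the rounding introduced when values are quantized to 8 bits.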
