Simdjson is the open-source project for high performance JSON parsing by leveraging SIMD instructions for “parsing gigabytes of JSON per second.” Notably it showed years ago the huge performance advantage to using AVX-512 in JSON parsing for surprisingly big benefits. Simdjson has continued advancing since then with various optimizations over the years and today is out with simdjson 4.3 that brings yet more SIMD optimizations.
Simdjson 4.3 released today and now handles runtime dispaching on LoongArch around the LSX and LASX instructions, Microsoft’s Visual C++ Group improving the build throughput by 30% through build optimizations, and various other enhancements.
Catching my eye with the new simdjson 4.3 release is this pull for SIMD string escaping and batch integer formatting optimizations. For string-heavy JSON parsing it can yield some massive benefits.
This pull that is part of simdjson 4.3 yields a 30% improvement for the Twitter benchmark and a 6% improvement for the CITM Catalog benchmark that is more numeric heavy than string heavy. This SIMD tuning around string escaping and batch integer formatting is using ARM64 NEON and SSE2.
Downloads and more details on simdjson 4.3 via GitHub. For those interested I do have simdjson benchmarks on different CPUs.
