YuNing's Thought

Home

❯

Notes

❯

Apache Arrow

Apache Arrow

Dec 30, 20251 min read

Columnar format for fast data interchange and in-memory analytics

Arrow provides O(1) random access lookups to any array index, whilst Parquet does not. In particular, Parquet uses dremel record shredding, variable length encoding schemes, and block compression to drastically reduce the data size, but these techniques come at the loss of performant random access lookups.

Links

  • Arrow Columnar Format — Apache Arrow v19.0.1
  • OLAP
  • DuckDB
  • Postgresql

Graph View

Backlinks

  • Apache Datafusion

Created with Quartz v4.5.2 © 2025

  • GitHub
  • Discord Community