YuNing's Thought

Apache Arrow

Jan 22, 2026 · 1 min read

Columnar format for fast data interchange and in-memory analytics

Arrow provides O(1) random access to any array index, whereas Parquet does not. Parquet uses Dremel record shredding, variable-length encoding schemes, and block compression to drastically reduce data size, but these techniques come at the cost of efficient random access: reading a single value generally requires decompressing and decoding the page that contains it.
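To make the trade-off concrete, here is a minimal pure-Python sketch. It does not use the real Arrow or Parquet libraries. The helper names (`build_int32_buffer`, `get_int32`, `build_string_array`, `get_string`, `encode_varints`, `get_varint`) are hypothetical, and the LEB128 varint stream is only a toy stand-in for variable-length encoding, not Parquet's actual on-disk format. The first two layouts loosely mirror how Arrow stores fixed-width values and variable-length strings (a data buffer plus an offsets buffer).

```python
import struct

# --- Arrow-style fixed-width layout (simplified): one contiguous buffer ---
def build_int32_buffer(values):
    """Pack integers as little-endian int32, 4 bytes each."""
    return struct.pack(f"<{len(values)}i", *values)

def get_int32(buffer, i):
    """O(1): the byte position of element i is simply i * 4."""
    return struct.unpack_from("<i", buffer, i * 4)[0]

# --- Arrow-style variable-length strings: offsets buffer + data buffer ---
def build_string_array(strings):
    offsets = [0]
    data = bytearray()
    for s in strings:
        data += s.encode("utf-8")
        offsets.append(len(data))  # end of this string, start of the next
    return offsets, bytes(data)

def get_string(offsets, data, i):
    """O(1): two offset lookups bound the slice for element i."""
    return data[offsets[i]:offsets[i + 1]].decode("utf-8")

# --- Toy variable-length encoding (unsigned LEB128 varints) ---
def encode_varints(values):
    """Encode non-negative integers using 7 payload bits per byte."""
    out = bytearray()
    for v in values:
        while True:
            byte = v & 0x7F
            v >>= 7
            if v:
                out.append(byte | 0x80)  # high bit set: more bytes follow
            else:
                out.append(byte)
                break
    return bytes(out)

def get_varint(stream, i):
    """O(n): element i has no predictable position, so scan from the start."""
    pos = 0
    for _ in range(i + 1):
        value, shift = 0, 0
        while True:
            byte = stream[pos]
            pos += 1
            value |= (byte & 0x7F) << shift
            shift += 7
            if not byte & 0x80:
                break
    return value

values = [3, 300, 70000, 5, 1_000_000]
fixed = build_int32_buffer(values)
varint = encode_varints(values)
offsets, data = build_string_array(["arrow", "", "parquet"])
```

Both `get_int32(fixed, 2)` and `get_varint(varint, 2)` return the same value, but the first computes a byte position directly while the second has to decode every preceding element. The varint stream is smaller than the fixed-width buffer, which is the size-versus-access trade-off described above in miniature.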

Links

  • Arrow Columnar Format — Apache Arrow v19.0.1
  • OLAP
  • DuckDB
  • PostgreSQL


Backlinks

  • Apache DataFusion
