Blog

The official source for Apache Sedona news, technical insights, release updates, and best practices in large-scale spatial data management.

Thursday, December 11, 2025
6 min read

Introducing SpatialBench: performance benchmarks for spatial database queries

SpatialBench is a benchmarking framework for spatial joins, distance queries, and point-in-polygon analyses.

Traditional benchmarking frameworks don’t include spatial workflows. It’s important to benchmark spatial workflows separately because an engine that’s fast for tabular data analyses isn’t necessarily performant for spatial queries.

For example, here are the SpatialBench results for Scale Factor 1 (SF-1) and SF-10 for SedonaDB, DuckDB, and GeoPandas on a single EC2 instance:

Monday, December 1, 2025
6 min read

SedonaDB 0.2.0 Release

The Apache Sedona community is excited to announce the release of SedonaDB version 0.2.0!

SedonaDB is the first open-source, single-node analytical database engine that treats spatial data as a first-class citizen. It is developed as a subproject of Apache Sedona. This release resolves 136 issues, including 40 new functions, with work from 17 contributors.

Apache Sedona powers large-scale geospatial processing on distributed engines like Spark (SedonaSpark), Flink (SedonaFlink), and Snowflake (SedonaSnow). SedonaDB extends the Sedona ecosystem with a single-node engine optimized for small-to-medium data analytics, delivering the simplicity and speed that distributed systems often cannot.

Tuesday, October 21, 2025
11 min read

Managing spatial tables in Data Lakehouses with Iceberg

This post explains the benefits of the Lakehouse Architecture for spatial tables and how Lakehouses differ from data warehouses and data lakes.

Wednesday, September 24, 2025
9 min read

Introducing SedonaDB: A single-node analytical database engine with geospatial as a first-class citizen

The Apache Sedona community is excited to announce the initial release of SedonaDB! 🎉

SedonaDB is the first open-source, single-node analytical database engine that treats spatial data as a first-class citizen. It is developed as a subproject of Apache Sedona.

Apache Sedona powers large-scale geospatial processing on distributed engines like Spark (SedonaSpark), Flink (SedonaFlink), and Snowflake (SedonaSnow). SedonaDB extends the Sedona ecosystem with a single-node engine optimized for small-to-medium data analytics, delivering the simplicity and speed that distributed systems often cannot.

Friday, September 5, 2025
10 min read

Should You Use H3 for Geospatial Analytics? A Deep Dive with Apache Spark and Sedona

TL;DR: The H3 spatial index provides a number of spatial functions and a consistent grid system for efficient data aggregation and visualization. H3 is an approximation that makes some computations run faster, but less accurately. Sedona supports the H3 spatial index, but it's often preferable to use precise computations, especially when accuracy is important.

Wednesday, July 9, 2025
1 min read

Welcome to the Apache Sedona Blog!

Welcome to the brand-new blog for Apache Sedona!

For several years, Apache Sedona has been the go-to open-source engine for processing massive geospatial datasets, extending Apache Spark to handle complex spatial operations with unparalleled speed and efficiency. Sedona's capabilities also extend beyond Spark, bringing spatial analytics to the Snowflake data warehouse with SedonaSnow and the real-time streaming engine Apache Flink with a Spatial SQL integration.