Skip to content
🎉 Apache Sedona 1.8.1 is now available! Check out the new features and improvements.

ST_DBSCAN

Introduction: Performs a DBSCAN clustering across the entire dataframe.

Returns a struct containing the cluster ID and a boolean indicating if the record is a core point in the cluster.

  • epsilon is the maximum distance between two points for them to be considered as part of the same cluster.
  • minPoints is the minimum number of neighbors a single record must have to form a cluster.
  • useSpheroid is whether to use ST_DistanceSpheroid or ST_Distance as the distance metric.

Format: ST_DBSCAN(geom: Geometry, epsilon: Double, minPoints: Integer, useSpheroid: Boolean)

Since: v1.7.1

SQL Example

SELECT ST_DBSCAN(geom, 1.0, 2, False)

Output:

{true, 85899345920}