Find the approximate total number of records within a Spatial RDD.
Source:R/spatial_rdd.R
approx_count.Rd
Given a Sedona spatial RDD, find the (possibly approximated) number of total records within it.
See also
Other Spatial RDD aggregation routine:
minimum_bounding_box()
Examples
library(sparklyr)
#>
#> Attaching package: ‘sparklyr’
#> The following object is masked from ‘package:stats’:
#>
#> filter
library(apache.sedona)
sc <- spark_connect(master = "spark://HOST:PORT")
if (!inherits(sc, "test_connection")) {
input_location <- "/dev/null" # replace it with the path to your input file
rdd <- sedona_read_shapefile_to_typed_rdd(
sc,
location = input_location, type = "polygon"
)
approx_cnt <- approx_count(rdd)
}