Extract rows from shapefile

Supported in: Batch

Reads a dataset of files and parses each shapefile into rows. All files except .shp, .shx and .dbf files will be ignored. This shapefile parser only supports point, polyline, polygon and multipoint geometry types. The output dataset will have a geometry column, and a column for each property listed by the user, apart from the _error and _file columns. If the user provides no properties to extract, the entire properties struct will be extracted into a properties column as a string.

Transform categories: File, Geospatial

Declared arguments

  • Dataset - Dataset of shapefiles to process. Each shapefile must have a .shp, .shx and a .dbf file. All files of a shapefile must have the same name. For example, a dataset with these files has two shapefiles (shapefile1, and shapefile2): shapefile1.shp, shapefile1.shx, shapefile1.dbf, folder/shapefile2.shp, folder/shapefile2.shx, folder/shapefile2.dbf.
    Files
  • List of properties - List of properties and their types that need to be extracted from these shapefiles. If an empty struct is provided, extracts all properties in one 'properties' column as a string.
    Type<Struct>
  • optional Source coordinate system - Coordinate system identifier formatted as "authority". For example, UTM zone 18N could be identified by EPSG:32618. If not specified, will default to WGS84 which is EPSG:4326.
    Literal<String>