Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.

The underwater R data package brings together datasets for submerged geographic features and place names that are presently dispersed. The aim of the package is to facilitate mapping and text mining of scientific and other literature for underwater features and place names using a tidy approach.

The package aims to assist researchers seeking to contribute to the evidence base for negotiations of a new international treaty on marine biodiversity in Areas Beyond National Jurisdiction (ABNJ) under the United Nations General Assembly. The package was written as part of the BIOSPOLAR project funded by the Norwegian Research Council (RCN 257631/E10), which aims to map research and innovation in polar marine areas. The views expressed in this package are strictly the author’s own and should not be attributed to the Norwegian Research Council.

The package provides open access datasets for named underwater places from:

  1. The GEBCO Gazetteer with 4,084 place and feature names.
  2. The InterRidge Hydrothermal Vents Database with 700 hydrothermal vents.
  3. Underwater features from the Geonames database coded U for Undersea. This dataset contains 14,513 underwater place names.

Installing

underwater is not on CRAN but can be installed from GitHub with devtools:

install.packages("devtools")
devtools::install_github("poldham/underwater")

The datasets

The datasets have different update schedules. An archive of each dataset is provided in /data and can be loaded from the package.

  1. The GEBCO Gazetteer is rarely updated. The package archives a version of the dataset from 2017-12-29, which can be called using underwater::gebco or data("gebco").
gebco <- underwater::gebco
  2. The InterRidge Hydrothermal Vents Database is updated infrequently. An archived version can be accessed using underwater::vents or data("vents"). To import an updated version use import_vents().
vents <- underwater::vents
  3. The Geonames undersea data is taken from the Geonames allCountries.txt export dump and filtered on code U for Undersea. The Geonames export files are updated daily. A reference set from 2018-01-01 is provided with the package. The Geonames data can be accessed using underwater::undersea or data("undersea").
undersea <- underwater::undersea

The raw data is not provided in a form that makes text mining easy. The data is not tidy and may contain punctuation or concatenation of names. Alternate names may also be provided.
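As a rough illustration of the kind of tidying this implies, the base R sketch below splits concatenated names, strips stray punctuation and whitespace, and then checks which cleaned names occur in a piece of text. The input strings, the choice of separators and the sample sentence are assumptions made for illustration; they are not taken from the packaged datasets, whose actual columns and quirks may differ.

```r
# Toy stand-ins for raw feature names (illustrative strings only,
# not rows from gebco, vents or undersea).
raw_names <- c(
  "Mid-Atlantic Ridge;",
  "  Mariana  Trench ",
  "Lost City, Lost City Hydrothermal Field"
)

# Split concatenated or alternate names on commas and semicolons,
# remembering which raw entry each piece came from.
pieces <- strsplit(raw_names, "[,;]")
tidy <- data.frame(
  source_row = rep(seq_along(pieces), lengths(pieces)),
  name = unlist(pieces),
  stringsAsFactors = FALSE
)

# Trim whitespace, collapse runs of spaces and lower-case for matching.
tidy$name <- tolower(gsub("\\s+", " ", trimws(tidy$name)))
tidy <- tidy[nzchar(tidy$name), , drop = FALSE]

# Flag which cleaned names appear in a (made-up) sentence.
text <- "Samples were collected near the Mariana Trench and the Lost City field."
tidy$found <- vapply(
  tidy$name,
  function(n) grepl(n, tolower(text), fixed = TRUE),
  logical(1),
  USE.NAMES = FALSE
)

print(tidy)
```

Each name ends up on its own row, one observation per row, which is the shape most text mining tools expect. Simple substring matching like this will also match names embedded in longer words, so a real workflow would probably want word-boundary handling.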

Note that latitude and longitude coordinates are provided in the vents and the Geonames datasets. GEBCO data includes Point, Linestring and Polygon geometries.
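For the point datasets, a common first step before mapping is to subset features by a bounding box. The sketch below shows one way to do that in base R. The data frame, its column names (name, latitude, longitude) and the in_bbox() helper are hypothetical and only stand in for the real data; check the actual column names with names(vents) or names(undersea) before adapting it.

```r
# Hypothetical point data; column names and values are illustrative only.
points <- data.frame(
  name = c("Vent A", "Vent B", "Vent C"),
  latitude = c(73.5, -60.0, 12.8),
  longitude = c(8.0, -25.0, 143.0),
  stringsAsFactors = FALSE
)

# Hypothetical helper: keep rows whose coordinates fall inside a
# latitude/longitude bounding box (bounds inclusive).
in_bbox <- function(df, lat_min, lat_max, lon_min, lon_max) {
  keep <- df$latitude >= lat_min & df$latitude <= lat_max &
    df$longitude >= lon_min & df$longitude <= lon_max
  df[keep, , drop = FALSE]
}

# Example: features north of 66.5 degrees latitude.
arctic <- in_bbox(points, 66.5, 90, -180, 180)
print(arctic)
```

This simple box does not handle regions that cross the antimeridian (180 degrees longitude), and it does not apply to the GEBCO Linestring and Polygon geometries, which would need a spatial package.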

If you know of an open access dataset that could be added to aid in underwater mapping projects please raise an issue or make a pull request. Here are some of the other packages I have found so far.

Other R packages dealing with Ocean data

  • robis for taxonomic data from the Ocean Biogeographic Information System
  • oce for working with instrument data and a variety of mapping functions
  • oceanview for visualizing complex oceanographic data
  • oceanmap A toolbox for plotting 2D oceanographic data
  • rnaturalearth A package to facilitate interaction with natural earth map data
  • rnaturalearthdata World Vector Map Data from Natural Earth used in rnaturalearth

Other data sources

Marineregions.org brings together many GIS datasets (such as maritime boundaries) that can be useful for mapping in R and other tools.

Quantarctica3 is a resource for those interested in Antarctica and the Southern Ocean and provides resources for use with QGIS.