Make taxonomy lookups

make_gbif_taxonomy(
  df,
  taxa_col = "original_name",
  taxonomy_file,
  target_rank = "species",
  limit = TRUE,
  fixes = NULL,
  overrides = NULL,
  ...
)

Arguments

df: Dataframe with taxa_col.
taxa_col: Character. Name of column in df with taxa names
taxonomy_file: Character. Path to results from envClean::get_taxonomy()
target_rank: Character. Default is 'species'. At what level of the taxonomic hierarchy are results desired. This is the most detailed taxonomy returned. i.e. if genus is the target_rank, no taxa below genus are returned. See envClean::lurank rank column.
limit: Logical. If true (default), the output taxonomy will be limited to the input names in df. Otherwise, all taxa found in taxonomy_file will be returned.
fixes: Data frame with columns resolved_to and prefer. Any taxa result in lutaxa that matches a name in resolved_to will be changed to prefer. Mainly used where legitimate names are used in areas where they do not exist. e.g. Eastern osprey Pandion cristatus does not occur in South Australia but records of this species in South Australia are assumed to be legitimate Osprey (Pandion haliaetus) records.
overrides: Data frame with columns original and prefer. Any original_name result in lutaxa that matches a name in original will be have its corresponding taxa changed to prefer. Useful where GBIF Backbone Taxonomy provides a spurious result. e.g. The GBIF Backbone Taxonomy changes Thinornis rubricollis to Phalaropus lobatus rather than the preferred Charadrius cucullatus.
...: Passed to envClean::get_taxonomy()

Value

named list with elements:

lutaxa: Dataframe. For each unique name in taxa_col, the best taxa to use (taking into account target_rank)
taxonomy: Dataframe. For each taxa in lutaxa a row of taxonomic hierarchy and matching gbif usageKeys