Convert ANZSCO names and abbreviations into the consistent format used by the Australian Bureau of Statistics.
Source:R/clean_anzsco.R
clean_anzsco.Rd
This function enables both exact (default) and fuzzy matching. Under exact matching, if no match is found, NA is returned.
Arguments
- x
a (character) vector containing ANZSCO titles. Note that
clean_anzsco
always returns a character vector. If no match is found, thenNA
is returned.- fuzzy_match
logical; either TRUE which indicates that approximate/fuzzy string matching should be used, or FALSE (the default) which indicates that only exact matches should be used. If FALSE, then if no match is found, then NA is returned.
- max_dist
numeric, sets the maximum acceptable distance between your string and the matched string. Default is 0.4. Only relevant when fuzzy_match is TRUE.
- method
the method used for approximate/fuzzy string matching. Default is "jw", the Jaro-Winker distance; see `??stringdist-metrics` for more options. Only relevant when fuzzy_match is TRUE.
- silent
a boolean value. If FALSE (the default), the function will warn that
NA
(s) were returned.
See also
clean_anzsic
for ANZSIC.
Other cleaning functions:
clean_anzsic()
,
clean_asced_foe()
,
clean_asced_qual()
,
clean_nfd()