Abstract:
|
This paper describes the methodology behind BEACON, a tool that respondents to the 2022 Economic Census will use to self-designate their establishment’s North American Industry Classification System (NAICS) code. BEACON, which stands for Business Establishment Automated Classification of NAICS, takes a respondent-provided business description as input and returns a list of candidate NAICS codes from which the respondent can choose. BEACON is based on text analysis, machine learning, and information retrieval. Its training dataset contains over 3.7 million observations from sources such as past Economic Census responses and Internal Revenue Service data. The paper shows how BEACON employs ensemble and hierarchical modeling techniques to propose relevant NAICS codes, and it discusses results from a recent Economic Census field test.
|