Connecting firm's web scraped textual content to body of science : Utilizing microsoft academic graph hierarchical topic modeling
This paper demonstrates a method to transform and link textual information scraped from companies' websites to the scientific body of knowledge. The method illustrates the benefit of Natural Language Processing (NLP) in creating links between established economic classification systems with novel and agile constructs that new data sources enable. Therefore, we experimented on the European classifi
