What are categories?
Event Registry categorizes the news into a taxonomy of categories from DMOZ. The DMOZ taxonomy has over 1 million categories organized into several levels. In Event Registry, we use only the top 3 levels of taxonomy which amounts to about 50,000 categories – from generic ones (like Business) to very specific ones (like Business/Investing/Derivatives).
Categories are assigned to articles and events using machine learning models that are trained for each category separately. Categories do not represent a particular mention in the articles or events, but instead represent what topic the content is about. Categories are currently assigned only to content in the English language.