ourresearch / openalex-topic-classification

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

openalex-topic-classification

This repository contains all of the code for getting the OpenAlex topic classification model up and running. Go into the model iteration directory (v1) to find a more detailed explanation of how to use this repository. To learn more about topics in OpenAlex, check out the docs.

Model Development

You can find an explanation of the modeling and deployment process at the following link (work-in-progress): OpenAlex: End-to-End Process for Topic Classification

Model Iterations

  • v1 (current)

Topics

Works can be tagged with one or more of 4,516 topics, listed here. Each topic is also assigned a domain, field, and subfield that can be used to group similar topics together at a higher level. There is additional data (keywords, summary, wikipedia url) that can be used to help a user see how the clusters are distinguished and what the top words are in each topic.

About

License:MIT License


Languages

Language:Jupyter Notebook 92.8%Language:Python 6.6%Language:Dockerfile 0.3%Language:Shell 0.3%