shainaraza's repositories
bner-biobert
A named entity recognition model that extracts medical and clinical named entities from text. It focuses on COVID-19 and long COVID.
acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
AL-NER
LTP: A New Active Learning Strategy for CRF-Based Named Entity Recognition
awesome-fairness-papers
Papers on fairness in NLP
azimuth
Helping AI practitioners better understand their datasets and models in text classification. From ServiceNow.
bluebert
BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).
brat
brat rapid annotation tool (brat) - for all your textual annotation needs
cleanlab
The standard package for machine learning with label errors, finding mislabeled data, and uncertainty quantification. Works with most datasets and models.
covid-19-data-1
Data on COVID-19 (coronavirus) cases, deaths, hospitalizations, tests • All countries • Updated daily by Our World in Data
COVID19
A worldwide epidemiological database for COVID-19 at fine-grained spatial resolution
covid19-forecast-hub
Projections of COVID-19, in standardized format
covid19-philadelphia
De-identified, aggregate datasets showing COVID-19 cases, hospitalizations, deaths and vaccinations by date, zip, or age/sex/race as made available by the City of Philadelphia through its Open Data Program.
ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).
EHRKit-2022
A Python Natural Language Processing Toolkit for Electronic Health Record Texts
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Gooey
Turn (almost) any Python command line program into a full GUI application with one line
ktrain
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
lux
Automatically visualize your pandas dataframe via a single print! 📊 💡
medspacy
Library for clinical NLP with spaCy.
ml-surveys
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, etc.
ploomber
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
r-course-material
A collection of R tutorials
rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
small-text
Active Learning for Text Classification in Python
spark-nlp-workshop
Public runnable examples of using John Snow Labs' NLP for Apache Spark.