jeekim's repositories
EuropePMC-Identifier-Extractor
A program to extract identifiers such as grant ids, accession numbers etc. in free text
spark-monq
running monq on spark to annotate PMC full-text articles
Multi-Filter-Residual-Convolutional-Neural-Network
Multi-Filter Residual Convolutional Neural Network for Text Classification
alibi
Algorithms for explaining machine learning models
amazon-sagemaker-mlflow-fargate
Managing your machine learning lifecycle with MLflow and Amazon SageMaker
Awesome-medical-coding-NLP
A collection of papers in automated medical coding from free-texts
bio-lm
We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.
BioSentVec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Efficient_Python_tricks_and_tools_for_data_scientists
Efficient Python Tricks and Tools for Data Scientists
EHRKit-2022
A Python Natural Language Processing Toolkit for Electronic Health Record Texts
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
floret
🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
genai-stack
Langchain + Docker + Neo4j + Ollama
graphql-engine
Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events.
ICD-MSMN
Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding [ACL 2022]
ISD
Source code for ACL 2021 paper "Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism"
kedro
A Python framework for creating reproducible, maintainable and modular data science code.
kedro-mlflow-tutorial
A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and serve kedro pipeline
kedro-starters-sklearn
Kedro starter templates using Scikit-learn and optionally MLflow
LLM-Finetuning
LLM Finetuning with peft
medspacy
Library for clinical NLP with spaCy.
MLServer
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines