jeekim's repositories

EuropePMC-Identifier-Extractor

A program to extract identifiers such as grant ids, accession numbers etc. in free text

spark-monq

running monq on spark to annotate PMC full-text articles

Language:JavaLicense:CC0-1.0Stargazers:2Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0
Stargazers:1Issues:0Issues:0

Multi-Filter-Residual-Convolutional-Neural-Network

Multi-Filter Residual Convolutional Neural Network for Text Classification

Language:PythonStargazers:1Issues:0Issues:0
License:MITStargazers:1Issues:0Issues:0

alibi

Algorithms for explaining machine learning models

License:Apache-2.0Stargazers:0Issues:0Issues:0

amazon-sagemaker-mlflow-fargate

Managing your machine learning lifecycle with MLflow and Amazon SageMaker

License:MIT-0Stargazers:0Issues:0Issues:0

Awesome-medical-coding-NLP

A collection of papers in automated medical coding from free-texts

License:MITStargazers:0Issues:0Issues:0

bio-lm

We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.

License:NOASSERTIONStargazers:0Issues:0Issues:0

BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Efficient_Python_tricks_and_tools_for_data_scientists

Efficient Python Tricks and Tools for Data Scientists

Stargazers:0Issues:0Issues:0

EHRKit-2022

A Python Natural Language Processing Toolkit for Electronic Health Record Texts

Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License:MITStargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

License:Apache-2.0Stargazers:0Issues:0Issues:0

floret

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

License:MITStargazers:0Issues:0Issues:0

genai-stack

Langchain + Docker + Neo4j + Ollama

License:CC0-1.0Stargazers:0Issues:0Issues:0

graphql-engine

Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events.

License:Apache-2.0Stargazers:0Issues:0Issues:0

ICD-MSMN

Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding [ACL 2022]

Stargazers:0Issues:0Issues:0

ISD

Source code for ACL 2021 paper "Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism"

Stargazers:0Issues:0Issues:0

kedro

A Python framework for creating reproducible, maintainable and modular data science code.

License:Apache-2.0Stargazers:0Issues:0Issues:0

kedro-mlflow-tutorial

A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and serve kedro pipeline

Stargazers:0Issues:0Issues:0

kedro-starters-sklearn

Kedro starter templates using Scikit-learn and optionally MLflow

Language:PythonStargazers:0Issues:0Issues:0

LLM-Finetuning

LLM Finetuning with peft

Stargazers:0Issues:0Issues:0

medspacy

Library for clinical NLP with spaCy.

License:MITStargazers:0Issues:0Issues:0

MLServer

An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

License:Apache-2.0Stargazers:0Issues:0Issues:0

prompttools

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

License:Apache-2.0Stargazers:0Issues:0Issues:0

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

License:Apache-2.0Stargazers:0Issues:0Issues:0