jeekim's repositories

EuropePMC-Identifier-Extractor

A program to extract identifiers such as grant ids, accession numbers etc. in free text

spark-monq

running monq on spark to annotate PMC full-text articles

Language:JavaLicense:CC0-1.0Stargazers:2Issues:2Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0

Multi-Filter-Residual-Convolutional-Neural-Network

Multi-Filter Residual Convolutional Neural Network for Text Classification

Language:PythonStargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

alibi

Algorithms for explaining machine learning models

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

amazon-sagemaker-mlflow-fargate

Managing your machine learning lifecycle with MLflow and Amazon SageMaker

Language:Jupyter NotebookLicense:MIT-0Stargazers:0Issues:0Issues:0

Awesome-medical-coding-NLP

A collection of papers in automated medical coding from free-texts

License:MITStargazers:0Issues:0Issues:0

bio-lm

We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

Efficient_Python_tricks_and_tools_for_data_scientists

Efficient Python Tricks and Tools for Data Scientists

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

EHRKit-2022

A Python Natural Language Processing Toolkit for Electronic Health Record Texts

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

floret

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

Language:C++License:MITStargazers:0Issues:0Issues:0

genai-stack

Langchain + Docker + Neo4j + Ollama

Language:PythonLicense:CC0-1.0Stargazers:0Issues:0Issues:0

graphql-engine

Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events.

Language:HaskellLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ICD-MSMN

Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding [ACL 2022]

Stargazers:0Issues:0Issues:0

ISD

Source code for ACL 2021 paper "Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism"

Stargazers:0Issues:0Issues:0

kedro

A Python framework for creating reproducible, maintainable and modular data science code.

License:Apache-2.0Stargazers:0Issues:0Issues:0

kedro-mlflow-tutorial

A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and serve kedro pipeline

Language:PythonStargazers:0Issues:0Issues:0

kedro-starters-sklearn

Kedro starter templates using Scikit-learn and optionally MLflow

Language:PythonStargazers:0Issues:0Issues:0

LLM-Finetuning

LLM Finetuning with peft

Stargazers:0Issues:0Issues:0

medspacy

Library for clinical NLP with spaCy.

License:MITStargazers:0Issues:0Issues:0

MLServer

An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

License:Apache-2.0Stargazers:0Issues:0Issues:0

prompttools

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

License:Apache-2.0Stargazers:0Issues:0Issues:0