Kemar Reid's starred repositories

paper-qa

High accuracy RAG for answering questions from scientific documents with citations

Language:PythonLicense:Apache-2.0Stargazers:5020Issues:0Issues:0

sqlcoder

SoTA LLM for converting natural language questions to SQL queries

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3285Issues:0Issues:0

langchain

๐Ÿฆœ๐Ÿ”— Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:92283Issues:0Issues:0

PyTrial

PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Development

Language:PythonLicense:BSD-2-ClauseStargazers:77Issues:0Issues:0

tech-interview-handbook

๐Ÿ’ฏ Curated coding interview preparation materials for busy software engineers

Language:TypeScriptLicense:MITStargazers:116779Issues:0Issues:0

DS-Career-Resources

Compilation of resources for aspiring data scientists

Language:PythonStargazers:1948Issues:0Issues:0

mlflow-export-import

Export and import MLflow experiments, runs or registered models

Language:HTMLLicense:Apache-2.0Stargazers:78Issues:0Issues:0

medaCy

:hospital: Medical Text Mining and Information Extraction with spaCy

Language:PythonLicense:GPL-3.0Stargazers:428Issues:0Issues:0

medspacy

Library for clinical NLP with spaCy.

Language:Jupyter NotebookLicense:MITStargazers:518Issues:0Issues:0

pyspark-style-guide

This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.

Language:PythonLicense:MITStargazers:1016Issues:0Issues:0

snack

Stochastic Neighbor and Crowd Kernel (SNaCK) embeddings: Quick and dirty visualization of large-scale datasets via concept embeddings

Language:C++License:NOASSERTIONStargazers:51Issues:0Issues:0

awesome-relation-extraction

๐Ÿ“– A curated list of awesome resources dedicated to Relation Extraction, one of the most important tasks in Natural Language Processing (NLP).

Stargazers:1176Issues:0Issues:0

OpenNRE

An Open-Source Package for Neural Relation Extraction (NRE)

Language:PythonLicense:MITStargazers:4314Issues:0Issues:0

QuickUMLS

System for Medical Concept Extraction and Linking

Language:PythonLicense:MITStargazers:369Issues:0Issues:0

mitmproxy

An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.

Language:PythonLicense:MITStargazers:35918Issues:0Issues:0

applied-ml

๐Ÿ“š Papers & tech blogs by companies sharing their work on data science & machine learning in production.

License:MITStargazers:27154Issues:0Issues:0

SpiderKeeper

admin ui for scrapy/open source scrapinghub

Language:PythonStargazers:2735Issues:0Issues:0

dockerfiles

:whale: A curated list of delicious docker recipes ๐Ÿ‡บ๐Ÿ‡ฆ๐Ÿ‡ฎ๐Ÿ‡ฑ (Let's Fight Against ************)

Language:DockerfileStargazers:3130Issues:0Issues:0

scrapydweb

Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. DEMO :point_right:

Language:PythonLicense:GPL-3.0Stargazers:3112Issues:0Issues:0

metals-sublime

Sublime Text package for Metals, a language server for Scala

Language:PythonLicense:Apache-2.0Stargazers:16Issues:0Issues:0
Language:ScalaStargazers:13Issues:0Issues:0

scrapinghub-stack-scrapy

Software stack with latest Scrapy and updated deps

Language:DockerfileLicense:BSD-3-ClauseStargazers:60Issues:0Issues:0

pdfminer.six

Community maintained fork of pdfminer - we fathom PDF

Language:PythonLicense:MITStargazers:5806Issues:0Issues:0

pdfstructure

`pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.

Language:PythonStargazers:96Issues:0Issues:0

spark-nlp-workshop

Public runnable examples of using John Snow Labs' NLP for Apache Spark.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1026Issues:0Issues:0

scala-scraper

A Scala library for scraping content from HTML pages

Language:ScalaLicense:MITStargazers:717Issues:0Issues:0

drugstandards

Python tools for standardizing drug names to generic names.

Language:PythonLicense:MITStargazers:22Issues:0Issues:0

ceja

PySpark phonetic and string matching algorithms

Language:PythonLicense:MITStargazers:34Issues:0Issues:0

pyspark-utilities

ETL utilities library for PySpark

Language:PythonLicense:MITStargazers:8Issues:0Issues:0

sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

Language:JavaLicense:Apache-2.0Stargazers:411Issues:0Issues:0