vincent d warmerdam (koaning)

Company: @explosion

Location: Amsterdam

Home Page: https://koaning.io

Twitter: @fishnets88

vincent d warmerdam's repositories

doubtlab

Doubt your data, find bad labels.

Language: Python | License: MIT | Stargazers: 492 | Issues: 7 | Issues: 23

whatlies

Toolkit to help understand "what lies" in word embeddings. Also benchmarking!

Language: Python | License: Apache-2.0 | Stargazers: 466 | Issues: 14 | Issues: 139

mktestdocs

Run pytest against markdown files/docstrings.

Language: Python | License: Apache-2.0 | Stargazers: 101 | Issues: 3 | Issues: 3

scikit-prune

Prune your sklearn models

Language: Python | License: MIT | Stargazers: 19 | Issues: 3 | Issues: 0

gitlit

Streamlit App on Github Actions

texttoolz

tools and tricks that are good to have around

License: MIT | Stargazers: 5 | Issues: 2 | Issues: 0

boondoc

lightweight Python API docs for markdown

Language: Python | License: MIT | Stargazers: 4 | Issues: 2 | Issues: 0

calm-stats

Some GitScrapers

Language: Python | License: MIT | Stargazers: 3 | Issues: 2 | Issues: 0

featherbed

Very lightweight text vectors via tf/idf + SVD

Language: Python | License: MIT | Stargazers: 3 | Issues: 2 | Issues: 0

manyterms

Many terms for whatever purposes (weak labelling)

wordlists

Just a bunch of potentially useful wordlists.

License: NOASSERTION | Stargazers: 2 | Issues: 2 | Issues: 1

wow-avatar-datasets

A place to host some parquet files.

Stargazers: 2 | Issues: 0 | Issues: 0

bulk-datasets

Helpers for the download command.

scikit-prodigy

Helpers to leverage scikit-learn pipelines in Prodigy.

Language: Python | License: MIT | Stargazers: 1 | Issues: 3 | Issues: 0

skooba

less weak supervision

License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

srsly

🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

awesome-normconf

List of resources coming out of Normconf Slack

Stargazers: 0 | Issues: 1 | Issues: 0

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

floret

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

Language: C++ | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

json-schema-demo

json schemas as a demo

Stargazers: 0 | Issues: 2 | Issues: 0

koaning-io-old

Just a copy of my old blog.

Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0

polars

Rust DataFrame library

Language: Rust | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

projects

🪐 End-to-end NLP workflows from prototype to production

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

radicli

🕊️ Radically lightweight command-line interfaces

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

textual

Textual is a TUI (Text User Interface) framework for Python inspired by modern web development.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

weasel

🦦 weasel: A small and easy workflow system

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

weasel-old

Weasel: A small and easy workflow system

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0