thom lake (thomlake)

thomlake

Geek Repo

Company:Indeed

Location:Austin, TX, USA

Home Page:http://thomlake.github.io/

Github PK Tool:Github PK Tool

thom lake's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:135003Issues:1124Issues:16141

scikit-learn

scikit-learn: machine learning in Python

Language:PythonLicense:BSD-3-ClauseStargazers:60135Issues:2138Issues:11233

taipy

Turns Data and AI algorithms into production-ready web applications in no time.

Language:PythonLicense:Apache-2.0Stargazers:15381Issues:76Issues:860

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Language:PythonLicense:MITStargazers:10491Issues:283Issues:1544

hydra

Hydra is a framework for elegantly configuring complex applications

Language:PythonLicense:MITStargazers:8809Issues:129Issues:1395

quickdraw-dataset

Documentation on how to access and use the Quick, Draw! Dataset.

lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

Language:TypeScriptLicense:Apache-2.0Stargazers:3491Issues:67Issues:135

thinc

đź”® A refreshing functional take on deep learning, compatible with your favorite libraries

Language:PythonLicense:MITStargazers:2820Issues:78Issues:146

rebiber

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Language:PythonLicense:MITStargazers:2643Issues:15Issues:30

CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code.

Language:Jupyter NotebookLicense:MITStargazers:2210Issues:60Issues:61

json_repair

A python module to repair invalid JSON, commonly used to parse the output of LLMs

Language:PythonLicense:MITStargazers:1137Issues:5Issues:61

PyDataset

Instant access to many datasets in Python.

Language:PythonLicense:MITStargazers:935Issues:34Issues:17

ark-tweet-nlp

CMU ARK Twitter Part-of-Speech Tagger

Language:JavaLicense:NOASSERTIONStargazers:574Issues:65Issues:32

AIQ

Algorithmic Intelligence Quotient

Language:PythonLicense:GPL-3.0Stargazers:38Issues:4Issues:0

TriangleCOPA

One hundred challenge problems for logical formalizations of commonsense psychology

foxcross

AsyncIO serving for data science models

Language:PythonLicense:BSD-3-ClauseStargazers:24Issues:2Issues:0
Language:PythonLicense:MITStargazers:22Issues:1Issues:0

treebank-scripts

Suite of scripts for preprocessing the Penn Treebank, primarily to extract lexical subcategorization frames and dependencies.

Language:PerlLicense:MITStargazers:7Issues:1Issues:17

data-file-parsers

stuff to parse data files

Language:PythonStargazers:2Issues:2Issues:0

gen-twitter-data-files

python scripts to get twitter feeds (stream and search)

Language:PythonStargazers:2Issues:2Issues:0

tokenizer

string tokenization in clojure

Language:ClojureStargazers:2Issues:2Issues:0

1milliontweets

A big text file full of tweets

Language:PythonStargazers:1Issues:2Issues:0

AIQ

Algorithmic Intelligence Quotient

Language:PythonLicense:GPL-3.0Stargazers:1Issues:2Issues:0
Language:PythonLicense:GPL-3.0Stargazers:1Issues:2Issues:0

DataUtils

ml related data utilities in python

Language:PythonStargazers:1Issues:2Issues:0

EbmLib

Energy Based Models for Python with Numpy

Language:PythonLicense:GPL-3.0Stargazers:1Issues:3Issues:0

hashfile

simple-hashfile

Language:ClojureStargazers:1Issues:2Issues:0

MyPyStuff

random useful python stuff

Language:PythonLicense:GPL-3.0Stargazers:1Issues:2Issues:0

tweepy-examples

example python source using the tweepy twitter api

Language:PythonStargazers:1Issues:2Issues:0