Craig Schmidt (craigschmidt)

craigschmidt

Geek Repo

Location:Wellesley, MA

Github PK Tool:Github PK Tool

Craig Schmidt's starred repositories

helix

A post-modern modal text editor.

Language:RustLicense:MPL-2.0Stargazers:30896Issues:181Issues:4175

glow

Render markdown on the CLI, with pizzazz! 💅🏻

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:14222Issues:133Issues:2002

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++License:Apache-2.0Stargazers:9731Issues:123Issues:728

kakoune

mawww's experiment for a better code editor

Language:C++License:UnlicenseStargazers:9656Issues:112Issues:2792

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustLicense:Apache-2.0Stargazers:8611Issues:120Issues:946

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4417Issues:76Issues:87

Riskfolio-Lib

Portfolio Optimization and Quantitative Strategic Asset Allocation in Python

Language:C++License:BSD-3-ClauseStargazers:2793Issues:76Issues:128

zee

A modern text editor for the terminal written in Rust

Language:RustLicense:Apache-2.0Stargazers:1440Issues:18Issues:46

AlphaZero.jl

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

Language:JuliaLicense:MITStargazers:1222Issues:27Issues:143

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language:C++License:NOASSERTIONStargazers:934Issues:24Issues:186

Empyrial

An Open Source Portfolio Backtesting Engine for Everyone | 面向所有人的开源投资组合回测引擎

Language:PythonLicense:MITStargazers:886Issues:30Issues:44

HiGHS

Linear optimization software

Language:C++License:MITStargazers:833Issues:31Issues:641

BanditPAM

BanditPAM C++ implementation and Python package

Language:C++License:MITStargazers:646Issues:9Issues:201

textadept

Textadept is a fast, minimalist, and remarkably extensible cross-platform text editor for programmers.

Language:LuaLicense:MITStargazers:609Issues:23Issues:222

Tokenizer

Fast and customizable text tokenization library with BPE and SentencePiece support

Language:C++License:MITStargazers:268Issues:19Issues:79

SBERT-WK-Sentence-Embedding

IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models

Language:PythonLicense:Apache-2.0Stargazers:177Issues:7Issues:11

tokenizer

NLP tokenizers written in Go language

Language:GoLicense:Apache-2.0Stargazers:147Issues:10Issues:24

primme

PReconditioned Iterative MultiMethod Eigensolver for solving symmetric/Hermitian eigenvalue problems and singular value problems

Language:CLicense:NOASSERTIONStargazers:132Issues:14Issues:63

cc-lambda

Search the common crawl using lambda functions

sequence_align

Efficient implementations of Needleman-Wunsch and other sequence alignment algorithms written in Rust with Python bindings via PyO3.

Language:PythonLicense:Apache-2.0Stargazers:58Issues:4Issues:5

TP4

Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)

Language:PythonLicense:Apache-2.0Stargazers:54Issues:1Issues:0

SaGe

Code for SaGe subword tokenizer (EACL 2023)

Language:PythonLicense:MITStargazers:20Issues:0Issues:0

aiohttp-scraper

A robust asynchronous web scraping client using aiohttp.

Language:PythonLicense:MITStargazers:19Issues:2Issues:2

fold_to_ascii

A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the ‘Basic Latin’ Unicode block) into ASCII equivalents, if they exist.

Language:PythonLicense:MITStargazers:15Issues:0Issues:0

public-woo-api

Wordpress plugin for work with woocommerce rest api

Language:PHPLicense:GPL-2.0Stargazers:9Issues:2Issues:3

Stochastic_Dominance

Functions for portfolio optimization under second order stochastic dominance constraints

Language:PythonStargazers:2Issues:0Issues:0

tree-isomorphism-test

An isomorphism test for trees, using NetworkX's data structures (not the algorithm!).

Language:PythonLicense:GPL-3.0Stargazers:1Issues:1Issues:0

lukes-hugo-theme

My personal Hugo theme.

Language:HTMLLicense:MITStargazers:1Issues:0Issues:0