Jiang Shanshan (univerone)

Company: Peking University

Location: Singapore

Home Page: blog.univerone.com

Jiang Shanshan's starred repositories

fzf

🌸 A command-line fuzzy finder

the-algorithm

Source code for Twitter's Recommendation Algorithm

Language: Scala | License: AGPL-3.0 | Stargazers: 61900 | Issues: 372 | Issues: 979

cs-self-learning

A self-study guide to computer science

Language: HTML | License: MIT | Stargazers: 54617 | Issues: 317 | Issues: 177

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38481 | Issues: 383 | Issues: 1639

shellcheck

ShellCheck, a static analysis tool for shell scripts

Language: Haskell | License: GPL-3.0 | Stargazers: 35875 | Issues: 416 | Issues: 2644

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language: Python | License: Apache-2.0 | Stargazers: 29478 | Issues: 325 | Issues: 5415

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language: Python | License: BSD-3-Clause | Stargazers: 27359 | Issues: 231 | Issues: 659

ruffle

A Flash Player emulator written in Rust

Language: Rust | License: NOASSERTION | Stargazers: 15303 | Issues: 163 | Issues: 10786

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Language: Python | License: MPL-2.0 | Stargazers: 13351 | Issues: 137 | Issues: 1160

memray

Memray is a memory profiler for Python

Language: Python | License: Apache-2.0 | Stargazers: 13005 | Issues: 59 | Issues: 184

FreshRSS

A free, self-hostable news aggregator…

Language: PHP | License: AGPL-3.0 | Stargazers: 9134 | Issues: 105 | Issues: 3237

pypdf

A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Language: Python | License: NOASSERTION | Stargazers: 7940 | Issues: 149 | Issues: 1107

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language: Python | License: MIT | Stargazers: 6999 | Issues: 59 | Issues: 162

nopecha-extension

Automated CAPTCHA solver for your browser. Works with Selenium, Puppeteer, Playwright, and more.

Language: JavaScript | License: MIT | Stargazers: 6096 | Issues: 12 | Issues: 50

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language: Jupyter Notebook | License: MIT | Stargazers: 5506 | Issues: 29 | Issues: 28

CodeGen

CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language: Python | License: Apache-2.0 | Stargazers: 4860 | Issues: 79 | Issues: 74

coremltools

Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.

Language: Python | License: BSD-3-Clause | Stargazers: 4278 | Issues: 123 | Issues: 1400

LoveIt

❤️ A clean, elegant but advanced blog theme for Hugo

Language: JavaScript | License: MIT | Stargazers: 3354 | Issues: 30 | Issues: 502

parallel-hashmap

A family of header-only, very fast and memory-friendly hashmap and btree containers.

Language: C++ | License: Apache-2.0 | Stargazers: 2442 | Issues: 63 | Issues: 180

gptcommit

A git prepare-commit-msg hook for authoring commit messages with GPT-3.

Language: Rust | License: MIT | Stargazers: 2316 | Issues: 8 | Issues: 65

Algorithm-Practice-in-Industry

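A collection of industry practice articles on search, recommendation, advertising, user growth, and more (sources: Zhihu, Datafuntalk, technical WeChat official accounts)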

Language: Python | License: BSD-2-Clause | Stargazers: 2008 | Issues: 58 | Issues: 53

torchrec

PyTorch domain library for recommendation systems

Language: Python | License: BSD-3-Clause | Stargazers: 1837 | Issues: 29 | Issues: 157

cnpy

Library to read/write .npy and .npz files in C/C++

Language: C++ | License: MIT | Stargazers: 1294 | Issues: 29 | Issues: 64

fedlearner

A multi-party collaborative machine learning framework

Language: Python | License: Apache-2.0 | Stargazers: 890 | Issues: 28 | Issues: 31

emhash

Fast and memory-efficient C++ flat hash map/set

Language: C++ | License: MIT | Stargazers: 445 | Issues: 13 | Issues: 29

FBTT-Embedding

This is a Tensor Train based compression library for compressing the sparse embedding tables used in large-scale machine learning models, such as recommendation and natural language processing models. We showed that this library can reduce the total model size by up to 100x in Facebook's open-sourced DLRM model while achieving the same model quality. Our implementation is faster than the state-of-the-art implementations. Existing state-of-the-art libraries also decompress the whole embedding tables on the fly, so they do not reduce memory use during training. Our library decompresses only the requested rows and can therefore provide a 10,000x memory footprint reduction per embedding table. The library also includes a software cache that stores a portion of the table entries in decompressed form for faster lookup and processing.

Language: Cuda | License: MIT | Stargazers: 192 | Issues: 11 | Issues: 11

mulle-concurrent

📶 A lock- and wait-free hashtable (and an array too)

Language: C | License: NOASSERTION | Stargazers: 105 | Issues: 7 | Issues: 1

gtl

Greg's Template Library of useful classes.

Language: C++ | License: Apache-2.0 | Stargazers: 104 | Issues: 9 | Issues: 9

float_compr_tester

Testing various libraries/approaches for compressing floating point data

Language: HTML | Stargazers: 14 | Issues: 3 | Issues: 0