univerone

followers

following

stars

Peking University

Singapore

blog.univerone.com

Jiang Shanshan's starred repositories

fzf

:cherry_blossom: A command-line fuzzy finder

Language:GoMIT63100 392 2739

the-algorithm

Source code for Twitter's Recommendation Algorithm

Language:ScalaAGPL-3.061900 372 979

cs-self-learning

计算机自学指南

Language:HTMLMIT54617 317 177

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonApache-2.038481 383 1639

shellcheck

ShellCheck, a static analysis tool for shell scripts

Language:HaskellGPL-3.035875 416 2644

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonApache-2.029478 325 5415

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language:PythonBSD-3-Clause27359 231 659

ruffle

A Flash Player emulator written in Rust

Language:RustNOASSERTION15303 163 10786

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Language:PythonMPL-2.013351 137 1160

memray

Memray is a memory profiler for Python

Language:PythonApache-2.013005 59 184

FreshRSS

A free, self-hostable news aggregator…

Language:PHPAGPL-3.09134 105 3237

pypdf

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Language:PythonNOASSERTION7940 149 1107

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language:PythonMIT6999 59 162

nopecha-extension

Automated CAPTCHA solver for your browser. Works with Selenium, Puppeteer, Playwright, and more.

Language:JavaScriptMIT6096 12 50

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookMIT5506 29 28

CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonApache-2.04860 79 74

coremltools

Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.

Language:PythonBSD-3-Clause4278 123 1400

LoveIt

❤️A clean, elegant but advanced blog theme for Hugo 一个简洁、优雅且高效的 Hugo 主题

Language:JavaScriptMIT3354 30 502

parallel-hashmap

A family of header-only, very fast and memory-friendly hashmap and btree containers.

Language:C++Apache-2.02442 63 180

gptcommit

A git prepare-commit-msg hook for authoring commit messages with GPT-3.

Language:RustMIT2316 8 65

Algorithm-Practice-in-Industry

搜索、推荐、广告、用增等工业界实践文章收集（来源：知乎、Datafuntalk、技术公众号）

Language:PythonBSD-2-Clause2008 58 53

torchrec

Pytorch domain library for recommendation systems

Language:PythonBSD-3-Clause1837 29 157

how-to-optimize-gemm

Language:C1702 45 17

cnpy

library to read/write .npy and .npz files in C/C++

Language:C++MIT1294 29 64

fedlearner

A multi-party collaborative machine learning framework

Language:PythonApache-2.0890 28 31

emhash

Fast and memory efficient c++ flat hash map/set

Language:C++MIT445 13 29

FBTT-Embedding

This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation is faster than the state-of-the-art implementations. Existing the state-of-the-art library also decompresses the whole embedding tables on the fly therefore they do not provide memory reduction during runtime of the training. Our library decompresses only the requested rows therefore can provide 10,000 times memory footprint reduction per embedding table. The library also includes a software cache to store a portion of the entries in the table in decompressed format for faster lookup and process.

Language:CudaMIT192 11 11

mulle-concurrent

📶 A lock- and wait-free hashtable (and an array too)

Language:CNOASSERTION105 7 1

gtl

Greg's Template Library of useful classes.

Language:C++Apache-2.0104 9 9

float_compr_tester

Testing various libraries/approaches for compressing floating point data

Language:HTML14 30