Jiang Shanshan's starred repositories
the-algorithm
Source code for Twitter's Recommendation Algorithm
cs-self-learning
计算机自学指南
ColossalAI
Making large AI models cheaper, faster and more accessible
shellcheck
ShellCheck, a static analysis tool for shell scripts
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
nopecha-extension
Automated CAPTCHA solver for your browser. Works with Selenium, Puppeteer, Playwright, and more.
GPU-Puzzles
Solve puzzles. Learn CUDA.
coremltools
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
parallel-hashmap
A family of header-only, very fast and memory-friendly hashmap and btree containers.
Algorithm-Practice-in-Industry
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
fedlearner
A multi-party collaborative machine learning framework
FBTT-Embedding
This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation is faster than the state-of-the-art implementations. Existing the state-of-the-art library also decompresses the whole embedding tables on the fly therefore they do not provide memory reduction during runtime of the training. Our library decompresses only the requested rows therefore can provide 10,000 times memory footprint reduction per embedding table. The library also includes a software cache to store a portion of the entries in the table in decompressed format for faster lookup and process.
mulle-concurrent
📶 A lock- and wait-free hashtable (and an array too)
float_compr_tester
Testing various libraries/approaches for compressing floating point data