Yunyu Lin's starred repositories
ColossalAI
Making large AI models cheaper, faster and more accessible
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
crawlee
Crawlee: A web scraping and browser automation library for Node.js to build reliable crawlers in JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Supports both headful and headless modes, with proxy rotation.
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
deepdoctection
A Repo For Document AI
parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
info-nce-pytorch
PyTorch implementation of the InfoNCE loss for self-supervised learning.
react-nestable
Drag & drop hierarchical list made as a React component
charformer-pytorch
Implementation of the GBST block from the Charformer paper, in PyTorch
synchronicity
Synchronicity lets you interoperate with asynchronous Python APIs.