JoeYee007's starred repositories

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9722Issues:0Issues:0

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:17816Issues:0Issues:0

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

Stargazers:9045Issues:0Issues:0

cramming

Cramming the training of a (BERT-type) language model into limited compute.

Language:PythonLicense:MITStargazers:1263Issues:0Issues:0

CODA-Prompt

PyTorch code for the CVPR'23 paper: "CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning"

Language:PythonLicense:MITStargazers:118Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:136248Issues:0Issues:0

crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

Language:TypeScriptLicense:Apache-2.0Stargazers:13449Issues:0Issues:0

ecs-deploy

Powerful CLI tool to simplify Amazon ECS deployments, rollbacks & scaling

Language:PythonLicense:NOASSERTIONStargazers:845Issues:0Issues:0

rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Language:PythonLicense:Apache-2.0Stargazers:18332Issues:0Issues:0

ML-From-Scratch

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

Language:PythonLicense:MITStargazers:23606Issues:0Issues:0

ILearnDeepLearning.py

This repository contains small projects related to Neural Networks and Deep Learning in general. Subjects are closely linekd with articles I publish on Medium. I encourage you both to read as well as to check how the code works in the action.

Language:Jupyter NotebookLicense:MITStargazers:1329Issues:0Issues:0

cheatsheets-ai

Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5

License:MITStargazers:15053Issues:0Issues:0

Chinese-Annotator

Annotator for Chinese Text Corpus (UNDER DEVELOPMENT) 中文文本标注工具

Language:JavaScriptLicense:Apache-2.0Stargazers:1447Issues:0Issues:0
Language:Jupyter NotebookLicense:MIT-0Stargazers:4Issues:0Issues:0
Language:Jupyter NotebookLicense:MIT-0Stargazers:13Issues:0Issues:0

amazon-sagemaker-bert-classify-pytorch

This sample show you how to train BERT on Amazon Sagemaker using Spot instances

Language:PythonLicense:MIT-0Stargazers:31Issues:0Issues:0

django-haystack

Modular search for Django

Language:PythonLicense:NOASSERTIONStargazers:3576Issues:0Issues:0

lxSpider

爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、各种指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书、大众点评、推特、脉脉、知乎》

Language:PythonLicense:GPL-3.0Stargazers:1597Issues:0Issues:0

awesome-knowledge-graph

整理知识图谱相关学习资料

Stargazers:4456Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++License:MITStargazers:29558Issues:0Issues:0

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:14680Issues:0Issues:0

WWW2015_code

This is the code for my WWW2015 paper

Language:MatlabStargazers:5Issues:0Issues:0

awesome-public-datasets

A topic-centric list of HQ open datasets.

License:MITStargazers:59395Issues:0Issues:0

awesome

😎 Awesome lists about all kinds of interesting topics

License:CC0-1.0Stargazers:311539Issues:0Issues:0

sematch

semantic similarity framework for knowledge graph

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:426Issues:0Issues:0

chinesetokenization

chinesetokenization

Language:PythonStargazers:13Issues:0Issues:0