Mustard Bean's repositories
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
approachingalmost
Approaching (Almost) Any Machine Learning Problem
AutoKG
Code and dataset for the paper "LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities".
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
BertWithPretrained
An implementation of the BERT model and its related downstream tasks based on the PyTorch framework
camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
chatgpt-web
用 Express 和 Vue3 搭建的 ChatGPT 演示网页
chatWeb
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地部署 (Chinese LLaMA & Alpaca LLMs)
Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
CPM-Bee
百亿参数的中英文双语基座大模型
DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
generative-ai-roadmap
生成式AI的应用路线图 The roadmap of generative AI: use cases and applications
gorilla
Gorilla: An API store for LLMs
langchain-ChatGLM
langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识的 ChatGLM 问答
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
MOSS
An open-source tool-augmented conversational language model from Fudan University
NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
PreSumm
code for EMNLP 2019 paper Text Summarization with Pretrained Encoders
Prompt-Engineering-Guide
:octopus: Guides, papers, lecture, and resources for prompt engineering
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
string2string
String-to-String Algorithms for Natural Language Processing
stylegan2-ada-pytorch
StyleGAN2-ADA - Official PyTorch implementation
TOXIGEN
This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.
twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
Visionary-Vids
Multi-modal transformer approach for natural language query based joint video summarization and highlight detection
wenda-webui
专为 l15y/wenda 闻达平台设计的webui