QDX (Hemeets)

Hemeets

Geek Repo

Github PK Tool:Github PK Tool

QDX's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35519Issues:346Issues:1717

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonLicense:NOASSERTIONStargazers:15602Issues:135Issues:615

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:12381Issues:95Issues:1024

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:10464Issues:94Issues:322

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10313Issues:82Issues:291

danswer

Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.

Language:PythonLicense:NOASSERTIONStargazers:9733Issues:97Issues:400

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:8417Issues:129Issues:426

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7696Issues:107Issues:438

handson-ml3

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6679Issues:118Issues:94

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:5845Issues:36Issues:832

Firefly

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

dlwpt-code

Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.

Language:Jupyter NotebookStargazers:4587Issues:110Issues:108

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonLicense:Apache-2.0Stargazers:3910Issues:57Issues:19

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3125Issues:21Issues:397

JioNLP

中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com

Language:PythonLicense:Apache-2.0Stargazers:3096Issues:34Issues:193

example-code-2e

Example code for Fluent Python, 2nd edition (O'Reilly 2022)

Language:PythonLicense:MITStargazers:3059Issues:68Issues:15

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language:PythonLicense:Apache-2.0Stargazers:1777Issues:17Issues:104

nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda

Language:PythonLicense:Apache-2.0Stargazers:1715Issues:9Issues:31

Chat-Haruhi-Suzumiya

Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1677Issues:15Issues:59

FinGLM

FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。

LLMs_interview_notes

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language:PythonLicense:NOASSERTIONStargazers:1012Issues:20Issues:41

ChatLM-mini-Chinese

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Language:PythonLicense:Apache-2.0Stargazers:965Issues:12Issues:43

NLPDataSet

记录本人整理的一些数据集

XuanYuan

轩辕:度小满中文金融对话大模型

sft_datasets

开源SFT数据集整理,随时补充

bookshelf

:books: books

License:GPL-3.0Stargazers:378Issues:0Issues:0

ContextualSP

Multiple paper open-source codes of the Microsoft Research Asia DKI group

Language:PythonLicense:MITStargazers:366Issues:16Issues:32