QDX (Hemeets)

Hemeets

Geek Repo

Github PK Tool:Github PK Tool

QDX's starred repositories

Language:PythonStargazers:230Issues:0Issues:0

Firefly

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:PythonStargazers:5101Issues:0Issues:0

LLMs_interview_notes

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

License:Apache-2.0Stargazers:1099Issues:0Issues:0

FinGLM

FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。

Language:HTMLStargazers:1572Issues:0Issues:0

sft_datasets

开源SFT数据集整理,随时补充

Stargazers:380Issues:0Issues:0

ChatLM-mini-Chinese

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Language:PythonLicense:Apache-2.0Stargazers:946Issues:0Issues:0

XuanYuan

轩辕:度小满中文金融对话大模型

Language:PythonStargazers:912Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35324Issues:0Issues:0

handson-ml3

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6560Issues:0Issues:0

dlwpt-code

Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.

Language:Jupyter NotebookStargazers:4571Issues:0Issues:0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:8337Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:5690Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:12188Issues:0Issues:0

NLPDataSet

记录本人整理的一些数据集

License:Apache-2.0Stargazers:952Issues:0Issues:0

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonLicense:Apache-2.0Stargazers:3892Issues:0Issues:0

KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language:PythonLicense:NOASSERTIONStargazers:992Issues:0Issues:0

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:10295Issues:0Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3036Issues:0Issues:0

Chat-Haruhi-Suzumiya

Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1656Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7665Issues:0Issues:0
License:Apache-2.0Stargazers:4671Issues:0Issues:0

JioNLP

中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com

Language:PythonLicense:Apache-2.0Stargazers:3071Issues:0Issues:0

danswer

Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.

Language:PythonLicense:MITStargazers:9650Issues:0Issues:0

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonLicense:NOASSERTIONStargazers:15574Issues:0Issues:0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookStargazers:10184Issues:0Issues:0

ContextualSP

Multiple paper open-source codes of the Microsoft Research Asia DKI group

Language:PythonLicense:MITStargazers:364Issues:0Issues:0

nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda

Language:PythonLicense:Apache-2.0Stargazers:1707Issues:0Issues:0

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language:PythonLicense:Apache-2.0Stargazers:1765Issues:0Issues:0

example-code-2e

Example code for Fluent Python, 2nd edition (O'Reilly 2022)

Language:PythonLicense:MITStargazers:3036Issues:0Issues:0

bookshelf

:books: books

License:GPL-3.0Stargazers:378Issues:0Issues:0