bigFaceZhang

bigFaceZhang

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

bigFaceZhang's starred repositories

Bend

A massively parallel, high-level programming language

Language:RustLicense:Apache-2.0Stargazers:15934Issues:0Issues:0

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4195Issues:0Issues:0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1537Issues:0Issues:0

llm-export

llm-export can export llm model to onnx.

Language:PythonStargazers:141Issues:0Issues:0

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:3432Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:11804Issues:0Issues:0

gemma

Open weights LLM from Google DeepMind.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2145Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:53783Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33266Issues:0Issues:0

act-plus-plus

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Language:PythonLicense:MITStargazers:2741Issues:0Issues:0

Awesome-Chinese-Stable-Diffusion

中文文生图stable diffsion模型集合

Stargazers:133Issues:0Issues:0

libhv

🔥 比libevent/libuv/asio更易用的网络库。A c/c++ network library for developing TCP/UDP/SSL/HTTP/WebSocket/MQTT client/server.

Language:CLicense:BSD-3-ClauseStargazers:6451Issues:0Issues:0

cmockery

A lightweight library to simplify and generalize the process of writing unit tests for C applications.

Language:ShellLicense:Apache-2.0Stargazers:1022Issues:0Issues:0

design-patterns-cpp

C++ Design Patterns

Language:C++License:MITStargazers:4007Issues:0Issues:0

CPlusPlusThings

C++那些事

Language:C++Stargazers:37741Issues:0Issues:0

C

Collection of various algorithms in mathematics, machine learning, computer science, physics, etc implemented in C for educational purposes.

Language:CLicense:GPL-3.0Stargazers:18300Issues:0Issues:0

C-Plus-Plus

Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.

Language:C++License:MITStargazers:29437Issues:0Issues:0
License:Apache-2.0Stargazers:53Issues:0Issues:0

GPT-4-LLM

Instruction Tuning with GPT-4

Language:HTMLLicense:Apache-2.0Stargazers:4036Issues:0Issues:0

pCLUE

pCLUE: 1000000+多任务提示学习数据集

Language:Jupyter NotebookStargazers:452Issues:0Issues:0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:23803Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:11452Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)

Language:PythonLicense:Apache-2.0Stargazers:1484Issues:0Issues:0

Finetune-ChatGLM2-6B

ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。

Language:PythonLicense:Apache-2.0Stargazers:391Issues:0Issues:0

Firefly

Firefly: 大模型训练工具,支持训练Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:PythonStargazers:4987Issues:0Issues:0

CodeGeeX2

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Language:PythonLicense:Apache-2.0Stargazers:7386Issues:0Issues:0

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

License:MITStargazers:889Issues:0Issues:0

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

License:MITStargazers:3110Issues:0Issues:0

BlueLM

BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab

Language:PythonLicense:NOASSERTIONStargazers:801Issues:0Issues:0