MangoFF

MangoFF

Geek Repo

Company:ZhipuAI

Github PK Tool:Github PK Tool

MangoFF's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:132385Issues:1116Issues:15738

fastai

The fastai deep learning library

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:26133Issues:608Issues:1798

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25307Issues:221Issues:458

llama2.c

Inference Llama 2 in one file of pure C

gpt-3

GPT-3: Language Models are Few-Shot Learners

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language:PythonLicense:MITStargazers:13315Issues:117Issues:1058
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:10116Issues:102Issues:206

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9683Issues:97Issues:649

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9330Issues:73Issues:1109

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:5756Issues:48Issues:968

Firefly

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

deep-learning-v2-pytorch

Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101

Language:Jupyter NotebookLicense:MITStargazers:5270Issues:175Issues:155

AI-Job-Notes

AI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)

GPT-4-LLM

Instruction Tuning with GPT-4

Language:HTMLLicense:Apache-2.0Stargazers:4171Issues:43Issues:34

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language:Jupyter NotebookLicense:MITStargazers:2525Issues:37Issues:34

Python-Offer

《剑指Offer》面试题Python实现

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2175Issues:25Issues:56

pytorch_forward_forward

Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation

Language:PythonLicense:MITStargazers:1431Issues:26Issues:12

AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

CV_Interview

I hope this repo can help you a lot!

Cpp-Primer-5th-Notes-CN

📚 《C++ Primer中文版(第5版)》笔记

ED_Lib

Implementations of edge (ED, EDColor, EDPF), line (EDLines), circle and low eccentric ellipse (EDCircles) detection algorithms.

Language:C++License:MITStargazers:390Issues:22Issues:26

cuda_hgemm

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Language:CudaLicense:MITStargazers:267Issues:4Issues:12

README

A pupil in the computer world.(Felix Fu)

Language:Jupyter NotebookStargazers:173Issues:3Issues:1

LVBench

LVBench: An Extreme Long Video Understanding Benchmark

markdown-clipper

A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.

Language:JavaScriptLicense:Apache-2.0Stargazers:22Issues:1Issues:0

matrix_multiply

Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.

Language:C++License:MITStargazers:14Issues:3Issues:0

megatronlm_dataset_autotokenizer

Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer.

Language:PythonLicense:Apache-2.0Stargazers:6Issues:2Issues:0