过拟合 (bestpredicts)

bestpredicts

Geek Repo

Company:AntGroup

Location:China

Home Page:https://www.zhihu.com/people/YongDeng0101

Twitter:@bestpredict01

Github PK Tool:Github PK Tool

过拟合's repositories

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:0Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:0Issues:0Issues:0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

License:MITStargazers:0Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!

License:Apache-2.0Stargazers:0Issues:0Issues:0

dsir

DSIR large-scale data selection framework for language model training

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

easy-rl

强化学习中文教程(蘑菇书),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

gpt-accelera

Simple and efficient pytorch-native transformer training and inference (batched)

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

gsm8k-ScRel

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Stargazers:0Issues:0Issues:0

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

License:MITStargazers:0Issues:0Issues:0

LLaMA-Factory

Unify Efficient Fine-tuning of 100+ LLMs

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLaMA-Pro

[ACL 2024] Progressive LLaMA with Block Expansion.

License:Apache-2.0Stargazers:0Issues:0Issues:0

llm-detect-ai

1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition

License:MITStargazers:0Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM from scratch, step by step

License:NOASSERTIONStargazers:0Issues:0Issues:0

LS-LLaMA

A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning

License:MITStargazers:0Issues:0Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

License:MITStargazers:0Issues:0Issues:0

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

License:Apache-2.0Stargazers:0Issues:0Issues:0

mlx-examples

Examples in the MLX framework

License:MITStargazers:0Issues:0Issues:0

Ollamac

A macOS app for interacting with the Ollama models

License:MITStargazers:0Issues:0Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (LLaMA, LLaMa2, ChatGLM2, ChatGPT, Claude, etc) over 50+ datasets.

License:Apache-2.0Stargazers:0Issues:0Issues:0

OralCounsellor

一个基于大模型的口语对话顾问

License:MITStargazers:0Issues:0Issues:0

PPOxFamily

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

License:Apache-2.0Stargazers:0Issues:0Issues:0

quillman

A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.

License:MITStargazers:0Issues:0Issues:0

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

Stargazers:0Issues:0Issues:0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

License:Apache-2.0Stargazers:0Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

License:Apache-2.0Stargazers:0Issues:0Issues:0

TPU-Alignment

Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free

License:Apache-2.0Stargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0