过拟合's repositories
aima-python
Python implementation of algorithms from Russell and Norvig's "Artificial Intelligence - A Modern Approach"
Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
bestpredicts.github.io
bestpredicts.github.io
BiLLM
Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embeddings. Compatible with 🤗 transformers.
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
dsir
DSIR large-scale data selection framework for language model training
functionary
Chat language model that can use tools and interpret the results
gpt-accelera
Simple and efficient PyTorch-native transformer training and inference (batched)
hexo-theme-3-hexo
Hexo theme: three-stage design, minimalist and convenient
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
iRingo
Unlock the full set of Apple features and integrated services
Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
LLaMA-Factory
Unify Efficient Fine-tuning of 100+ LLMs
LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
LLaVA-MORE
LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1
llm-detect-ai
1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
LS-LLaMA
A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
mlx-examples
Examples in the MLX framework
Ollamac
A macOS app for interacting with the Ollama models
quillman
A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.
Qwen1.5
Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
v2ray-agent
Xray, Tuic, hysteria2, sing-box: eight-in-one one-click script