Minami-su

南栖's repositories

character_AI_open

Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.

Language: Python 135 4 7

deepspeed-grpo-qlora-vllm

A complete framework for fine-tuning LLMs with Group Relative Policy Optimization (GRPO) on 4-bit quantized models (QLoRA). It uses DeepSpeed ZeRO-3 for scalable training and integrates with a vLLM server to dynamically serve the fine-tuned LoRA adapters.

Language: Python Apache-2.0 1200

nBAT

BiLSTM-Attention Transformer for non-coding RNA coding potential prediction (code). Journal of Chemical Information and Modeling, 2024-08-09. Journal article. DOI: 10.1021/acs.jcim.4c01097

Language: Python 4 10

Minami-su

3 10

attention_sinks_autogptq

attention_sinks with AutoGPTQ support; works with all models supported by AutoGPTQ, such as Qwen and Baichuan.

Language: Python Apache-2.0 100

Infinite-Evolution-AI

Iteratively generates complex, evolutionary, networked instructions (uncensored).

Language: Python 1 10

quip-sharp-qwen

Language: Python GPL-3.0 100

ActQKV

Language: Python 000

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python 000

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook Apache-2.0 000

AutoGPTQ_cogvlm

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Language: Python MIT 000

Bert-VITS2

vits2 backbone with bert

Language: Python AGPL-3.0 000

character_AI_open_evol

Achieve 2–3× roleplay performance through LLM self-iteration with MCTS and evol instruction.

000

Emotional-ai

Emotional AI

010

InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Language: Python 000

wxbot

PC WeChat hook module, Hook WeChat / WeChat reverse engineering, WeChat bot, WeChatRobot

Language: Go 000

Evol_Mctsr

Achieve a 2–3× performance improvement through LLM self-iteration with MCTS and evol instruction.

010

hallucination-leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Apache-2.0 000

ITCMA

Language: Python 000

MemAgent

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Language: Python Apache-2.0 000

mini-omni2

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.

MIT 000

nemori

A minimalist MVP demonstrating a simple yet profound insight: aligning AI memory with human episodic memory granularity. Shows how this single principle enables simple methods to rival complex memory frameworks for conversational tasks.

Language: Python MIT 000