imfht

imfht

Geek Repo

Company:西云数据

Location:Beijing, China

Home Page:https://blog.imfht.com/

Github PK Tool:Github PK Tool

imfht's starred repositories

LlamaGym

Fine-tune LLM agents with online reinforcement learning

Language:PythonLicense:MITStargazers:964Issues:0Issues:0

alpaca-chinese-dataset

alpaca中文指令微调数据集

Stargazers:390Issues:0Issues:0

TigerBot

TigerBot: A multi-language multi-task LLM

Language:PythonLicense:Apache-2.0Stargazers:2229Issues:0Issues:0

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4306Issues:0Issues:0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:42592Issues:0Issues:0

pyllama

LLaMA: Open and Efficient Foundation Language Models

Language:PythonLicense:GPL-3.0Stargazers:2800Issues:0Issues:0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookStargazers:11389Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:55187Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

Language:CLicense:MITStargazers:17049Issues:0Issues:0

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonLicense:NOASSERTIONStargazers:22146Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130874Issues:0Issues:0

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

License:MITStargazers:3317Issues:0Issues:0

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2545Issues:0Issues:0
Language:PythonStargazers:67Issues:0Issues:0

parquet2json

A command-line tool for converting Parquet to newline-delimited JSON

Language:RustLicense:MITStargazers:28Issues:0Issues:0

SecGPT

SecGPT网络安全大模型

Language:PythonLicense:Apache-2.0Stargazers:1681Issues:0Issues:0

DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Language:PythonLicense:MITStargazers:3342Issues:0Issues:0

Machine-Mindset

An MBTI Exploration of Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:441Issues:0Issues:0

AutoPrompt

A framework for prompt tuning using Intent-based Prompt Calibration

Language:PythonLicense:Apache-2.0Stargazers:1986Issues:0Issues:0

chathub

All-in-one chatbot client

Language:TypeScriptLicense:GPL-3.0Stargazers:9889Issues:0Issues:0

NL2SQL

Text2SQL 语义解析数据集、解决方案、paper资源整合项目

Stargazers:1123Issues:0Issues:0

GPU-Benchmarks-on-LLM-Inference

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Language:Jupyter NotebookStargazers:749Issues:0Issues:0

prompt-engineering-for-developers

吴恩达《ChatGPT Prompt Engineering for Developers》课程中文版

Language:Jupyter NotebookStargazers:68Issues:0Issues:0

Text-Auto-Summarization

文本自动摘要

Language:Jupyter NotebookLicense:MITStargazers:87Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:36Issues:0Issues:0

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language:PythonLicense:MITStargazers:13018Issues:0Issues:0

magika

Detect file content types with deep learning

Language:RustLicense:Apache-2.0Stargazers:7665Issues:0Issues:0

gptscript

Build AI assistants that interact with your systems

Language:GoLicense:Apache-2.0Stargazers:2876Issues:0Issues:0

FastUI

Build better UIs faster.

Language:PythonLicense:MITStargazers:8009Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7780Issues:0Issues:0