Force1ess

Company: University of Chinese Academy of Sciences

Force1ess's starred repositories

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | Stargazers: 18304 | Issues: 116 | Issues: 507

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8881 | Issues: 75 | Issues: 1017

ai

Build AI-powered applications with React, Svelte, Vue, and Solid

Language: TypeScript | License: NOASSERTION | Stargazers: 8816 | Issues: 58 | Issues: 686

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8484 | Issues: 99 | Issues: 1232

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language: Python | License: Apache-2.0 | Stargazers: 8140 | Issues: 83 | Issues: 720

nvtop

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

Language: C | License: NOASSERTION | Stargazers: 7795 | Issues: 78 | Issues: 234

magika

Detect file content types with deep learning

Language: Rust | License: Apache-2.0 | Stargazers: 7596 | Issues: 36 | Issues: 384

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5392 | Issues: 63 | Issues: 96

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4183 | Issues: 47 | Issues: 266

MNBVC

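MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus that targets the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.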

algorithmica

A computer science textbook

Language: Jupyter Notebook | Stargazers: 3229 | Issues: 64 | Issues: 68

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

uYouEnhanced

uYouEnhanced (by @arichornlover) is an expanded version of uYou+ (made by @qnblackcat) with additional features, made mainly for non-jailbroken users!

chronos-forecasting

Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

Language: Python | License: Apache-2.0 | Stargazers: 2154 | Issues: 24 | Issues: 63

fastmoe

A fast MoE impl for PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1492 | Issues: 13 | Issues: 113

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language: Python | License: Apache-2.0 | Stargazers: 1198 | Issues: 13 | Issues: 329
Language: Python | License: Apache-2.0 | Stargazers: 1134 | Issues: 19 | Issues: 50

Marker

A Desktop App for Easily Viewing and Editing Markdown Files

Language: TypeScript | License: MIT | Stargazers: 1041 | Issues: 7 | Issues: 31

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python | License: Apache-2.0 | Stargazers: 1001 | Issues: 40 | Issues: 66

follow

[WIP] Next generation information browser

Language: TypeScript | License: GPL-3.0 | Stargazers: 983 | Issues: 21 | Issues: 12

ncmdump

Convert Netease Cloud Music ncm files to mp3/flac files.

Language: C++ | License: MIT | Stargazers: 648 | Issues: 3 | Issues: 15

LoRD

Low-rank adapter extraction for fine-tuned transformer models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 150 | Issues: 2 | Issues: 0

Python-Package-Template

An easy, reliable, fluid template for Python packages, complete with docs, testing suites, READMEs, GitHub workflows, linting, and much more

Language: Shell | License: MIT | Stargazers: 116 | Issues: 3 | Issues: 0

RSSmanX

RSSman X: a comprehensive RSS solution

open-lid-dataset

Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)

Language: Perl | License: GPL-3.0 | Stargazers: 61 | Issues: 8 | Issues: 9

UCAS-AICS

Lab assignments for the University of Chinese Academy of Sciences (UCAS) course "Intelligent Computing Systems"

Language: Python | Stargazers: 18 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 13 | Issues: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 3 | Issues: 0 | Issues: 0