lb410's starred repositories

linux

Linux kernel source tree

Language:CLicense:NOASSERTIONStargazers:179413Issues:7931Issues:0

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

Language:PythonLicense:UnlicenseStargazers:131598Issues:2203Issues:26604

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

llama.cpp

LLM inference in C/C++

jieba

结巴中文分词

Language:PythonLicense:MITStargazers:33152Issues:1279Issues:851

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31693Issues:201Issues:4905

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:23606Issues:230Issues:136

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptLicense:Apache-2.0Stargazers:18397Issues:174Issues:2223

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonLicense:MITStargazers:17691Issues:111Issues:465

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:16210Issues:112Issues:840

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language:PythonLicense:MITStargazers:13392Issues:116Issues:1070

Chinese-Word-Vectors

100+ Chinese Word Vectors 上百种预训练中文词向量

Language:PythonLicense:Apache-2.0Stargazers:11779Issues:285Issues:167

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11727Issues:148Issues:816

doccano

Open source annotation tool for machine learning practitioners.

Language:PythonLicense:MITStargazers:9449Issues:132Issues:1523

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language:TypeScriptLicense:Apache-2.0Stargazers:7742Issues:51Issues:65

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7699Issues:108Issues:156

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

snownlp

Python library for processing Chinese text

Language:PythonLicense:MITStargazers:6403Issues:350Issues:108

OpenNRE

An Open-Source Package for Neural Relation Extraction (NRE)

Language:PythonLicense:MITStargazers:4320Issues:119Issues:367

llama3-Chinese-chat

Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

OpenKE

An Open-Source Package for Knowledge Embedding (KE)

KeyBERT

Minimal keyword extraction with BERT

Language:PythonLicense:MITStargazers:3459Issues:32Issues:201

DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Language:PythonLicense:MITStargazers:3445Issues:43Issues:559

brat

brat rapid annotation tool (brat) - for all your textual annotation needs

Language:PythonLicense:NOASSERTIONStargazers:1816Issues:80Issues:1349

cnSchema

开放中文知识图谱的schema

MarkTool

DoTAT 是一款基于web、面向领域的通用文本标注工具,支持大规模实体标注、关系标注、事件标注、文本分类、基于字典匹配和正则匹配的自动标注以及用于实现归一化的标准名标注,同时也支持迭代标注、嵌套实体标注和嵌套事件标注。标注规范可自定义且同类型任务中可“一次创建多次复用”。通过分级实体集合扩大了实体类型的规模,并设计了全新高效的标注方式,提升了用户体验和标注效率。此外,本工具增加了审核环节,可对多人的标注结果进行一致性检验、自动合并和手动调整,提高了标注结果的准确率。

Language:VueLicense:Apache-2.0Stargazers:589Issues:13Issues:18

Enterprise-WeChat-GPTbot

基于企微gpt知识库的bot机器人,能够自动回复企业微信中收到的消息。这个机器人能够处理私聊和群聊,还可以记住与用户的聊天内容,从而做出更加贴合上下文的回应。此外,您还可以设置白名单来控制机器人与哪些用户或群组交互。

ZhKeyBERT

Minimal keyword extraction with BERT

Language:PythonLicense:MITStargazers:72Issues:1Issues:0

incubator-hugegraph-ai

The integration of HugeGraph with artificial intelligence

Language:PythonLicense:Apache-2.0Stargazers:46Issues:14Issues:16