reborm's starred repositories

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:39702Issues:431Issues:9024

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:17389Issues:159Issues:277

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:10117Issues:93Issues:308

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:9105Issues:60Issues:542

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:7147Issues:50Issues:976

ChineseNLPCorpus

中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。

lac

百度NLP:分词,词性标注,命名实体识别,词重要性

Language:C++License:Apache-2.0Stargazers:3781Issues:106Issues:247

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:3423Issues:37Issues:223

TextRank4ZH

:deciduous_tree:从中文文本中自动提取关键词和摘要

Language:PythonLicense:MITStargazers:3231Issues:102Issues:34

sqlcoder

SoTA LLM for converting natural language questions to SQL queries

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2993Issues:32Issues:98

awesome-streamlit

The purpose of this project is to share knowledge on how awesome Streamlit is and can be

Language:HTMLLicense:CC-BY-SA-4.0Stargazers:1982Issues:45Issues:30

FinGLM

FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。

Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

LLMs_interview_notes

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language:PythonLicense:Apache-2.0Stargazers:1062Issues:27Issues:79

llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

spRAG

RAG framework for challenging queries over dense unstructured data

Language:PythonLicense:MITStargazers:455Issues:6Issues:8

Finetune-ChatGLM2-6B

ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。

Language:PythonLicense:Apache-2.0Stargazers:391Issues:8Issues:22

django-vue-tutorial

用 django-rest-framework 和 vue 搭建前后端分离的个人博客

Prompt-Engineering-Guide-zh-CN

🐙 关于提示词工程(prompt)的指南、论文、讲座、笔记本和资源大全(自动持续更新)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:377Issues:5Issues:3

RAG-Retrieval

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT,Cross Encoder

Language:PythonLicense:MITStargazers:311Issues:6Issues:13

MedQA-ChatGLM

🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调,我们的眼光不止于医疗问答

AlignBench

多维度中文对齐评测基准 | Benchmarking Chinese Alignment of LLMs

evol-teacher

Open Source WizardCoder Dataset

Language:PythonLicense:Apache-2.0Stargazers:138Issues:2Issues:4

Awesome-KBQA

Paper list of KBQA

License:MITStargazers:59Issues:2Issues:0

text-to-sql-wizardcoder

Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom Spider training dataset. The resultant model, achieves 61% execution accuracy, incorporating database context for validation.

Language:Jupyter NotebookStargazers:41Issues:6Issues:1

LLMs_train

一套代码指令微调大模型

textrank_summarization

用textrank算法做中文新闻自动摘要

Language:Jupyter NotebookStargazers:17Issues:0Issues:0

CodeLlama-LangChain-MySql

Prototype sample code demonstrating how we can leverage CodeLlama locally and connect it to MySQL using LangChain

Language:Jupyter NotebookLicense:MITStargazers:16Issues:0Issues:0