bodhibudd

bodhibudd

Geek Repo

Github PK Tool:Github PK Tool

bodhibudd's starred repositories

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Language:PythonLicense:Apache-2.0Stargazers:2560Issues:0Issues:0

LaTeX_OCR

:gem: 数学公式识别 Math Formula OCR

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:476Issues:0Issues:0

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:14257Issues:0Issues:0

open-parse

Improved file parsing for LLM’s

Language:PythonLicense:MITStargazers:1974Issues:0Issues:0

pdf-struct

Logical structure analysis for visually structured documents

Language:PythonLicense:Apache-2.0Stargazers:70Issues:0Issues:0

fastRAG

Efficient Retrieval Augmentation and Generation Framework

Language:PythonLicense:Apache-2.0Stargazers:1038Issues:0Issues:0

natural-instructions

Expanding natural instructions

Language:PythonLicense:Apache-2.0Stargazers:915Issues:0Issues:0

pdf2png

将pdf转换成png图片

Language:GoStargazers:3Issues:0Issues:0

MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫

Language:PythonLicense:NOASSERTIONStargazers:14423Issues:0Issues:0
Language:PythonStargazers:40Issues:0Issues:0

MoZi

首个全参数训练的知识产权大模型 MoZi (墨子)

Language:PythonLicense:Apache-2.0Stargazers:9Issues:0Issues:0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonLicense:MITStargazers:27172Issues:0Issues:0

DISC-FinLLM

DISC-FinLLM,中文金融大语言模型(LLM),旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financial consulting services in financial scenarios.

Language:PythonLicense:Apache-2.0Stargazers:465Issues:0Issues:0

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Language:PythonLicense:MITStargazers:4373Issues:0Issues:0

ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Language:PythonStargazers:2538Issues:0Issues:0

Chinese-Mixtral-8x7B

中文Mixtral-8x7B(Chinese-Mixtral-8x7B)

Language:PythonLicense:Apache-2.0Stargazers:628Issues:0Issues:0

LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

Language:PythonLicense:Apache-2.0Stargazers:540Issues:0Issues:0

GoGPT

GoGPT:基于Llama/Llama 2训练的中英文增强大模型|Chinese-Llama2

Language:PythonStargazers:76Issues:0Issues:0

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language:PythonLicense:Apache-2.0Stargazers:4003Issues:0Issues:0

Bert-Chinese-Text-Classification-Pytorch

使用Bert,ERNIE,进行中文文本分类

Language:PythonLicense:MITStargazers:3801Issues:0Issues:0

EditScorer

The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"

Language:PythonStargazers:17Issues:0Issues:0

LaWGPT

🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型

Language:PythonLicense:GPL-3.0Stargazers:5695Issues:0Issues:0

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonLicense:Apache-2.0Stargazers:3877Issues:0Issues:0

memory-efficient-attention-pytorch

Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"

Language:PythonLicense:MITStargazers:344Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:7859Issues:0Issues:0

Firefly

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:PythonStargazers:5015Issues:0Issues:0

bytepiece

更纯粹、更高压缩率的Tokenizer

Language:PythonLicense:Apache-2.0Stargazers:408Issues:0Issues:0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:24090Issues:0Issues:0

DISC-LawLLM

DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services

Language:PythonLicense:Apache-2.0Stargazers:458Issues:0Issues:0
Language:PythonStargazers:17Issues:0Issues:0