ZeroneBo's starred repositories

Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

License: Apache-2.0 | Stargazers: 3951 | Issues: 0
Language: Python | License: MIT | Stargazers: 334 | Issues: 0

Llama3.1-Finetuning

Full-parameter fine-tuning, LoRA fine-tuning, and QLoRA fine-tuning of Llama 3.

Language: Python | License: Apache-2.0 | Stargazers: 130 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34947 | Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 10145 | Issues: 0

research_tao

A guide to getting started in NLP research

License: MIT | Stargazers: 1949 | Issues: 0

PLMpapers

Must-read Papers on pre-trained language models.

License: MIT | Stargazers: 3319 | Issues: 0

zero_nlp

Chinese NLP solutions (large language models, data, models, training, inference)

Language: Jupyter Notebook | License: MIT | Stargazers: 2877 | Issues: 0

MegaParse

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 510 | Issues: 0

wiseflow

Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.

Language: Python | License: NOASSERTION | Stargazers: 3985 | Issues: 0

marker

Convert PDF to markdown quickly with high accuracy

Language: Python | License: GPL-3.0 | Stargazers: 16751 | Issues: 0

simpread

SimpRead (简悦): a browser extension that instantly puts you into an immersive reading mode

Language: JavaScript | License: GPL-3.0 | Stargazers: 8060 | Issues: 0

readability

A standalone version of the readability lib

Language: JavaScript | License: NOASSERTION | Stargazers: 8795 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 18370 | Issues: 0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML | License: Apache-2.0 | Stargazers: 8637 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 14 | Issues: 0
Language: Makefile | License: NOASSERTION | Stargazers: 799 | Issues: 0

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Language: Python | License: AGPL-3.0 | Stargazers: 4948 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13605 | Issues: 0

xcopa

XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning

License: CC-BY-4.0 | Stargazers: 97 | Issues: 0

simbert

A BERT model for retrieval and generation

Language: Python | License: Apache-2.0 | Stargazers: 840 | Issues: 0

corpora

Parallel corpora for the biomedical domain

Language: Python | Stargazers: 48 | Issues: 0

AAAI-2024-Papers

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. ⭐ Experience the forefront of progress in artificial intelligence with this repository!

Language: Python | License: MIT | Stargazers: 393 | Issues: 0

Awesome-LLM-RAG

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

Stargazers: 890 | Issues: 0

LangChain_ChatGLM_NNU_RAG

A small RAG project built on the LangChain framework

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language: Python | License: MIT | Stargazers: 1763 | Issues: 0

fucking-algorithm

Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported. Crack LeetCode: not only how, but also why.

Language: Markdown | Stargazers: 125242 | Issues: 0

LLM2LLM

[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Language: Python | License: MIT | Stargazers: 150 | Issues: 0

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 11289 | Issues: 0
Language: C++ | License: LGPL-3.0 | Stargazers: 3236 | Issues: 0