hata8210's starred repositories

dataherald

Interact with your SQL database, Natural Language to SQL using LLMs

Language: Python | License: Apache-2.0 | Stargazers: 3227 | Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 1768 | Issues: 0

calcite

Apache Calcite

Language: Java | License: Apache-2.0 | Stargazers: 4470 | Issues: 0

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 2409 | Issues: 0

superset

Apache Superset is a Data Visualization and Data Exploration Platform

Language: TypeScript | License: Apache-2.0 | Stargazers: 60717 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 12203 | Issues: 0

Language: Python | License: MIT | Stargazers: 3976 | Issues: 0

sqlcoder

SoTA LLM for converting natural language questions to SQL queries

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3126 | Issues: 0

sqlglot

Python SQL Parser and Transpiler

Language: Python | License: MIT | Stargazers: 6059 | Issues: 0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language: Python | License: MIT | Stargazers: 1636 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 5865 | Issues: 0

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language: Python | License: Apache-2.0 | Stargazers: 8758 | Issues: 0

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language: Python | License: Apache-2.0 | Stargazers: 5806 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 12394 | Issues: 0

sql-eval

Evaluate the accuracy of LLM generated outputs

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 458 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29168 | Issues: 0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 18415 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 33898 | Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38362 | Issues: 0

MindMap

MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models

Language: Python | Stargazers: 189 | Issues: 0

Grapher

Code that implements efficient knowledge graph extraction from textual descriptions

Language: Python | License: Apache-2.0 | Stargazers: 135 | Issues: 0

chroma

the AI-native open-source embedding database

Language: Rust | License: Apache-2.0 | Stargazers: 13653 | Issues: 0

LASER

Language-Agnostic SEntence Representations

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 3565 | Issues: 0

Chinese-Word-Vectors

100+ pre-trained Chinese word vectors

Language: Python | License: Apache-2.0 | Stargazers: 11696 | Issues: 0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 12661 | Issues: 0

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python | License: AGPL-3.0 | Stargazers: 38478 | Issues: 0

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language: Python | License: Apache-2.0 | Stargazers: 30990 | Issues: 0

fast-stable-diffusion

fast-stable-diffusion + DreamBooth

Language: Python | License: MIT | Stargazers: 7411 | Issues: 0

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python | License: MIT | Stargazers: 37638 | Issues: 0

lora-scripts

LoRA & Dreambooth training scripts and GUI for diffusion models, using kohya-ss's trainer.

Language: Python | License: AGPL-3.0 | Stargazers: 4109 | Issues: 0