hugochan

Yu (Hugo) Chen's starred repositories

openai-cookbook

Examples and guides for using the OpenAI API

Language: MDX | License: MIT | 56457 855 382

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Jupyter Notebook | License: MIT | 48947 435 122

grok-1

Grok open release

Language: Python | License: Apache-2.0 | 48432 534 192

Prompt-Engineering-Guide

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

Language: MDX | License: MIT | 44455 498 126

milvus

A cloud-native vector database, storage for next generation AI applications

Language: Go | License: Apache-2.0 | 27210 273 10772

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | 24899 168 791

babyagi

Language: Python | License: MIT | 19363 292 143

StableLM

StableLM: Stability AI Language Models

Language: Jupyter Notebook | License: Apache-2.0 | 15854 203 76

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project.

Language: Python | License: MIT | 10301 151 153

weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.

Language: Go | License: BSD-3-Clause | 9708 109 2245

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | 9690 72 350

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python | License: MIT | 8169 68 187

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python | License: Apache-2.0 | 8048 72 381

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | 7585 46 384

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

License: Apache-2.0 | 7213 117 90

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML | License: Apache-2.0 | 6772 49 923

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language: Python | License: Apache-2.0 | 5647 66 128

langfuse

🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

Language: TypeScript | License: NOASSERTION | 3709 12 345

giskard

🐢 Open-Source Evaluation & Testing for LLMs and ML models

Language: Python | License: Apache-2.0 | 3191 26 424

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Language: Python | License: Apache-2.0 | 2848 35 37

hugging-llm

HuggingLLM, Hugging Future.

Language: Jupyter Notebook | License: NOASSERTION | 2532 40 10

lamini

Language: Python | License: Apache-2.0 | 2417 32 28

table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

Language: Python | License: MIT | 1870 37 134