Yu (Hugo) Chen (hugochan)

Company: Anytime.AI

Location: Santa Clara, CA

Home Page: http://academic.hugochan.net

Twitter: @chenyu_hugo

Yu (Hugo) Chen's starred repositories

openai-cookbook

Examples and guides for using the OpenAI API

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Jupyter Notebook | License: MIT | Stargazers: 48947 | Issues: 435 | Issues: 122

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 48432 | Issues: 534 | Issues: 192

Prompt-Engineering-Guide

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

milvus

A cloud-native vector database, storage for next generation AI applications

Language: Go | License: Apache-2.0 | Stargazers: 27210 | Issues: 273 | Issues: 10772

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 24899 | Issues: 168 | Issues: 791

StableLM

StableLM: Stability AI Language Models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 15854 | Issues: 203 | Issues: 76

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Language: Python | License: MIT | Stargazers: 10301 | Issues: 151 | Issues: 153

weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.

Language: Go | License: BSD-3-Clause | Stargazers: 9708 | Issues: 109 | Issues: 2245

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 9690 | Issues: 72 | Issues: 350

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python | License: MIT | Stargazers: 8169 | Issues: 68 | Issues: 187

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python | License: Apache-2.0 | Stargazers: 8048 | Issues: 72 | Issues: 381

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 7585 | Issues: 46 | Issues: 384

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML | License: Apache-2.0 | Stargazers: 6772 | Issues: 49 | Issues: 923

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language: Python | License: Apache-2.0 | Stargazers: 5647 | Issues: 66 | Issues: 128

langfuse

🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

Language: TypeScript | License: NOASSERTION | Stargazers: 3709 | Issues: 12 | Issues: 345

giskard

🐢 Open-Source Evaluation & Testing for LLMs and ML models

Language: Python | License: Apache-2.0 | Stargazers: 3191 | Issues: 26 | Issues: 424

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Language: Python | License: Apache-2.0 | Stargazers: 2848 | Issues: 35 | Issues: 37

hugging-llm

HuggingLLM, Hugging Future.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 2532 | Issues: 40 | Issues: 10
Language: Python | License: Apache-2.0 | Stargazers: 2417 | Issues: 32 | Issues: 28

table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

Language: Python | License: MIT | Stargazers: 1870 | Issues: 37 | Issues: 134

griptape

Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.

Language: Python | License: Apache-2.0 | Stargazers: 1619 | Issues: 24 | Issues: 265

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language: Python | License: Apache-2.0 | Stargazers: 1571 | Issues: 42 | Issues: 20

llama_parse

Parse files for optimal RAG

Language: Python | License: MIT | Stargazers: 1128 | Issues: 16 | Issues: 113

lawyer-llama

Chinese legal LLaMA (LLaMA for the Chinese legal domain)

Language: Python | License: Apache-2.0 | Stargazers: 770 | Issues: 11 | Issues: 60

xgen

Salesforce open-source LLMs with 8k sequence length.

Language: Python | License: Apache-2.0 | Stargazers: 712 | Issues: 12 | Issues: 14

Yuan-2.0

Yuan 2.0 Large Language Model

Language: Python | License: NOASSERTION | Stargazers: 640 | Issues: 5 | Issues: 90