eshoyuan

Yixiao Yuan's starred repositories

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | Apache-2.0 | 212000

mlx

MLX: An array framework for Apple silicon

Language: C++ | MIT | 1506800

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | MIT | 997100

Awesome-LLM-Inference

📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

GPL-3.0 | 165500

bolna

End-to-end platform for building voice first multimodal agents

Language: Python | MIT | 21500

awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

30900

pipecat

Open Source framework for voice and multimodal conversational AI

Language: Python | BSD-2-Clause | 158800

Yi-1.5

Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.

Apache-2.0 | 31100

trl

Train transformer language models with reinforcement learning.

Language: Python | Apache-2.0 | 845300

llama3-Chinese-chat

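Llama3 Chinese repository (aggregated resources: interesting weights from community and vendor fine-tunes and modified versions, plus tutorial videos and documentation for training, inference, evaluation, and deployment)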

Language: Python | 310900

axolotl

Go ahead and axolotl questions

Language: Python | Apache-2.0 | 646500

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

Apache-2.0 | 19100

llama3

The official Meta Llama 3 GitHub site

Language: Python | NOASSERTION | 2169800

Ouroboros

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting

Language: Python | Apache-2.0 | 5500

TriForce

TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Language: Python | 12900

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language: Python | BSD-3-Clause | 336200

cog

Containers for machine learning

Language: Python | Apache-2.0 | 732600

insanely-fast-whisper

Language: Jupyter Notebook | Apache-2.0 | 674600

reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

Language: TypeScript | Apache-2.0 | 504200

wordcab-transcribe

💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.

Language: Python | MIT | 16900

WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Language: Python | MIT | 138800

spring2024-lectures

Language: Python | 7200

Ice

Powerful menu bar manager for macOS

Language: Swift | MIT | 317900

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | Apache-2.0 | 3162000

kubeflow

Machine Learning Toolkit for Kubernetes

Language: TypeScript | Apache-2.0 | 1382700

RecommenderSystem

183800

SearchEngine

Principles of search engines

123600

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | 354700

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

Language: Python | MIT | 1156400

SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Language: Python | MIT | 133200