Beast code in Giters

zwf's starred repositories

sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | 1126800

dawn

Native WebGPU implementation. Mirror of https://dawn.googlesource.com/dawn

Language: C++ | License: BSD-3-Clause | 36800

LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Language: Python | License: Apache-2.0 | 211200

moshi

Language: Python | License: Apache-2.0 | 593000

chai-lab

Chai-1, a state-of-the-art model for biomolecular structure prediction

Language: Python | License: NOASSERTION | 105700

heic2any

Converting HEIC/HEIF image formats to PNG/GIF/JPEG in the browser

Language: TypeScript | License: MIT | 62800

cog-flux-dev-inpainting

🎨 Fill in masked parts of images with FLUX.1-dev 🖌️

Language: Python | License: MIT | 900

MakerSkillTree

A repository of Maker Skill Trees and templates to make your own.

Language: Jinja277100

instant

Instant is a modern Firebase. We make you productive by giving your frontend a real-time database.

Language: Clojure | License: Apache-2.0 | 604800

email-reply-parser

Email reply parser library for Python

Language: Python | License: MIT | 49300

revideo

Create Videos with Code

Language: TypeScript | License: MIT | 240600

matmulfreellm

Implementation for MatMul-free LM.

Language: Python | License: Apache-2.0 | 288800

ComfyUI

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Language: Python | License: GPL-3.0 | 5222600

LLM-Merging

LLM-Merging: Building LLMs Efficiently through Merging

Language: Jupyter Notebook | 16900

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | 3117800

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language: Python | License: NOASSERTION | 214700

jacob

Just Another Coding Bot

Language: TypeScript | License: Apache-2.0 | 10900

loopvlm

Run PaliGemma in real time

Language: Python | License: BSD-3-Clause | 12200

morphic

An AI-powered search engine with a generative UI

Language: TypeScript | License: Apache-2.0 | 603400

schedule_free

Schedule-Free Optimization in PyTorch

Language: Python | License: Apache-2.0 | 183300

GigaSpeech

Large, modern dataset for speech recognition

Language: Shell | License: Apache-2.0 | 63200

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook | License: NOASSERTION | 751600

GPT-SoVITS

One minute of voice data is enough to train a good TTS model (few-shot voice cloning).

Language: Python | License: MIT | 3345200

OpenHands

🙌 OpenHands: Code Less, Make More

Language: Python | License: MIT | 3261300

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Jupyter Notebook | License: Apache-2.0 | 762000

aloha

Language: Python | License: MIT | 143100

skyvern

Automate browser-based workflows with LLMs and Computer Vision

Language: Python | License: AGPL-3.0 | 587600

moondream

Tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | 496300

litellm

Python SDK and proxy server (LLM gateway) for calling 100+ LLM APIs in the OpenAI format: Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq

Language: Python | License: NOASSERTION | 1262700

OpenGrip

A simple, low-cost robotics gripper

6400