ZihengWu (wuziheng)

wuziheng

Geek Repo

Company:Alibaba

Location:Beijing

Github PK Tool:Github PK Tool

ZihengWu's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49208Issues:561Issues:202

professional-programming

A collection of learning resources for curious software engineers

Language:PythonLicense:MITStargazers:45985Issues:985Issues:28

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:34622Issues:309Issues:877

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:23346Issues:265Issues:64

MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫

Language:PythonLicense:NOASSERTIONStargazers:15426Issues:88Issues:245

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonLicense:NOASSERTIONStargazers:9912Issues:131Issues:48

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8803Issues:82Issues:36

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonLicense:GPL-3.0Stargazers:8653Issues:57Issues:480

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8480Issues:95Issues:374

Firefly

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5190Issues:38Issues:37

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3882Issues:114Issues:73

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language:PythonLicense:NOASSERTIONStargazers:2566Issues:37Issues:50

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language:PythonLicense:Apache-2.0Stargazers:2287Issues:41Issues:349

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:1802Issues:17Issues:149

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1596Issues:22Issues:98

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1576Issues:21Issues:85

Awesome-Video-Diffusion-Models

[Arxiv] A Survey on Video Diffusion Models

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookLicense:MITStargazers:1307Issues:19Issues:58

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language:PythonLicense:Apache-2.0Stargazers:867Issues:14Issues:53

Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language:PythonLicense:MITStargazers:725Issues:71Issues:13

animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Language:PythonLicense:MITStargazers:668Issues:16Issues:54

HuggingFace-Download-Accelerator

利用HuggingFace的官方下载工具从镜像网站进行高速下载。

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:513Issues:8Issues:18

video2numpy

Optimized library for large-scale extraction of frames and audio from video.

Language:PythonLicense:MITStargazers:199Issues:3Issues:26

SMT

This is an official implementation for "Scale-Aware Modulation Meet Transformer".

Language:PythonLicense:MITStargazers:176Issues:2Issues:24

coze-beautify

针对 coze (目前可免费使用 GPT-4)https://www.coze.com (海外版) 和 https://www.coze.cn (大陆版) 的 bot 界面优化的 Chrome 插件

Language:TypeScriptLicense:MITStargazers:63Issues:0Issues:0

llm-scheduling-artifact

Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“

Language:PythonLicense:Apache-2.0Stargazers:46Issues:4Issues:1

blog

张振虎的博客