Rongjiehuang

Company: Facebook AI Research (FAIR)

Home Page: rongjiehuang.github.io


Organizations
AIGC-Audio

Rongjiehuang's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | License: MIT | Stargazers: 165791 | Issues: 1551 | Issues: 2501

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language: HTML | License: CC0-1.0 | Stargazers: 108110 | Issues: 1402 | Issues: 0

gpt4all

GPT4All: Chat with Local LLMs on Any Device

gpt_academic

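Provides a practical interactive interface for GPT, GLM, and other large language models (LLMs), optimized in particular for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.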

Language: Python | License: GPL-3.0 | Stargazers: 63238 | Issues: 264 | Issues: 1558

openai-cookbook

Examples and guides for using the OpenAI API

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 46230 | Issues: 305 | Issues: 658

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | Stargazers: 34754 | Issues: 320 | Issues: 427

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language: Python | License: MIT | Stargazers: 23471 | Issues: 384 | Issues: 177

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19372 | Issues: 298 | Issues: 1344

StableLM

StableLM: Stability AI Language Models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 15851 | Issues: 201 | Issues: 76

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12696 | Issues: 169 | Issues: 507

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language: Python | License: NOASSERTION | Stargazers: 9936 | Issues: 131 | Issues: 48

ImageBind

ImageBind: One Embedding Space to Bind Them All

Language: Python | License: NOASSERTION | Stargazers: 8156 | Issues: 100 | Issues: 86
Language: Python | License: NOASSERTION | Stargazers: 7612 | Issues: 84 | Issues: 100
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3013 | Issues: 24 | Issues: 79

speechgpt

💬 SpeechGPT is a web application that enables you to converse with ChatGPT.

Language: TypeScript | License: MIT | Stargazers: 2731 | Issues: 20 | Issues: 47

ChatReviewer

ChatReviewer: Uses ChatGPT to analyze the strengths and weaknesses of papers and suggest improvements

Language: Python | License: NOASSERTION | Stargazers: 1256 | Issues: 3 | Issues: 27

chinese_speech_pretrain

Chinese speech pretrained models

recurrent-memory-transformer

[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.

Language: Jupyter Notebook | Stargazers: 751 | Issues: 10 | Issues: 0
Language: Python | License: MIT | Stargazers: 674 | Issues: 9 | Issues: 27

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language: Python | License: NOASSERTION | Stargazers: 394 | Issues: 30 | Issues: 31

muavic

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Language: Python | License: NOASSERTION | Stargazers: 348 | Issues: 14 | Issues: 20

TranSpeech

PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation

Language: Python | License: MIT | Stargazers: 164 | Issues: 16 | Issues: 7

SimulEval

SimulEval: A General Evaluation Toolkit for Simultaneous Translation

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 97 | Issues: 17 | Issues: 26

AV-ConvTasNet

Unofficial Time Domain Audio Visual Speech Separation Implementation

Language: Python | License: Apache-2.0 | Stargazers: 44 | Issues: 1 | Issues: 4

DaSS

Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)