ORlGlN

followers

0

following

stars

ORlGlN's repositories

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-4-Clause100

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonMIT000

discohead

Language:Python000

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language:PythonMIT000

DragGAN

Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold

Language:PythonMIT000

DragGAN-1

Online Demo and Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"

Language:Python000

ecoute

Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.

Language:PythonMIT000

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

Language:PythonApache-2.0000

excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

Language:TypeScriptMIT000

Fay

Fay是一个完整的开源项目，包含Fay控制器及数字人模型，可灵活组合出不同的应用场景：虚拟主播、现场推销货、商品导购、语音助理、远程语音助理、数字人互动、数字人面试官及心理测评、贾维斯、Her。开源项目，非产品试用！！！

Language:JavaScriptGPL-3.0000

feishu-chatgpt

🎒飞书 ×（GPT-3.5 + DALL·E + Whisper）= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀

Language:Go000

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonMIT000

gerev

🧠 ChatGPT search engine for workplace knowledge 🔎

Language:PythonAGPL-3.0000

huggingface-llama-2-samples

Language:Jupyter Notebook000

langflow

⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.

Language:TypeScriptMIT000

llm-answer-engine

Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper

000

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonApache-2.0000

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

MIT000

OpenPromptStudio

🥣 AIGC 提示词可视化编辑器

Language:Vue000

OpenVoice

Instant voice cloning by MyShell.

NOASSERTION000

privateGPT

Interact privately with your documents using the power of GPT, 100% privately, no data leaks

Language:PythonApache-2.0000

Rope

GUI-focused roop

GPL-3.0000

SadTalker

（CVPR 2023）SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonMIT000

sio

Language:Jupyter Notebook010

solution-vesuvius-challenge-ink-detection

Language:PythonMIT000

T-Rex

T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

NOASSERTION000

VideoCrafter

A Toolkit for Text-to-Video Generation and Editing

Language:Python000

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

NOASSERTION000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT000

xuniren

Language:HTMLMIT000