ORlGlN's repositories
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
DragGAN
Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold
DragGAN-1
Online Demo and Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"
ecoute
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
excalidraw
Virtual whiteboard for sketching hand-drawn like diagrams
Fay
Fay是一个完整的开源项目,包含Fay控制器及数字人模型,可灵活组合出不同的应用场景:虚拟主播、现场推销货、商品导购、语音助理、远程语音助理、数字人互动、数字人面试官及心理测评、贾维斯、Her。 开源项目,非产品试用!!!
feishu-chatgpt
🎒飞书 ×(GPT-3.5 + DALL·E + Whisper)= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀
GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
gerev
🧠 ChatGPT search engine for workplace knowledge 🔎
langflow
⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
OpenPromptStudio
🥣 AIGC 提示词可视化编辑器
OpenVoice
Instant voice cloning by MyShell.
privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
Rope
GUI-focused roop
SadTalker
(CVPR 2023)SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
T-Rex
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
VideoCrafter
A Toolkit for Text-to-Video Generation and Editing
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
whisper
Robust Speech Recognition via Large-Scale Weak Supervision