PeterPham's repositories
DeepStream-Yolo-Pose
NVIDIA DeepStream SDK application for YOLO-Pose models
gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
light-speed
A modified VITS that uses ground-truth phoneme durations for better robustness
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
translation_layoutrecovery
This is a project to translate a .pdf file while preserving its original layout. [UPDATED] We won the Second Prize at the Cinnamon AI Bootcamp 2023.
100-Days-of-Code-Data-Science
Starting a 100 Days Code Challenge for Learning Data Science from Scratch
Automatic_Number_Plate_Detection_Recognition_YOLOv8
Automatic Number Plate Detection YOLOv8
Awesome-diffusion-model-for-image-processing
A summary of diffusion-based image processing, including restoration, enhancement, coding, and quality assessment
cs224u
Code for Stanford CS224u
distill-sd
Segmind Distilled diffusion
facefusion
Next generation face swapper and enhancer
gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
h2ogpt
Private Q&A and summarization of documents+images or chat with local GPT, 100% private, Apache 2.0. Supports LLaMa2, llama.cpp, and more. Demo: https://gpt.h2o.ai/
Lambda-PNN
Unsupervised Deep Learning-based Pansharpening with Jointly-Enhanced Spectral and Spatial Fidelity
LLaMA-Efficient-Tuning
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
LLaSM
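The first open-source, commercially usable dialogue model to support bilingual Chinese-English speech-text multimodal conversation. Convenient speech input greatly improves the experience of using large models that take text as input, while avoiding the cumbersome pipeline of ASR-based solutions and the errors they may introduce.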
magic-edit
MagicEdit: High-Fidelity Temporally Coherent Video Editing
pythoncode-tutorials
The Python Code Tutorials
ReST
[ICCV 2023] ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking
Scenimefy
[ICCV 2023] Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
ToolBench
An open platform for training, serving, and evaluating large language models for tool learning.
voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
YuLan-Chat
YuLan-Chat: An Open-Source Bilingual Chatbot