YvanKOB's repositories
500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
500 AI Machine learning Deep learning Computer vision NLP Projects with code
African-Whisper
🚀 Seamlessly fine-tune and deploy Whisper model.
AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"
augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
awesome-ai-agents
A list of AI autonomous agents
Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
awesome-python
An opinionated list of awesome Python frameworks, libraries, software and resources.
chat-ui
Open source codebase powering the HuggingChat app
crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
docindex
⚡️Fast persistent storage of multiple document embeddings and their metadata into Pinecone for RAG.
faceswap
Deepfakes Software For All
label-studio-doctr-ocr-backend
The aim of this repository is to create and make available an image text annotation tool based on Doctr OCR.
Latte
Latte: Latent Diffusion Transformer for Video Generation.
lightly
A Python library for self-supervised learning on images.
material-ui
Material UI: Ready-to-use foundational React components, free forever. It includes Material UI, which implements Google's Material Design.
MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
ocr_ensemble
OCR herbarium labels with an ensemble of image processing and OCR engines
OpenGFW
OpenGFW is a flexible, easy-to-use, open source implementation of GFW (Great Firewall of China) on Linux
openui
OpenUI lets you describe UI using your imagination, then see it rendered live.
python-projects
My junior Python projects
screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
SeeAct
SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
stable-diffusion-webui
Stable Diffusion web UI
SuperAGI
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
tamil_ocr
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
tsr-convstem
High-Performance Transformers for Table Structure Recognition Need Early Convolutions
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild