EnigmaHong

0 followers, 0 following

EnigmaHong's repositories

cookbook

Examples and guides for using the Gemini API.

License: Apache-2.0, Stargazers: 0, Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model with performance approaching GPT-4V.

License: MIT, Stargazers: 0, Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

License: Apache-2.0, Stargazers: 0, Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

License: Apache-2.0, Stargazers: 0, Issues: 0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

License: GPL-3.0, Stargazers: 0, Issues: 0

manga-image-translator

Translate manga/images: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/

License: GPL-3.0, Stargazers: 0, Issues: 0

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

Stargazers: 0, Issues: 0

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

License: NOASSERTION, Stargazers: 0, Issues: 0

SoraWebui

SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model.

License: Apache-2.0, Stargazers: 0, Issues: 0

doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

License: Apache-2.0, Stargazers: 0, Issues: 0

AnimateDiff

Official implementation of AnimateDiff.

License: Apache-2.0, Stargazers: 0, Issues: 0

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

License: NOASSERTION, Stargazers: 0, Issues: 0

XAgent

An Autonomous LLM Agent for Complex Task Solving

License: Apache-2.0, Stargazers: 0, Issues: 0

lama-cleaner

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

License: Apache-2.0, Stargazers: 0, Issues: 0

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

License: Apache-2.0, Stargazers: 0, Issues: 0

Qwen-7B

The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.

License: NOASSERTION, Stargazers: 0, Issues: 0

ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

License: Apache-2.0, Stargazers: 0, Issues: 0

MetaGPT

🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo

License: MIT, Stargazers: 0, Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

License: AGPL-3.0, Stargazers: 0, Issues: 0

VisualGLM-6B

Chinese and English multimodal conversational language model

License: Apache-2.0, Stargazers: 0, Issues: 0

nunif

Miscellaneous tools; contains the latest version of waifu2x.

License: MIT, Stargazers: 0, Issues: 0

Fay

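Fay is a complete open-source project that includes the Fay controller and digital human models, which can be flexibly combined for different application scenarios: virtual streamer, live sales promotion, product shopping guide, voice assistant, remote voice assistant, digital human interaction, digital human interviewer and psychological assessment, Jarvis, Her. This is an open-source project, not a product trial!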

License: GPL-3.0, Stargazers: 0, Issues: 0

GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

License: Apache-2.0, Stargazers: 0, Issues: 0

RapidOCR

A cross platform OCR Library based on PaddleOCR & OnnxRuntime & OpenVINO.

License: Apache-2.0, Stargazers: 0, Issues: 0

openai-cookbook

Examples and guides for using the OpenAI API

License: MIT, Stargazers: 0, Issues: 0

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

License: MIT, Stargazers: 0, Issues: 0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

License: MIT, Stargazers: 0, Issues: 0

llama.cpp

Port of Facebook's LLaMA model in C/C++

License: MIT, Stargazers: 0, Issues: 0

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect, Segment and Generate Anything with Image, Text, and Audio Inputs

License: Apache-2.0, Stargazers: 0, Issues: 0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

License: BSD-3-Clause, Stargazers: 0, Issues: 0