Beast code in Giters

sleepwalkeryw's starred repositories

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python | 2221100

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language: Python | License: Apache-2.0 | 232000

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language: Python | License: AGPL-3.0 | 862900

Vitron

A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Language: Python | 26600

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language: Python | License: Apache-2.0 | 267200

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Jupyter Notebook | License: Apache-2.0 | 752700

NGD-SLAM

NGD-SLAM: Towards Real-Time SLAM for Dynamic Environments without GPU.

Language: C++ | License: GPL-3.0 | 7200

ml-4m

4M: Massively Multimodal Masked Modeling

Language: Python | License: Apache-2.0 | 146400

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | License: Apache-2.0 | 161700

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python | License: Apache-2.0 | 404800

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language: Python | License: MIT | 92900

ShareGPT4Video

An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Language: Python | 118500

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language: Python | License: MIT | 191400

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | 814600

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | License: Apache-2.0 | 401600

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Language: Python | License: MIT | 263200

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | 2859000

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Language: Python | License: NOASSERTION | 207300

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning models, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won the NAACL 2022 Best Demo Award.

Language: Python | License: Apache-2.0 | 1072500

fastbm25

A fast Python BM25 implementation built on an inverted index

Language: Python | License: Apache-2.0 | 3900

PaddleRec

Large-scale recommendation algorithm library covering classic and recent recommender system algorithms: LR, Wide&Deep, DSSM, TDM, MIND, Word2Vec, Bert4Rec, DeepWalk, SSR, AITM, DSIN, SIGN, IPREC, GRU4Rec, Youtube_dnn, NCF, GNN, FM, FFM, DeepFM, DCN, DIN, DIEN, DLRM, MMOE, PLE, ESMM, ESCMM, MAML, xDeepFM, DeepFEFM, NFM, AFM, RALM, DMR, GateNet, NAML, DIFM, Deep Crossing, PNN, BST, AutoInt, FGCNN, FLEN, Fibinet, ListWise, DeepRec, ENSFM, TiSAS, AutoFIS, and others. Also includes classic recommender system datasets such as Criteo and MovieLens.

Language: Python | License: Apache-2.0 | 419100

translation-agent

PaddleSpeech

Fooocus

RecAI

Qwen-VL

label-studio

joytag

segment-anything

welcome-to-docker

cvat