kstranger's repositories
AGI-Hallucination
A Survey of Multimodal LLM Hallucination
TalkingHead
TTS->3D->Text->Video
AnimateDiff
Official implementation of AnimateDiff.
APISR
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
awesome-spider
A collection of web crawlers
Bert-VITS2
VITS2 backbone with BERT
ComfyUI
The most powerful and modular Stable Diffusion GUI, API, and backend with a graph/nodes interface.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
DPED
new work
fireworks
A spectacular fireworks display
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
gemoji
Emoji images and names.
GPT-SoVITS
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
Megatron-LM
Ongoing research training transformer models at scale
pandora
Pandora, a ChatGPT that helps you breathe smoothly.
pic_beautiful
To the things I love
proxy_pool
Python ProxyPool for web spider
scrapy-redis
Redis-based components for Scrapy.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Unity3DTraining
[Unity Grocery Store] A Unity hodgepodge
VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
VLN_Agent_WF
An Extra Info Model with Multi-Input and Action Output for Agent Navigation
ZurichRain.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes