zhaobingbingbing's starred repositories
Awesome-LLMs-Datasets
Summarize existing representative LLMs text datasets.
evol-teacher
Open Source WizardCoder Dataset
annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
abdominal_ultrasound_classification
Combining deep neural networks with PCA and k-NN classification for abdominal organ recognition in ultrasound images.
EchoDiffusion
MICCAI 2023 code for the paper: Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis. EchoDiffusion is a collection of video diffusion models trained from scratch on the EchoNet-Dynamic dataset with the imagen-pytorch repo.
echo_from_noise
Code to implement "Echo from noise: synthetic ultrasound image generation using diffusion models for real image segmentation"
chest-xray-synthesis
Using GANs and Stable Diffusion to generate Chest Xray data points and evaluating them using convolutional classifiers.
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
evolve-instruct
evolve llm training instruction, from english instruction to any language.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Chinese_from_dongxiexidian
mirror of dongxiexidian/Chinese
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
self-instruct-zh
基于ChatGPT构建的中文self-instruct数据集
sft_datasets
开源SFT数据集整理,随时补充
LLMforDialogDataGenerate
Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集