HongCheng (chg0901)

chg0901

Geek Repo

Location:Kwangwoon University , seoul ,South Korea

Github PK Tool:Github PK Tool

HongCheng's starred repositories

EasySpider

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

Language:JavaScriptLicense:NOASSERTIONStargazers:30503Issues:203Issues:431

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:23688Issues:198Issues:200

llama3-Chinese-chat

Llama3、Llama3.1 中文仓库(聚合资料,各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3012Issues:33Issues:369

stable-diffusion-webui-chinese

stable-diffusion-webui 的汉化扩展

Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️

Language:PythonLicense:Apache-2.0Stargazers:1944Issues:16Issues:5

team-learning-data-mining

主要存储Datawhale组队学习中“数据挖掘/机器学习”方向的资料。

Language:Jupyter NotebookStargazers:1547Issues:28Issues:11

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

Language:PythonLicense:MITStargazers:1476Issues:24Issues:65

metahuman-stream

Real time interactive streaming digital human

Language:PythonLicense:MITStargazers:1036Issues:21Issues:125

llms-from-scratch-cn

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:658Issues:10Issues:5

Llama3-Tutorial

Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language:PythonLicense:MITStargazers:413Issues:18Issues:23

app-builder

appbuilder-sdk, 千帆AppBuilder-SDK帮助开发者灵活、快速的搭建AI原生应用

Language:PythonLicense:Apache-2.0Stargazers:408Issues:35Issues:28

RAG-Retrieval

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT,Cross Encoder

Language:PythonLicense:MITStargazers:376Issues:6Issues:22

intro-mathmodel

《数学建模导论》教程,全网最全数学建模模型与算法教程系列,带你走进数学建模的大门!

Llama3-Chinese-Chat

This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.

StarWhisper

StarWhisper:LLM for Astronomy

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:248Issues:2Issues:13

r-drop

R-Drop方法在中文任务上的简单实验

Language:PythonLicense:MITStargazers:62Issues:2Issues:2

CPsyCoun

[ACL 2024] CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:41Issues:2Issues:0

ChatWithDatawhale

与Datawhale组织的现有仓库以及学习内容对话——快速找到你想学习的内容和贡献内容!

Language:PythonLicense:BSD-3-ClauseStargazers:19Issues:1Issues:0

BeautyMaster

We hope to train VLM to be a beauty master to help you solve the problem of dressing and beauty.

Language:PythonLicense:GPL-3.0Stargazers:11Issues:1Issues:0

MetaGPT-Learn

这个仓库主要用于记录以MetaGPT为基础的Agent开发理论学习记录

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:7Issues:0Issues:0

wonderwiz

大模型驱动的儿童盲盒问答趣味应用

Language:PythonStargazers:3Issues:0Issues:0

Honor-of-Kings_Agent

Honor-of-Kings_Agent

Stargazers:1Issues:0Issues:0