benkang-chen's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:70550Issues:574Issues:0

open-interpreter

A natural language interface for computers

Language:PythonLicense:AGPL-3.0Stargazers:54660Issues:410Issues:964

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40613Issues:394Issues:1295

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptLicense:Apache-2.0Stargazers:31827Issues:285Issues:3869

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:26912Issues:728Issues:0

thingsboard

Open-source IoT Platform - Device management, data collection, processing and visualization.

Language:JavaLicense:Apache-2.0Stargazers:17493Issues:563Issues:6656

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:13428Issues:98Issues:781

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7901Issues:108Issues:441

ChineseNlpCorpus

搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。

Language:Jupyter NotebookStargazers:5870Issues:117Issues:25

AreaCity-JsSpider-StatsGov

省市区县乡镇三级或四级城市数据,带拼音标注、坐标、行政区域边界范围;2024年06月16日最新采集,提供csv格式文件,支持在线转成多级联动js代码、通用json格式,提供软件转成shp、geojson、sql、导入数据库;带浏览器里面运行的js采集源码,综合了中华人民共和国民政部、国家统计局、高德地图、腾讯地图行政区划数据

Language:JavaScriptLicense:MITStargazers:5646Issues:128Issues:45

pycorrector

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:5565Issues:84Issues:477

Java-Interview

经历BAT面试后总结的【高级Java后台开发面试指南】,纯净干货无废话,针对高频面试点

IoT-Technical-Guide

:honeybee: IoT Technical Guide --- 从零搭建高性能物联网平台及物联网解决方案和Thingsboard源码分析 :sparkles: :sparkles: :sparkles: (IoT Platform, SaaS, MQTT, CoAP, HTTP, Modbus, OPC, WebSocket, 物模型,Protobuf, PostgreSQL, MongoDB, Spring Security, OAuth2, RuleEngine, Kafka, Docker)

Language:JavaLicense:Apache-2.0Stargazers:4130Issues:133Issues:8

OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Language:PythonLicense:Apache-2.0Stargazers:3973Issues:46Issues:98

pampy

Pampy: The Pattern Matching for Python you always dreamed of.

Language:PythonLicense:MITStargazers:3516Issues:63Issues:33

GLM

GLM (General Language Model)

Language:PythonLicense:MITStargazers:3189Issues:46Issues:192

zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

Language:Jupyter NotebookLicense:MITStargazers:2934Issues:32Issues:187

NLP-Interview-Notes

该仓库主要记录 NLP 算法工程师相关的面试题

transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Language:Jupyter NotebookStargazers:2145Issues:18Issues:89

ImageProcessing-Python

该资源为作者在CSDN的撰写Python图像处理文章的支撑,主要是Python实现图像处理、图像识别、图像分类等算法代码实现,希望该资源对您有所帮助,一起加油。

Language:Jupyter NotebookStargazers:1854Issues:20Issues:4

chat-dataset-baseline

人工精调的中文对话数据集和一段chatglm的微调代码

Language:Jupyter NotebookLicense:MITStargazers:1153Issues:16Issues:34

Med-ChatGLM

Repo for Chinese Medical ChatGLM 基于中文医学知识的ChatGLM指令微调

Language:PythonLicense:Apache-2.0Stargazers:960Issues:12Issues:61

ChatIE

The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul openai key. If keys exceed plan and are invalid, please tell us. The response speed depends on openai. ( sometimes, the official is too crowded and slow)

Language:PythonLicense:NOASSERTIONStargazers:788Issues:8Issues:22

DoctorGLM

基于ChatGLM-6B的中文问诊模型

MedQA-ChatGLM

🛰️ 基于真实医疗对话数据在ChatGLM上进行LoRA、P-Tuning V2、Freeze、RLHF等微调,我们的眼光不止于医疗问答

OpenLLMWiki

OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXiu-貔貅 means fortune.

OpenTextClassification

OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全面的开源文本分类项目,支持中英双语、多种模型、多种任务。

Emotional-Analysis-of-Internet-News

“互联网新闻情感分析”赛题,是CCF大数据与计算智能大赛赛题之一。对新闻情绪进行分类,0代表正面情绪、1代表中性情绪、2代表负面情绪。

ChatGLM_mutli_gpu_tuning

deepspeed+trainer简单高效实现多卡微调大模型

Language:PythonLicense:MITStargazers:116Issues:3Issues:13

FlutterRepos

Flutter 国内开源项目集合

License:Apache-2.0Stargazers:58Issues:5Issues:0