gisupc

gisupc

Geek Repo

Location:China

Github PK Tool:Github PK Tool

gisupc's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLLicense:CC0-1.0Stargazers:111252Issues:1440Issues:0

awesome-chatgpt-prompts-zh

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49461Issues:562Issues:209

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptLicense:Apache-2.0Stargazers:31444Issues:282Issues:3835

milvus

A cloud-native vector database, storage for next generation AI applications

Language:GoLicense:Apache-2.0Stargazers:29736Issues:277Issues:11818

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29388Issues:339Issues:268

Llama-Chinese

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Chinese-Word-Vectors

100+ Chinese Word Vectors 上百种预训练中文词向量

Language:PythonLicense:Apache-2.0Stargazers:11782Issues:285Issues:167

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

magika

Detect file content types with deep learning

Language:RustLicense:Apache-2.0Stargazers:7754Issues:36Issues:413
Language:PythonLicense:Apache-2.0Stargazers:7098Issues:66Issues:71

StableCascade

Official Code for Stable Cascade

Language:Jupyter NotebookLicense:MITStargazers:6525Issues:61Issues:121

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5247Issues:39Issues:37

python-pinyin

汉字转拼音(pypinyin)

Language:PythonLicense:MITStargazers:4838Issues:99Issues:263

text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:4413Issues:30Issues:148

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language:PythonLicense:MITStargazers:4380Issues:35Issues:327

CLUEDatasetSearch

搜索所有中文NLP数据集,附常用英文NLP数据集

ms-swift

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:3672Issues:20Issues:1099

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language:PythonLicense:BSD-3-ClauseStargazers:2235Issues:22Issues:324

mteb

MTEB: Massive Text Embedding Benchmark

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1841Issues:12Issues:406

aigc

《构筑大语言模型应用:应用开发与架构设计》一本关于 LLM 在真实世界应用的开源电子书,介绍了大语言模型的基础知识和应用,以及如何构建自己的模型。其中包括Prompt的编写、开发和管理,探索最好的大语言模型能带来什么,以及LLM应用开发的模式和架构设计。

llm-books

利用LLM构建应用实践笔记

Vary-toy

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

osmium-tool

Command line tool for working with OpenStreetMap data based on the Osmium library.

Language:C++License:GPL-3.0Stargazers:506Issues:18Issues:207

ByteTransformer

optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

Language:C++License:Apache-2.0Stargazers:453Issues:10Issues:10

nlp-hanzi-similar

The hanzi similar tool.(汉字相似度计算工具,中文形近字算法。可用于手写汉字识别纠正,文本混淆等。)

Language:JavaLicense:NOASSERTIONStargazers:222Issues:6Issues:9

pdf-llm_series

The project is for PDF Python learning with Large Language Model.