何雨桐's starred repositories

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Language: Python | License: MIT | Stargazers: 7398 | Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 7514 | Issues: 0

AlwaysReddy

AlwaysReddy is an LLM voice assistant that is always just a hotkey away.

Language: Python | License: MIT | Stargazers: 512 | Issues: 0

GlaDOS

This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.

Language: Python | License: MIT | Stargazers: 2747 | Issues: 0

Llama-Chinese

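Llama Chinese community. The Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are collected in real time, and all code has been updated for Llama3. Aims to build the best Chinese Llama large model, fully open source and available for commercial use.

Language: Python | Stargazers: 12700 | Issues: 0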

obsidian-toggl-integration

A Toggl integration plugin for the popular knowledge base application Obsidian.

Language: TypeScript | License: GPL-3.0 | Stargazers: 263 | Issues: 0

data-augmentation-review

List of useful data augmentation resources. Here you will find some less common techniques, libraries, links to GitHub repos, papers, and more.

Stargazers: 1579 | Issues: 0

reminders-menubar

Simple macOS menu bar application to view and interact with reminders. Developed with SwiftUI and using Apple Reminders as a source.

Language: Swift | License: GPL-3.0 | Stargazers: 2259 | Issues: 0

awesome-mac

Now we have become very big, different from the original idea. Collect premium software in various categories.

Language: JavaScript | License: CC0-1.0 | Stargazers: 72744 | Issues: 0

eda_nlp

Data augmentation for NLP, presented at EMNLP 2019

Language: Python | Stargazers: 1564 | Issues: 0

text2vec

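text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.

Language: Python | License: Apache-2.0 | Stargazers: 4208 | Issues: 0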

Fengshenbang-LM

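Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.

Language: Python | License: Apache-2.0 | Stargazers: 3948 | Issues: 0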

tsne-cuda

GPU Accelerated t-SNE for CUDA with Python bindings

Language: Cuda | License: BSD-3-Clause | Stargazers: 1744 | Issues: 0

P-tuning

A simple experiment with the P-tuning method on Chinese.

Language: Python | Stargazers: 137 | Issues: 0

pet

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

Language: Python | License: Apache-2.0 | Stargazers: 1610 | Issues: 0

Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT (the Chinese BERT-wwm model series)

Language: Python | License: Apache-2.0 | Stargazers: 9410 | Issues: 0

NLPDataAugmentation

Chinese NLP Data Augmentation, BERT Contextual Augmentation

Language: Python | Stargazers: 109 | Issues: 0

nlpcda

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

Language: Python | License: Apache-2.0 | Stargazers: 1711 | Issues: 0

xmind-crack-patch

Xmind 2023 crack patch, for learning and research only!

Language: JavaScript | Stargazers: 112 | Issues: 0

pyNeuroML

A single package in Python unifying scripts and modules for reading, writing, simulating and analysing NeuroML2/LEMS models.

Language: Python | License: LGPL-3.0 | Stargazers: 34 | Issues: 0

brian2

Brian is a free, open source simulator for spiking neural networks.

Language: Python | License: NOASSERTION | Stargazers: 897 | Issues: 0

License: MIT | Stargazers: 5 | Issues: 0

NFT-Toolbox

A non-fungible token (NFT) is a non-interchangeable unit of data stored on a blockchain, a form of digital ledger, that can be sold and traded. Each NFT has its own unique identity. Design NFTs and build a web3 dapp that can mint them.

Language: TypeScript | License: Apache-2.0 | Stargazers: 25 | Issues: 0

Mac-typora-activation

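A free Typora activation method for Mac that only requires modifying the official configuration file. It is not a cracked build, and no additional software needs to be downloaded.

License: MIT | Stargazers: 52 | Issues: 0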

obsidian-zotero-integration

Insert and import citations, bibliographies, notes, and PDF annotations from Zotero into Obsidian.

Language: TypeScript | License: GPL-3.0 | Stargazers: 908 | Issues: 0

LLM-And-More

LLM-And-More is a professional, plug-and-play LLM trainer and application builder, a one-stop solution that guides you through the complete LLM workflow with best practices: from data to evaluation, from training to deployment, from idea to service.

Language: Go | Stargazers: 441 | Issues: 0

self_pretraining

A classification model

Language: Python | License: Apache-2.0 | Stargazers: 19 | Issues: 0

Language: HTML | Stargazers: 26 | Issues: 0

Language: Python | Stargazers: 183 | Issues: 0

dataset

A list of medical imaging datasets (An Index for Medical Imaging Datasets)

Stargazers: 2290 | Issues: 0