HUANG Jiayu (Curry30huang)

Curry30huang

Geek Repo

Company:Beijing University of Posts and Telecommunications

Location:中国

Github PK Tool:Github PK Tool

HUANG Jiayu's starred repositories

d2l-zh

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Language:PythonLicense:Apache-2.0Stargazers:59348Issues:1045Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:40986Issues:434Issues:9149

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40041Issues:392Issues:1290

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:29842Issues:424Issues:4166

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:26539Issues:179Issues:4290

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:18233Issues:117Issues:504

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:14442Issues:134Issues:2059

Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Language:PythonLicense:Apache-2.0Stargazers:7003Issues:75Issues:385

LASER

Language-Agnostic SEntence Representations

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:3565Issues:89Issues:210

XLM

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Language:PythonLicense:NOASSERTIONStargazers:2870Issues:57Issues:334

research_tao

NLP研究入门之道

License:MITStargazers:1903Issues:81Issues:0

ChineseNLP

Datasets, SOTA results of every fields of Chinese NLP

ConvNeXt-V2

Code release for ConvNeXt V2 model

Language:PythonLicense:NOASSERTIONStargazers:1418Issues:8Issues:68

GP2040-CE

Multi-Platform Gamepad Firmware for Raspberry Pi Pico and other RP2040 boards

Language:C++License:MITStargazers:1250Issues:24Issues:349

uncap

Map Caps Lock to Escape or any key to any key

Prompt-BERT

PromptBERT: Improving BERT Sentence Embeddings with Prompts

BUPTBachelorThesis

A LaTeX Template for BUPT Bachelor Thesis (updated in 2023)

Language:TeXLicense:MITStargazers:174Issues:5Issues:14

trans-encoder

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Language:PythonLicense:Apache-2.0Stargazers:132Issues:7Issues:4

uniapp-vue3-template

使用uniapp+vite+vue3+uview-plus3.0 搭建的微信小程序快速开发模版

pdf_parsing

PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取

scaling_sentemb

Scaling Sentence Embeddings with Large Language Models

mirror-bert

[EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.

Language:PythonLicense:MITStargazers:73Issues:8Issues:4

NoisyNN-PyTorch

non-official NoisyNN Implemnentation

Language:PythonLicense:Apache-2.0Stargazers:47Issues:10Issues:2

mSimCSE

mSimCSE: Multilingual SimCSE

Language:PythonLicense:MITStargazers:33Issues:1Issues:3

Multi-stage-Distillaton-Framework

[NAACL 2022] Source code for the paper "Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching"

Language:PythonStargazers:6Issues:0Issues:0

xsim

Our Research in Cross-lingual Representational Similarity

Language:PythonStargazers:4Issues:1Issues:0

cl-osa

Cross-language plagiarism detection using Wikidata

Language:JavaLicense:MITStargazers:1Issues:0Issues:0

TEIMMA-Reuse-Annotator

TE (Text) - IM(Image) - MA(Math) reuse annotator

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

PaperTool

论文检索与信息提取工具

Language:PythonStargazers:1Issues:0Issues:0