Shenshen's repositories
simple-CNN
A homework assignment on convolutional neural networks
sjm1992st.github.io
Personal certificate
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
backtrader
Python Backtesting library for trading strategies
BianQue
BianQue (扁鹊), a Chinese medical dialogue model
clause
:horse_racing: Chatopera semantic understanding system
DeepRec
DeepRec is a recommendation engine based on TensorFlow.
Firefly
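Firefly (流萤): a Chinese conversational large language model (full-parameter fine-tuning + QLoRA), supporting fine-tuning of Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya, Bloom, and other large models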
git-tips
:trollface: Git tips and tricks
Hyponymy_Hypernym
Hyponyms and hypernyms of some noun classes
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Lunar-Solar-Calendar-Converter
Converter between the Gregorian (solar) calendar and the Chinese lunar calendar, supporting dates from 1900 to 2100
models
Models and examples built with TensorFlow
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
PersonRelationKnowledgeGraph
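ChinesePersonRelationGraph: person relationship extraction based on NLP methods. A Chinese person-relationship knowledge graph project covering construction of the relationship graph, knowledge-base-driven back-labeling of data, person relation extraction based on distant supervision and bootstrapping, and applications such as knowledge-graph-based question answering.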
PPT_PDF
My Profile
PromptCBLUE
PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and zero-shot learning in the medical domain in Chinese
safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
scikit-cuda
Python interface to GPU-powered libraries
stable-diffusion
A latent text-to-image diffusion model
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
starcoder
Home of StarCoder: fine-tuning & inference!
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
trl
Train transformer language models with reinforcement learning.