sjm1992st

followers

following

stars

Beijing

Shenshen's repositories

simple-CNN

A homework of convolutional neural network

Language:Python24 3 1

sjm1992st.github.io

Personal certificate

200

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

Apache-2.0000

backtrader

Python Backtesting library for trading strategies

GPL-3.0000

BianQue

中文医疗对话模型扁鹊(BianQue)

000

chatglm2-doctor

Language:PythonNOASSERTION000

clause

:horse_racing: Chatopera语义理解系统

NOASSERTION000

DeepRec

DeepRec is a recommendation engine based on TensorFlow.

Apache-2.0000

Emotional_Chatting

020

Firefly

Firefly(流萤): 中文对话式大语言模型(全量微调+QLoRA)，支持微调Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya、Bloom等大模型

000

git-tips

:trollface:Git的奇技淫巧

000

Hyponymy_Hypernym

The hyponymy and hypernym of some noun classes

020

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

MIT000

Lunar-Solar-Calendar-Converter

公历(阳历)农历(阴历)转换，支持时间段从1900-2100

Language:HTML000

models

Models and examples built with TensorFlow

Language:PythonApache-2.0000

mydata

Language:Python020

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonMIT000

OUCML

Language:Python000

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

MIT000

PersonRelationKnowledgeGraph

ChinesePersonRelationGraph, person relationship extraction based on nlp methods.中文人物关系知识图谱项目,内容包括中文人物关系图谱构建,基于知识库的数据回标,基于远程监督与bootstrapping方法的人物关系抽取,基于知识图谱的知识问答等应用。

Language:Python020

PPT_PDF

My Profile

Language:Python000

PromptCBLUE

PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and zero-shot learning in the medical domain in Chinese

Language:Python000

safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Apache-2.0000

scikit-cuda

Python interface to GPU-powered libraries

Language:PythonNOASSERTION000

sjm1992st

000

stable-diffusion

A latent text-to-image diffusion model

NOASSERTION000

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Apache-2.0000

starcoder

Home of StarCoder: fine-tuning & inference!

Apache-2.0000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Apache-2.0000

trl

Train transformer language models with reinforcement learning.

Apache-2.0000