Xiwen.HE's repositories
awesome-reinforcement-learning-zh
Reinforcement learning resources compiled in Chinese
ccf-deadlines
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python CLI, WeChat Applet) / If you find it useful, please star this project, thanks~
ConSERT
Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
decima-sim
Learning Scheduling Algorithms for Data Processing Clusters
DeepRL
Written-test and interview questions on DRL
demo-routenet
Demo of RouteNet in ACM SIGCOMM'19
dolma
Data and tools for generating and inspecting OLMo pre-training data.
ElegantRL
Cloud-native Deep Reinforcement Learning. 🔥
ESMM-1
An explanation of Alibaba's ESMM model
External-Attention-pytorch
🍀 PyTorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
fastText
Library for fast text representation and classification.
GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
graph-learn
An Industrial Graph Neural Network Framework
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
OLMo
Modeling, training, eval, and inference code for OLMo
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
QUANTAXIS
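QUANTAXIS: a fully local quantitative solution that supports task scheduling and distributed deployment, covering data, backtesting, simulation, trading, visualization, and multiple accounts for stocks, futures, options, Hong Kong stocks, and cryptocurrencies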
Reco-papers
Classic papers and resources on recommendation
RL-Stock
📈 How to trade stocks automatically with deep reinforcement learning
rlb-dp
Real-Time Bidding by Reinforcement Learning in Display Advertising
rlkit
Collection of reinforcement learning algorithms
rtb-papers
A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.
Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
text_matching
TensorFlow implementations of common text matching models, using the QA_corpus dataset; continuously updated