chenchongyuan's repositories
2020CCF-NER
2020 CCF大数据与计算智能大赛-非结构化商业文本信息中隐私信息识别-第7名方案
baichuan-7B
A large-scale 7B pretraining language model developed by Baichuan
bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
ccf_2020_qa_match
ccf 2020 qa match competition top1
ChatGLM-6B
ChatGLM-6B:开源双语对话语言模型 | An open bilingual dialogue language model
digix2020_ctr_rank1
华为digix算法大赛2020机器学习赛道-ctr预估初赛/决赛rank1
gaic_track3_pair_sim
全球人工智能技术创新大赛-赛道三-冠军方案
IQuS
A dataset of informal query understanding with machine reading comprehension
KDD-Cup-2020-MultimodalitiesRecall
KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall
KDD-Multimodalities-Recall
This is our solution for KDD Cup 2020. We implemented a very neat and simple neural ranking model based on siamese BERT which ranked first among the solo teams and ranked 12th among all teams on the final leaderboard.
KDD_WinnieTheBest
KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall first place
keywordfilter
基于分词原理修改写的一个过滤敏感词库,可以改成动态,支持返回敏感词,高亮敏感词,替换敏感词等操作,本敏感词收集了5W多个违法词、敏感词、违禁词,已去重,最新追加了将近1W个最新词,几十个矫正词、变异词。
llm_interview_note
大模型面试题及答案,大模型八股文
NLP-Dictionary
情感词典、停用词典、同义词典、程度词典、否定词典、敏感词典
sensitive-stop-words
互联网常用敏感词、停止词词库
tencent-sensitive-words
腾讯的离线敏感词库
TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
TextLevelGNN
A rough re-implement of Text Level Graph Neural Network for Text Classification
xf_event_extraction2020Top1
科大讯飞2020事件抽取挑战赛第一名解决方案&完整事件抽取系统
xw2020-top1
“2020创青春·交子杯” 挑战赛 AI算法赛道 TOP1方案
YIZHIFU2020-top1
2020翼支付风险用户识别 初赛、复赛AB榜Rank1
zhihu
This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.