Matt Zheng's repositories
DouBanRecommend
Recommendation based on Douban Books, plus a simple knowledge graph and knowledge engine built with neo4j
py-kenlm-model
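python | Efficient use of the statistical language model kenlm: new word discovery, word segmentation, intelligent error correction, and more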
py-yanwenzi
NLP for internet emoticons: kaomoji recognition, kaomoji entity recognition, attribute detection, and new kaomoji discovery
Attention-RNN-Multi-Touch-Attribution
A multi-touch attribution model built with Attention-RNN
python-Apriori
Python: implementation and comparison of two Apriori algorithms, with an exercise based on Toutiao data
gensim-fast2vec
gensim-fast2vec: a modification for flexible use of large-scale external word vectors (with OOV lookup support)
streamlit_demo
A collection of streamlit examples and related blog posts
WA-ModelEnsemble
Weight Averaging Model Ensemble
KwaiSurvival-Test-Demo
Experimental code for testing KwaiSurvival, 2021/7/9
causal_inference_demo
Causal Inference Demo
word-discovery
Faster and more effective Chinese new word discovery
SparkDesk_Document_QA
SparkDesk Document QA
BERT-train2deploy
BERT model, from training to deployment
neo4j-python-pandas-py2neo-v3
Uses pandas to extract data from Excel and load it as triples into a neo4j database to build a knowledge graph
chinese_province_city_area_mapper
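A python module that extracts provinces, cities, and districts from Simplified Chinese strings, with support for mapping, validation, and simple plotting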
el-2019-baseline
A baseline for Baidu's 2019 entity linking competition (ccks2019)
kg-2019-baseline
A baseline for Baidu's 2019 triple extraction competition
QAonMilitaryKG
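QAonMilitaryKG: a QA system based on a military knowledge graph stored in mongodb, unlike the previous one. The knowledge base covers 8 major categories (aircraft, space equipment, and others), more than 100 subcategories, and 5,800 military weapon entries in total. The project does not use a graph database for storage; it uses jieba for question parsing and question entity recognition, and answers multiple question types through query templates. It is mainly intended as an industry-oriented question-answering demo.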
Baidu-AIP-Address
Baidu recently released address recognition, but the python SDK has not been updated, so it can only be called through direct requests
Enterprise-Registration-Data-of-Chinese-Mainland
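A dataset of over 10,000,000 enterprise registration records from 31 provinces in the Chinese mainland, 1978 to 2019, including enterprise name, registered address, unified social credit code, region, registration date, business scope, legal representative, registered capital, enterprise type, and other details. [business registration big data], [enterprise information], [enterprise registration data].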
KOBE
Source code and dataset for KDD 2019 paper "Towards Knowledge-Based Personalized Product Description Generation in E-commerce"
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
ProductKnowledgeGraph
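GoodsKG, a knowledge graph containing the product hierarchy and producer-sells-goods relationships, totaling 1,300 products and more than 90,000 brands. It is a knowledge base built from the JD.com website, covering hypernym and hyponym concepts for products, relationships between products and brands, and product description dimensions. It can support building product attribute libraries, product sales question answering, and knowledge queries such as which brands produce which goods, and can also be used for downstream applications such as sentiment analysis.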
recommenders
Best Practices on Recommendation Systems
spark
Apache Spark - A unified analytics engine for large-scale data processing
tabby
Self-hosted AI coding assistant
YouTubeCommenter
AI to generate YouTube comments based on video title