There are 20 repositories under the nlp-machine-learning topic.
An open source library for deep learning end-to-end dialog systems and chatbots.
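A Chinese chatbot that you can train yourself: train the chatbot you want on your own corpus. It can be used for intelligent customer service, online Q&A, smart chat, and similar scenarios. Currently includes seq2seq, seqGAN, tf2.0, and pytorch versions.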
Datasets, tools, and benchmarks for representation learning of code.
An Open-Source Framework for Prompt-Learning.
Text Classification Algorithms: A Survey
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
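A curated collection of learning materials on machine learning, NLP, image recognition, deep learning, and other areas of artificial intelligence, plus technical material on the architecture and algorithms of search, recommendation, and advertising systems.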
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.
End-to-end neural table-text understanding models.
skweak: A software toolkit for weak supervision applied to NLP tasks
👄 The most accurate natural language detection library for Go, suitable for long and short text alike
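Papers in the field of natural language processing (with reading notes), reproduced models, data processing, and more (code provided in both TensorFlow and PyTorch versions).
Models for SpaCy that support Chinese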
BabyAI platform. A testbed for training agents to understand and execute language commands.
👄 The most accurate natural language detection library for Rust, suitable for long and short text alike
👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Repository with everything necessary for sentiment analysis and related areas
Resources for learning about Text Mining and Natural Language Processing
The hands-on NLTK tutorial for NLP in Python
Python AI assistant 🧠
This repo contains my coursework, assignments, and slides for the Natural Language Processing Specialization by deeplearning.ai on Coursera
Compendium of the resources available from top NLP conferences.
The Schema-Guided Dialogue Dataset
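Data Science Hacks consists of tips and tricks to help you become a better data scientist. The hacks are for everyone, from beginner to advanced, and cover Python, Jupyter Notebook, pandas, and more.
An opinion-mining project based on UGC data from online homestays, covering data mining and NLP processing, including data collection, topic extraction, and sentiment analysis. The goal is to overcome inconsistencies between user ratings and reviews and to evaluate satisfaction with online homestays in real time; it includes online review collection and visual sentiment analysis. The project provides a Baidu Maps POI query entry point that supports automated batch queries of POI information; builds an LDA automatic topic clustering model on the online homestay corpus, using topic center words to find the corresponding topic attribute dictionary; takes user ratings as labels and applies the character-level TextCNN bundled with litNlp for sentiment analysis, treating the sentiment classification probability distribution as the sentiment trend; and finally displays homestay satisfaction across regions as a POI heat map. See the link for software versions.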
Hierarchical Attention Networks for document classification
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Melusine is a high-level library for email classification and feature extraction, "dedicated to French emails".
Natural Language Processing with TensorFlow, published by Packt
Implementation of a character-based convolutional neural network
An RNN-based Chinese Poem Generator