There are 32 repositories under the nlp-machine-learning topic.
An open source library for deep learning end-to-end dialog systems and chatbots.
An Open-Source Framework for Prompt-Learning.
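Chatbot continues along the LLM path. Recent updates add small-parameter SLMs and training scripts, with support for local training. A new ChatAgent implements a range of agents with practical, real-world value.
Curated learning materials for AI fields such as machine learning, NLP, image recognition, and deep learning, plus organized technical material on the architecture and algorithms of search, recommendation, and advertising systems. Includes a collection of notes from leading algorithm experts.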
Datasets, tools, and benchmarks for representation learning of code.
Text Classification Algorithms: A Survey
🌟 100+ original LLM / RL principle diagrams 📚, presented by the author of 《大模型算法》 (Large Model Algorithms)! 💥 (100+ LLM/RL Algorithm Maps)
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
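Papers in natural language processing (with reading notes), reproduced models, data processing, and more (code available in both TensorFlow and PyTorch versions).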
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
End-to-end neural table-text understanding models.
A deep dive into embeddings starting from fundamentals
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
Python AI assistant 🧠
skweak: A software toolkit for weak supervision applied to NLP tasks
We introduce a new model designed for the code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
This repo contains my coursework, assignments, and slides for the Natural Language Processing Specialization by deeplearning.ai on Coursera.
Welcome to the Data Science EBooks repository! This collection offers a variety of high-quality ebooks on Data Science, Machine Learning, and AI. Perfect for both beginners and advanced learners, explore these resources to deepen your knowledge and skills.
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
BabyAI platform. A testbed for training agents to understand and execute language commands.
Grounded search engine (i.e., with source references) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search, etc.
Models for SpaCy that support Chinese
Precision Medicine Knowledge Graph (PrimeKG)
Converse with a book - Built with GPT-3
The Schema-Guided Dialogue Dataset
Resources for learning about Text Mining and Natural Language Processing
The hands-on NLTK tutorial for NLP in Python
Repository with everything necessary for sentiment analysis and related areas
A collection of ICLR papers and open-source projects over the years, including ICLR 2021, ICLR 2022, ICLR 2023, ICLR 2024, and ICLR 2025.
Compendium of the resources available from top NLP conferences.