wayson20's starred repositories

AppFlowy

Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.

Language:DartLicense:AGPL-3.0Stargazers:51382Issues:336Issues:2685

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language:PythonLicense:GPL-3.0Stargazers:46677Issues:1134Issues:1340

joplin

Joplin - the secure note taking and to-do app with synchronisation capabilities for Windows, macOS, Linux, Android and iOS.

Language:TypeScriptLicense:NOASSERTIONStargazers:44800Issues:483Issues:6467

AFFiNE

There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable and ready to use.

Language:TypeScriptLicense:NOASSERTIONStargazers:38056Issues:206Issues:2097

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30051Issues:428Issues:4180

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:15293Issues:130Issues:3476

DocsGPT

GPT-powered chat for documentation, chat with your documents

Language:PythonLicense:MITStargazers:14529Issues:87Issues:372

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:11128Issues:97Issues:364

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:10809Issues:184Issues:1900

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:6554Issues:39Issues:953

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language:PythonLicense:Apache-2.0Stargazers:6265Issues:33Issues:654

pycorrector

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:5423Issues:85Issues:460

stopwords

中文常用停用词表(哈工大停用词表、百度停用词表等)

CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Language:PythonLicense:MITStargazers:1747Issues:28Issues:108
Language:PythonLicense:MITStargazers:1363Issues:34Issues:25

contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Language:PythonLicense:MITStargazers:1189Issues:17Issues:109

evernote-backup

Backup & export all Evernote notes and notebooks

Language:PythonLicense:MITStargazers:916Issues:19Issues:56

NeumAI

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

Language:PythonLicense:Apache-2.0Stargazers:816Issues:9Issues:14

papermage

library supporting NLP and CV research on scientific papers

Language:PythonLicense:Apache-2.0Stargazers:659Issues:9Issues:33

Neural_Topic_Models

Implementation of topic models based on neural network approaches.

Language:Jupyter NotebookStargazers:409Issues:8Issues:18

chinese_correct_wsd

简易的中文纠错和消歧

Knowledge-QA-LLM

QA based on local knowledge and LLM.

Language:PythonLicense:Apache-2.0Stargazers:188Issues:7Issues:10

RapidStructure

版面分析 | 表格识别 | 文档方向分类

Language:PythonLicense:Apache-2.0Stargazers:174Issues:6Issues:16

kgi-slot-filling

This is the code for our KILT leaderboard submissions (KGI + Re2G models).

Language:PythonLicense:Apache-2.0Stargazers:143Issues:7Issues:12

Chinese-Keyphrase-Extraction

无监督中文关键词抽取(Keyphrase Extraction),基于统计,基于图【LDA与PageRank(TextRank, TPR, Salience Rank, Single TPR等)】,基于嵌入【SIFRank等】,开箱即用!

Language:PythonLicense:MITStargazers:100Issues:2Issues:4

rouge_chinese

Python ROUGE Score Implementation for Chinese Language Task (official rouge score)

Language:PythonLicense:Apache-2.0Stargazers:78Issues:0Issues:0
Language:PythonLicense:MITStargazers:75Issues:11Issues:6

BTMpy

BTM in python

Language:Jupyter NotebookLicense:MITStargazers:29Issues:3Issues:10