Chenghao Mou (ChenghaoMou)

Company: Docusign

Location: California, US

Home Page: https://chenghaomou.github.io/

Twitter: @MouChenghao

Chenghao Mou's repositories

text-dedup

All-in-one text de-duplication

Language: Python | License: Apache-2.0 | Stargazers: 553 | Issues: 4 | Issues: 57

touchbar-lyric

Show synced lyrics in the Touch Bar with BetterTouchTool and NetEase APIs

pytorch-pQRNN

Implementation of pQRNN in PyTorch

Language: Python | License: MIT | Stargazers: 45 | Issues: 3 | Issues: 7

embeddings

zero-vocab or low-vocab embeddings

Language: Python | License: MIT | Stargazers: 16 | Issues: 4 | Issues: 2

awesome-data-deduplication

An awesome list of data deduplication use cases, papers, tools, and methods.

Language: Python | License: MIT | Stargazers: 3 | Issues: 1 | Issues: 2
Language: HTML | License: NOASSERTION | Stargazers: 3 | Issues: 3 | Issues: 0

deduplicate-text-datasets

A modified version of Google's tool for plain text files

Language: Rust | License: Apache-2.0 | Stargazers: 3 | Issues: 2 | Issues: 0

karafuru

Traditional Chinese colors in your terminal

Language: Python | License: MIT | Stargazers: 2 | Issues: 1 | Issues: 0

simhash

Simhash in C++

Language: C++ | License: MIT | Stargazers: 2 | Issues: 2 | Issues: 0

lightning-grid-template

A minimal template for pytorch-lightning and grid.ai

Language: Python | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

mini-vae

Minimal GMM VAE model for NLP

Language: Python | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

ai.robots.txt

A list of AI agents and robots to block.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

awesome-nlp

:book: A curated list of resources dedicated to Natural Language Processing (NLP)

License: CC0-1.0 | Stargazers: 0 | Issues: 1 | Issues: 0

bender-ruler

Bender Rule analysis for NLP papers

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

bigcode-analysis

Repository for analysis notebooks and experiments of the BigCode project.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

blog

Public repo for HF blog posts

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 2 | Issues: 0

closedapi

Tired of seeing not-so-open apis behind paywalls.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

data_tooling

Tools for managing datasets for governance and training.

Language: HTML | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

edgar-crawler

SEC EDGAR Exhibit Downloader

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

file-explorer-markdown-titles

Obsidian plugin that adds the Markdown title within your notes to the file explorer

Language: TypeScript | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

go-wordninja

Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.

Language: Go | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

open-source-mac-os-apps

🚀 Awesome list of open source applications for macOS. https://t.me/s/opensourcemacosapps

Language: Swift | License: CC0-1.0 | Stargazers: 0 | Issues: 1 | Issues: 0

paper2audio

Convert research papers to audio files.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

presidio

Context aware, pluggable and customizable data protection and de-identification SDK for text and images

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-dice-loss

Dice loss for data-imbalanced NLP tasks

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

quartz

🌱 a fast, batteries-included static-site generator that transforms Markdown content into fully functional websites

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

star-classification

A tool for the projects you starred on GitHub

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

table-transformer-doclaynet

Table Transformer Fine-tuned with DocLayNet Dataset

License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0