Chiawei (codenamewei)

Location: Seoul, South Korea

Home Page: https://codenamewei.substack.com/

Twitter: @codenamewei_

Chiawei's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 65999 | Issues: 550 | Issues: 0

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language: Python | License: Apache-2.0 | Stargazers: 31571 | Issues: 164 | Issues: 4608

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 30006 | Issues: 425 | Issues: 4178

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

DeepFaceLive

Real-time face swap for PC streaming or video calls

Language: Python | License: GPL-3.0 | Stargazers: 25457 | Issues: 356 | Issues: 144

shap

A game theoretic approach to explain the output of any machine learning model.

Language: Jupyter Notebook | License: MIT | Stargazers: 22301 | Issues: 239 | Issues: 2514

rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Language: Python | License: Apache-2.0 | Stargazers: 18455 | Issues: 350 | Issues: 6644

prefect

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Language: Python | License: Apache-2.0 | Stargazers: 15613 | Issues: 162 | Issues: 5337

haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python | License: Apache-2.0 | Stargazers: 15131 | Issues: 131 | Issues: 3447

arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

Language: C++ | License: Apache-2.0 | Stargazers: 14080 | Issues: 353 | Issues: 25383

albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Language: Python | License: MIT | Stargazers: 13883 | Issues: 128 | Issues: 972

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python | License: NOASSERTION | Stargazers: 13776 | Issues: 201 | Issues: 2308

allennlp

An open-source NLP research library, built on PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 11727 | Issues: 280 | Issues: 2557

lime

Lime: Explaining the predictions of any machine learning classifier

Language: JavaScript | License: BSD-2-Clause | Stargazers: 11455 | Issues: 263 | Issues: 634

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Language: Python | License: MIT | Stargazers: 9062 | Issues: 262 | Issues: 268

pydub

Manipulate audio with a simple and easy high level interface

Language: Python | License: MIT | Stargazers: 8689 | Issues: 134 | Issues: 575

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8395 | Issues: 130 | Issues: 1059

librosa

Python library for audio and music analysis

Language: Python | License: ISC | Stargazers: 6907 | Issues: 137 | Issues: 1201

awesome-data-engineering

A curated list of data engineering tools for software developers

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Language: C++ | License: NOASSERTION | Stargazers: 6359 | Issues: 246 | Issues: 925

NN-SVG

Publication-ready NN-architecture schematics.

Language: JavaScript | License: MIT | Stargazers: 4484 | Issues: 59 | Issues: 41

jupyter-book

Create beautiful, publication-quality books and documents from computational content.

Language: Python | License: BSD-3-Clause | Stargazers: 3768 | Issues: 61 | Issues: 1357

knockknock

🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code

Language: Python | License: MIT | Stargazers: 2773 | Issues: 65 | Issues: 41

konlpy

Python package for Korean natural language processing.

Language: Python | License: NOASSERTION | Stargazers: 1401 | Issues: 64 | Issues: 338

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language: Python | License: MIT | Stargazers: 822 | Issues: 31 | Issues: 79

wordninja

Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.

Language: Python | License: MIT | Stargazers: 770 | Issues: 10 | Issues: 21

libri-light

Dataset for lightly supervised training using LibriVox audiobook recordings. https://librivox.org/

Language: Python | License: MIT | Stargazers: 467 | Issues: 21 | Issues: 16

contextualSpellCheck

✔️Contextual word checker for better suggestions

Language: Python | License: MIT | Stargazers: 404 | Issues: 9 | Issues: 42

malaysian-dataset

We gather Malaysian datasets! https://malaysian-dataset.readthedocs.io/

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 295 | Issues: 19 | Issues: 321