Di Wang (Digo)

Digo

Geek Repo

Company:Carnegie Mellon University

Location:Pittsburgh, PA, USA

Github PK Tool:Github PK Tool


Organizations
asyml
lapps
oaqa

Di Wang's starred repositories

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5529Issues:0Issues:0

splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Language:PythonLicense:NOASSERTIONStargazers:746Issues:0Issues:0

awesome-search

Awesome Search - this is all about the (e-commerce, but not only) search and its awesomeness

Language:HTMLStargazers:1361Issues:0Issues:0

searcharray

Full text search in your Pandas dataframe

Language:PythonLicense:Apache-2.0Stargazers:199Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11277Issues:0Issues:0

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonLicense:Apache-2.0Stargazers:4514Issues:0Issues:0

gritlm

Generative Representational Instruction Tuning

Language:Jupyter NotebookLicense:MITStargazers:530Issues:0Issues:0

gradio

Build and share delightful machine learning apps, all in Python. ๐ŸŒŸ Star to support our work!

Language:PythonLicense:Apache-2.0Stargazers:32270Issues:0Issues:0

leptonai

A Pythonic framework to simplify AI service building

Language:PythonLicense:Apache-2.0Stargazers:2625Issues:0Issues:0

gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Language:PythonLicense:MITStargazers:8205Issues:0Issues:0

x-deeplearning

An industrial deep learning framework for high-dimension sparse data

Language:PureBasicLicense:Apache-2.0Stargazers:4252Issues:0Issues:0

byteps

A high performance and generic framework for distributed DNN training

Language:PythonLicense:NOASSERTIONStargazers:3618Issues:0Issues:0

nsg

Navigating Spreading-out Graph For Approximate Nearest Neighbor Search

Language:C++License:MITStargazers:623Issues:0Issues:0

mind

2020 MIND news recomendation first place solution

Language:Jupyter NotebookStargazers:93Issues:0Issues:0

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language:PythonLicense:AGPL-3.0Stargazers:9423Issues:0Issues:0

nlp-datasets

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

Stargazers:5728Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:747Issues:0Issues:0

lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

Language:TypeScriptLicense:Apache-2.0Stargazers:3465Issues:0Issues:0

yacy_search_server

Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance

Language:JavaLicense:NOASSERTIONStargazers:3383Issues:0Issues:0

denspi

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)

Language:PythonLicense:Apache-2.0Stargazers:200Issues:0Issues:0

TextAttack

TextAttack ๐Ÿ™ is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

Language:PythonLicense:MITStargazers:2906Issues:0Issues:0

awesome-contrastive-self-supervised-learning

A comprehensive list of awesome contrastive self-supervised learning papers.

Stargazers:1206Issues:0Issues:0

checklist

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Language:Jupyter NotebookLicense:MITStargazers:1998Issues:0Issues:0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptLicense:Apache-2.0Stargazers:18307Issues:0Issues:0

annotated_deep_learning_paper_implementations

๐Ÿง‘โ€๐Ÿซ 60+ Implementations/tutorials of deep learning papers with side-by-side notes ๐Ÿ“; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ŸŽฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐Ÿง 

Language:PythonLicense:MITStargazers:54060Issues:0Issues:0

DPR

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Language:PythonLicense:NOASSERTIONStargazers:1699Issues:0Issues:0

textclean

Tools for cleaning and normalizing text data

Language:RStargazers:245Issues:0Issues:0

sparse_attention

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Language:PythonStargazers:1513Issues:0Issues:0

wavegrad

A fast, high-quality neural vocoder.

Language:PythonLicense:Apache-2.0Stargazers:271Issues:0Issues:0

DALI

DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.

Language:PythonLicense:NOASSERTIONStargazers:347Issues:0Issues:0