Manas-Embold's starred repositories

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 10177 | Issues: 162 | Issues: 734

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Language: Python | License: Apache-2.0 | Stargazers: 6282 | Issues: 112 | Issues: 206

GLM

GLM (General Language Model)

Language: Python | License: MIT | Stargazers: 3171 | Issues: 46 | Issues: 192

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language: Python | License: Apache-2.0 | Stargazers: 2835 | Issues: 51 | Issues: 150

copilot-clone

VSCode extension for code suggestion

Language: TypeScript | License: MIT | Stargazers: 1751 | Issues: 30 | Issues: 47

Project_CodeNet

This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX

Language: Python | License: Apache-2.0 | Stargazers: 1536 | Issues: 55 | Issues: 32

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Language: Python | License: MIT | Stargazers: 718 | Issues: 15 | Issues: 103

ProphetNet

A research project for natural language generation, containing the official implementations by MSRA NLC team.

Language: Python | License: MIT | Stargazers: 686 | Issues: 20 | Issues: 76

fastT5

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.

Language: Python | License: Apache-2.0 | Stargazers: 564 | Issues: 13 | Issues: 65

KENLG-Reading

Author: Wenhao Yu (wyu1@nd.edu). ACM Computing Survey'22. Reading list for knowledge-enhanced text generation, with a survey.

pash

PaSh: Light-touch Data-Parallel Shell Processing

Language: Shell | License: MIT | Stargazers: 548 | Issues: 14 | Issues: 208

vulnerablecode

A free and open vulnerabilities database and the packages they impact. And the tools to aggregate and correlate these vulnerabilities. Sponsored by NLnet https://nlnet.nl/project/vulnerabilitydatabase/ for https://www.aboutcode.org/ Chat at https://gitter.im/aboutcode-org/vulnerablecode Docs at https://vulnerablecode.readthedocs.org/

Language: Python | License: Apache-2.0 | Stargazers: 511 | Issues: 22 | Issues: 933

commit-autosuggestions

A tool in which AI automatically recommends commit messages.

Language: Python | License: NOASSERTION | Stargazers: 383 | Issues: 7 | Issues: 4

python-graphs

A static analysis library for computing graph representations of Python programs suitable for use with graph neural networks.

Language: Python | License: Apache-2.0 | Stargazers: 325 | Issues: 8 | Issues: 10

CodeTrans

Pretrained Language Models for Source code

Language: Jupyter Notebook | License: MIT | Stargazers: 247 | Issues: 13 | Issues: 9

Language: Shell | License: GPL-3.0 | Stargazers: 178 | Issues: 18 | Issues: 3

FUNDED_NISL

FUNDED is a novel learning framework for building vulnerability detection models.

D2A

This repository is to support contributions for tools and new data entries for the D2A dataset hosted in DAX

Language: Python | License: Apache-2.0 | Stargazers: 61 | Issues: 9 | Issues: 9

porting

Helper scripts and notes that were used while porting various nlp models

Language: Jupyter Notebook | Stargazers: 44 | Issues: 5 | Issues: 3

Logram

Efficient Log Parsing Using n-Gram Dictionaries

CodeBERT

CodeBERT

Language: Python | License: MIT | Stargazers: 33 | Issues: 1 | Issues: 0

BigDataCourse

Materials for the Advanced Data Analysis Techniques with Apache Spark mini-course

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 27 | Issues: 5 | Issues: 0

CCSD-benchmark-for-code-summarization

This repo is the benchmark for source code summarization on C language

vscode-codecomplete

This repo contains all of the code for my YouTube series on how to create a VSCode extension for autocompleting code using deep learning.

tourniquet

A Python library for easy and fast program transformation/repair

Language: Python | License: Apache-2.0 | Stargazers: 15 | Issues: 28 | Issues: 21

PyART

DataSet and source code for PyART

Language: Python | License: Apache-2.0 | Stargazers: 12 | Issues: 0 | Issues: 0

Duets

Duets is a dataset of 395 open-source Maven-based libraries and 2,874 clients https://ieeexplore.ieee.org/abstract/document/9463096

Gpu-bandwidth-benchmark

A test that measures the speed of transferring tensors from CPU to GPU in PyTorch with 2 CUDA streams

Language: Jupyter Notebook | Stargazers: 4 | Issues: 1 | Issues: 0

Language: Jupyter Notebook | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0