Antonio Valerio Miceli Barone (Avmb)

Avmb

Geek Repo

Company:School of Informatics, The University of Edinburgh

Location:Edinburgh, UK

Home Page:http://homepages.inf.ed.ac.uk/amiceli/

Github PK Tool:Github PK Tool

Antonio Valerio Miceli Barone's repositories

lowrank-gru

Gated Recurrent Unit with Low-rank matrix factorization

Language:PythonLicense:NOASSERTIONStargazers:34Issues:7Issues:0

inverse_scaling_prize_code_identifier_swap

Submission to the inverse scaling prize

Language:Jupyter NotebookStargazers:23Issues:1Issues:3

clweadv

Towards cross-lingual distributed representations without parallel text trained with adversarial autoencoders

Language:PythonLicense:LGPL-3.0Stargazers:22Issues:4Issues:0

lowrank-highwaynetwork

Low-rank Highway Networks

Language:PythonLicense:MITStargazers:14Issues:3Issues:1

deep-nmt-architectures

Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"

Language:ShellStargazers:11Issues:0Issues:0

DialogLLMScenic

Dialogue-based generation of self-driving simulation scenarios using Large Language Models

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10Issues:1Issues:0

marian-mBART

Training harness to pretrain a mBART model using Marian

Language:PythonStargazers:5Issues:0Issues:0

lowrank-lstm

Low-rank plus diagonal LSTM

Language:LuaLicense:MITStargazers:3Issues:0Issues:0

deepnl

Deep Learning for Natural Language Processing

Language:PythonLicense:GPL-3.0Stargazers:2Issues:2Issues:0

FlowCrosslingualEmbeddings

NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)

Language:PythonLicense:BSD-3-ClauseStargazers:1Issues:2Issues:0

inverse-scaling-eval-pipeline

Basic pipeline for running different sized GPT models and plotting the results

Language:PythonStargazers:1Issues:0Issues:0

MT_Scaling_Prompt_Injection

Scaling Behavior of Machine Translation with Large Language Models under Prompt Injection Attacks

Language:PythonStargazers:1Issues:2Issues:0

neuralLMReorderer

Non-projective Dependency-based Pre-Reordering with Recurrent Neural Network for Machine Translation.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hackathon_chatarena

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

License:Apache-2.0Stargazers:0Issues:0Issues:0

IntroDeepLearning

Course material for the Introduction to Deep Learning course

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

lm-robustness

Robust recurrent language model with Random Network Distillation

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

lrn

Source code for "A Lightweight Recurrent Network for Sequence Modeling"

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

marian-dev-wmt2020

Fast Neural Machine Translation in C++ - development repository

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

mosesdecoder

Moses, the machine translation system

Language:C++Stargazers:0Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Theano

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0