Repositories of the EPFL Data Science Lab (dlab)
transformers-CFG
🤗 A specialized library for integrating context-free grammars (CFGs) in EBNF with Hugging Face Transformers
homepage2vec
Language-Agnostic Website Embedding and Classification
llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
understanding-decoding
The data and the PyTorch implementation for the models and experiments in the paper "Language Model Decoding as Likelihood–Utility Alignment".
property-inference-attacks
Modular framework for property inference attacks on deep neural networks
transformers-GCD-PR
🤗 Transformers with Context-Free Grammar Generation Support
Negativity_in_2016_campaign
Code for the paper "United States Politicians' Tone Became More Negative with 2016 Primary Campaigns"
distribution-inference-risks
Distribution Inference Risks: Identifying and Mitigating Sources of Leakage
amplification_paradox
This repo contains the simulation code for the paper "The Amplification Paradox in Recommender Systems"
laughing-head
Code for the laughing head paper
wiki_image_classification
Wikipedia Image Classification project
quotebank-toolkit
Scripts for cleaning and enriching Quotebank
edisum
The data and the PyTorch implementation for the models and experiments in the paper "Edisum: Summarizing and Explaining Wikipedia Edits at Scale"
litellm-copy
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
multilingual-entity-insertion
Repository for the "Fine-tuning large language models for link recommendation in Wikipedia" project
youtube-embeddings
YouTube channel embeddings and social dimensions