Pham Thanh Lam's repositories
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
alpaca-lora
Instruct-tune LLaMA on consumer hardware
alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
discord-llm
Experimenting with LLMs to Research, Reflect, and Plan (LLM assistants, retrieval, and Discord integration)
FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
Get-Things-Done-with-Prompt-Engineering-and-LangChain
LangChain and prompt engineering tutorials on large language models (LLMs) such as ChatGPT with custom data. Jupyter notebooks cover loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects include chatting with PDF files using a private LLM and sentiment analysis of cryptocurrency tweets.
ibeta
https://www.ibeta.com/iso-30107-3-presentation-attack-detection-confirmation-letters/
jailbreak_llms
[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
lampts.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
llama-recipes
Examples and recipes for Llama 2 models
llm-attacks
Universal and Transferable Attacks on Aligned Language Models
llm-idiosyncrasies
Code release for "Idiosyncrasies in Large Language Models"
NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
open-llms
🤖 A list of open LLMs available for commercial use.
openssm
OpenSSM: Safe, Reliable, and Trustable Small Specialist Models (SSMs) for Industrial AI Applications.
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
the-algorithm-ml
Source code for Twitter's Recommendation Algorithm
vigogne
Fine-tune French instruction-following models
whispercpp.py
Python bindings for whisper.cpp