Nikolay Karelin's starred repositories
llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
llm-attacks
Universal and Transferable Attacks on Aligned Language Models
tidy-text-mining
Manuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
langkit
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀
Complete-Machine-Learning-
This repository contains everything you need to become proficient in Machine Learning
text-clustering
Easily embed, cluster and semantically label text datasets
PassportEye
Extraction of machine-readable zone information from passports, visas and id-cards via OCR
chat-with-websites
Latests Langchain (2024) App to chat with any website given its URL
amber-train
Pre-training code for Amber 7B LLM
ghostbuster
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
realtime-indexer-qa-chat
Streamlit app providing real-time question-answering chat over a document collection.
dl-hse-ami
Deep Learning course materials (HSE, Faculty of Computer Science)
ds_interview_prep_resources
Comprehensive resources for data science interview preparation: assignments, math problems, logic tasks, live coding examples, and leetcode breakdowns.
luigi-course-materials
Материалы для курса Введение в Data Engineering: дата пайплайны
yongks-python-rmarkdown-book
A Python book written in Rmarkdown. Author yongks
kingsbounty3
King's Bounty 3 (extended JavaScript fan remake of original 1990 game)
ghostbuster-data
Data from the paper "Ghostbuster: Detecting Text Ghostwritten by Large Language Models"