Ajay Jangid's starred repositories
mlops-zoomcamp
Free MLOps course from DataTalks.Club
data-science-interviews
Data science interview questions and answers
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
ML-Papers-Explained
Explanation to key concepts in ML
llm-zoomcamp
LLM Zoomcamp - a free online course about building a Q&A system
pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
Questgen.ai
Question generation using state-of-the-art Natural Language Processing algorithms
DensePhrases
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.org/abs/2012.12624
zeno-build
Build, evaluate, understand, and fix LLM-based apps
awesome-open-mlops
The Fuzzy Labs guide to the universe of open source MLOps
vakyansh-models
Open source speech to text models for Indic Languages
a_lazy_data_science_guide
A guide book on data science for busy and equally lazy Data Scientists 😄
langchain-cohere-qdrant-retrieval
This is a template retrieval repo to create a Flask api server using LangChain with Cohere embeddings and Qdrant Vector Database
easy-elasticsearch
Using business-level retrieval system (BM25) with Python in just a few lines.
cmlextensions
Added functionality to the cml python package
code_docstring_relevance_checker
A tiny transformer + LGBM based model to find relevance match between code and docstring