Amanpreet Singh (apsdehal)

Company: CTO @ContextualAI. Past: @huggingface, @facebookresearch

Location: San Francisco

Home Page: https://apsdehal.in

Twitter: @apsdehal


Organizations
jquery
nko5
nyu-mll
sdslabs

Amanpreet Singh's starred repositories

docling

Get your documents ready for gen AI

Language: Python | License: MIT | Stargazers: 10509 | Issues: 0

bee-agent-framework

The framework for building scalable agentic applications.

Language: TypeScript | License: Apache-2.0 | Stargazers: 1088 | Issues: 0

CLAIR_and_APO

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Language: Jupyter Notebook | License: MIT | Stargazers: 46 | Issues: 0

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language: Python | License: Apache-2.0 | Stargazers: 2052 | Issues: 0

searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Language: Python | License: AGPL-3.0 | Stargazers: 14072 | Issues: 0

monolith

⬛️ CLI tool for saving complete web pages as a single HTML file

Language: Rust | License: CC0-1.0 | Stargazers: 11239 | Issues: 0

supervision

We write your reusable computer vision tools. 💜

Language: Python | License: MIT | Stargazers: 24259 | Issues: 0

hatchet

A distributed, fault-tolerant task queue

Language: Go | License: MIT | Stargazers: 4262 | Issues: 0

text-clustering

Easily embed, cluster and semantically label text datasets

Language: Python | License: Apache-2.0 | Stargazers: 463 | Issues: 0

UIE

Unified Structure Generation for Universal Information Extraction

Language: Python | Stargazers: 900 | Issues: 0

pytorch-widedeep

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

Language: Python | License: Apache-2.0 | Stargazers: 1300 | Issues: 0

gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook | License: MIT | Stargazers: 567 | Issues: 0

tantivy

Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

Language: Rust | License: MIT | Stargazers: 12188 | Issues: 0

jsonrepair

Repair invalid JSON documents

Language: TypeScript | License: NOASSERTION | Stargazers: 574 | Issues: 0

ibis

the portable Python dataframe library

Language: Python | License: Apache-2.0 | Stargazers: 5331 | Issues: 0

pkl

A configuration as code language with rich validation and tooling.

Language: Java | License: Apache-2.0 | Stargazers: 10377 | Issues: 0

InPars

Inquisitive Parrots for Search

Language: Python | License: Apache-2.0 | Stargazers: 178 | Issues: 0

trectools

A simple toolkit to process TREC files in Python.

Language: Python | License: BSD-3-Clause | Stargazers: 167 | Issues: 0

highlight

highlight.io: The open source, full-stack monitoring platform. Error monitoring, session replay, logging, distributed tracing, and more.

Language: TypeScript | License: NOASSERTION | Stargazers: 7678 | Issues: 0

surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 14285 | Issues: 0

generalized-kmeans-clustering

Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.

Language: HTML | License: Apache-2.0 | Stargazers: 298 | Issues: 0

YuLan-IR

YuLan-IR: Information Retrieval Boosted LMs

Language: Python | License: MIT | Stargazers: 215 | Issues: 0

DRUGS

Stop messing around with finicky sampling parameters and just use DRµGS!

Language: HTML | License: MIT | Stargazers: 318 | Issues: 0

mwmbl

An open source, non-profit web search engine

Language: Python | License: AGPL-3.0 | Stargazers: 1510 | Issues: 0

geziyor

Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.

Language: Go | License: MPL-2.0 | Stargazers: 2635 | Issues: 0

ml-engineering

Machine Learning Engineering Open Book

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 11665 | Issues: 0

pdffigures2

Given a scholarly PDF, extract figures, tables, captions, and section titles.

Language: Scala | License: Apache-2.0 | Stargazers: 613 | Issues: 0

dynaconf

Configuration Management for Python ⚙

Language: Python | License: MIT | Stargazers: 3794 | Issues: 0

plane

🔥 🔥 🔥 Open Source JIRA, Linear, Monday, and Asana Alternative. Plane helps you track your issues, epics, and product roadmaps in the simplest way possible.

Language: TypeScript | License: AGPL-3.0 | Stargazers: 30825 | Issues: 0

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language: Python | License: MIT | Stargazers: 3563 | Issues: 0