justingodden


Organizations
arenaml

justingodden's starred repositories

tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++ | License: Apache-2.0 | Stargazers: 61630 | Issues: 1686 | Issues: 2643

meilisearch

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language: Python | License: Apache-2.0 | Stargazers: 43358 | Issues: 444 | Issues: 9278

Flowise

Drag & drop UI to build your customized LLM flow

Language: TypeScript | License: Apache-2.0 | Stargazers: 30574 | Issues: 250 | Issues: 1386

typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

Language: C++ | License: GPL-3.0 | Stargazers: 20839 | Issues: 125 | Issues: 1479

paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

Language: Python | License: GPL-3.0 | Stargazers: 19786 | Issues: 108 | Issues: 1614

zincsearch

ZincSearch. A lightweight alternative to Elasticsearch that requires minimal resources, written in Go.

Language: Go | License: NOASSERTION | Stargazers: 16948 | Issues: 153 | Issues: 301

debezium

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Language: Java | License: Apache-2.0 | Stargazers: 10538 | Issues: 217 | Issues: 0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML | License: Apache-2.0 | Stargazers: 8771 | Issues: 58 | Issues: 1110

FastUI

Build better UIs faster.

Language: Python | License: MIT | Stargazers: 8175 | Issues: 65 | Issues: 213

langgraph

Build resilient language agents as graphs.

Language: Python | License: MIT | Stargazers: 6079 | Issues: 65 | Issues: 282

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language: Python | License: MIT | Stargazers: 6064 | Issues: 54 | Issues: 1673

mailpit

An email and SMTP testing tool with API for developers

tortoise-orm

Familiar asyncio ORM for Python, built with relations in mind

Language: Python | License: Apache-2.0 | Stargazers: 4599 | Issues: 50 | Issues: 1082

spotlight

Deep recommender models using PyTorch.

Language: Python | License: MIT | Stargazers: 2977 | Issues: 106 | Issues: 110

camelot

A Python library to extract tabular data from PDFs

Language: Python | License: MIT | Stargazers: 2964 | Issues: 43 | Issues: 318

tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Language: Java | License: Apache-2.0 | Stargazers: 2457 | Issues: 97 | Issues: 0

tabula-py

Simple wrapper of tabula-java: extract tables from PDFs into pandas DataFrames

Language: Python | License: MIT | Stargazers: 2168 | Issues: 47 | Issues: 282

metarank

A low-code machine learning personalized ranking service for articles, listings, search results, and recommendations that boosts user engagement. A friendly Learn-to-Rank engine

Language: Scala | License: Apache-2.0 | Stargazers: 2072 | Issues: 16 | Issues: 303

semantic-router

Superfast AI decision making and intelligent processing of multi-modal data.

Language: Python | License: MIT | Stargazers: 1985 | Issues: 19 | Issues: 161

excalibur

A web interface to extract tabular data from PDFs

Language: HTML | License: MIT | Stargazers: 1571 | Issues: 38 | Issues: 129

create-expo-stack

CLI tool to initialize a React Native application with Expo. Provides options to include TypeScript, file-based routing via Expo Router, configuration-based routing via pure React Navigation, styling via Nativewind, Restyle, Unistyles, StyleSheets, or Tamagui, and/or a backend as a service such as Firebase or Supabase.

Language: EJS | License: MIT | Stargazers: 1201 | Issues: 11 | Issues: 96

aerich

A database migration tool for TortoiseORM, ready for production.

Language: Python | License: Apache-2.0 | Stargazers: 833 | Issues: 22 | Issues: 272

autogen-ui

Web UI for AutoGen (a framework for multi-agent LLM applications)

Language: TypeScript | License: MIT | Stargazers: 728 | Issues: 20 | Issues: 17

FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

Language: Python | License: Apache-2.0 | Stargazers: 618 | Issues: 6 | Issues: 26

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language: Python | License: Apache-2.0 | Stargazers: 549 | Issues: 16 | Issues: 71

opensearch-py

Python Client for OpenSearch

Language: Python | License: Apache-2.0 | Stargazers: 344 | Issues: 20 | Issues: 327

asus-numberpad-driver

Maintained, feature-rich Linux driver for NumberPad(2.0) on Asus laptops. NumberPad(2.0) is an illuminated numeric keypad integrated into the touchpad. It appears after a tap on the top-right corner of the touchpad held for at least 1 s by default (configurable), or after a slide gesture from the top-right or top-left corner toward the center; the left-corner gesture also opens the calculator app (configurable).

Language: Python | License: GPL-2.0 | Stargazers: 260 | Issues: 3 | Issues: 153

showcase-books-search

A site to instantly search 28M books from OpenLibrary using Typesense Search (an open source alternative to Algolia / ElasticSearch) ⚡ 📚 🔍

Language: JavaScript | License: Apache-2.0 | Stargazers: 148 | Issues: 12 | Issues: 2

showcase-hn-comments-semantic-search

Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments

Language: JavaScript | License: Apache-2.0 | Stargazers: 39 | Issues: 4 | Issues: 1