Theresia Tanzil (theresia)

theresia

Geek Repo

Company:Scrapinghub

Home Page:proses.id

Twitter:@theresiatanzil

Github PK Tool:Github PK Tool


Organizations
id-python

Theresia Tanzil's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLLicense:CC0-1.0Stargazers:105603Issues:1379Issues:0

typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

Language:C++License:GPL-3.0Stargazers:18354Issues:117Issues:1329

SingleFile

Web Extension for saving a faithful copy of a complete web page in a single HTML file

Language:JavaScriptLicense:AGPL-3.0Stargazers:14071Issues:116Issues:1027

khoj

Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp.

Language:PythonLicense:AGPL-3.0Stargazers:10966Issues:61Issues:393

monolith

⬛️ CLI tool for saving complete web pages as a single HTML file

Language:RustLicense:CC0-1.0Stargazers:10177Issues:62Issues:141

Scrapegraph-ai

Python scraper based on AI

Language:PythonLicense:MITStargazers:10083Issues:73Issues:134

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-4-ClauseStargazers:9564Issues:120Issues:627

cookbook

A collection of guides and examples for the Gemini API.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3540Issues:53Issues:60

curl-impersonate

curl-impersonate: A special build of curl that can impersonate Chrome & Firefox

Language:PythonLicense:MITStargazers:3408Issues:59Issues:143

llm-app

LLM App templates for RAG, knowledge mining, and stream analytics. Ready to run with Docker,⚡in sync with your data sources.

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language:PythonLicense:AGPL-3.0Stargazers:1609Issues:24Issues:133

curl_cffi

Python binding for curl-impersonate via cffi. A http client that can impersonate browser tls/ja3/http2 fingerprints.

Language:PythonLicense:MITStargazers:1524Issues:27Issues:244

scrapy-playwright

🎭 Playwright integration for Scrapy

Language:PythonLicense:BSD-3-ClauseStargazers:871Issues:18Issues:209

nvpy

Simplenote syncing note-taking application, inspired by Notational Velocity and ResophNotes, but uglier and cross-platformerer.

Language:PythonLicense:NOASSERTIONStargazers:842Issues:49Issues:173

JGAAP

The Java Graphical Authorship Attribution Program

SearchAnything

A semantic local search engine powered by AI models.

Language:PythonLicense:MITStargazers:240Issues:9Issues:5

cartographist

experimental web browser optimized for rabbit-holing

Language:JavaScriptLicense:GPL-3.0Stargazers:210Issues:4Issues:2

datasette-extract

Import unstructured data (text and images) into structured tables

Language:JavaScriptLicense:Apache-2.0Stargazers:123Issues:3Issues:27

google-takeout-to-sqlite

Save data from Google Takeout to a SQLite database

Language:PythonLicense:Apache-2.0Stargazers:96Issues:6Issues:8

scrapy-impersonate

Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.

Language:PythonLicense:MITStargazers:67Issues:3Issues:7

reddit-gpt-summarizer

Reddit Summarizer using LLMs OpenAI/Anthropic, Streamlit + Python

Whisper2Summarize

Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio transcripts quickly and accurately, making it ideal for a variety of use cases such as note-taking, research, and content creation.

llm-ollama

LLM plugin providing access to local Ollama models using HTTP API

Language:PythonLicense:Apache-2.0Stargazers:42Issues:3Issues:5

evernote-to-sqlite

Tools for converting Evernote content to SQLite

Language:PythonLicense:Apache-2.0Stargazers:36Issues:6Issues:12

llamaindex-practices

This repository contains the source code for running llamaindex tutorials from https://howaibuildthis.substack.com/

Language:Jupyter NotebookStargazers:34Issues:1Issues:0

TopicGPT

TopicGPT allows to integrate the benefits of LLMs into Topic Modelling

Language:PythonLicense:MITStargazers:15Issues:0Issues:4

go-enex

Convert Evernote's export file(*.enex) into HTML and images

Language:GoLicense:MITStargazers:7Issues:1Issues:1

google-takeout-to-sqlite

Save data from Google Takeout to a SQLite database

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0

noto

{"japanese": "note", "javanese": "to arrange"}

Language:PythonStargazers:2Issues:0Issues:0