jg-bernard's starred repositories

awesome-web-archiving

An Awesome List for getting started with web archiving

License: CC0-1.0 | Stargazers: 1896 | Issues: 0

open-parse

Improved file parsing for LLMs

Language: Python | License: MIT | Stargazers: 2095 | Issues: 0

waymore

Find way more from the Wayback Machine, Common Crawl, Alien Vault OTX, URLScan & VirusTotal!

Language: Python | License: MIT | Stargazers: 1550 | Issues: 0

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Language: Python | License: MIT | Stargazers: 18942 | Issues: 0

gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL

Language: TypeScript | License: ISC | Stargazers: 18180 | Issues: 0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML | License: Apache-2.0 | Stargazers: 7517 | Issues: 0

gpt-researcher

GPT-based autonomous agent that does comprehensive online research on any given topic

Language: Python | License: MIT | Stargazers: 13093 | Issues: 0

nomic

Interact, analyze and structure massive text, image, embedding, audio and video datasets

Language: Python | Stargazers: 1136 | Issues: 0

gpt4all

GPT4All: Chat with Local LLMs on Any Device

Language: C++ | License: MIT | Stargazers: 67019 | Issues: 0

stringdist

String distance functions for R

Language: R | Stargazers: 314 | Issues: 0
Language: R | License: GPL-3.0 | Stargazers: 72 | Issues: 0

ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Language: Go | License: MIT | Stargazers: 77576 | Issues: 0

anything-llm

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

Language: JavaScript | License: MIT | Stargazers: 17131 | Issues: 0

pytok

A web scraper for TikTok using Playwright

Language: Python | Stargazers: 45 | Issues: 0

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python | License: NOASSERTION | Stargazers: 13712 | Issues: 0

fabricator

[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.

Language: Python | License: Apache-2.0 | Stargazers: 98 | Issues: 0

Giveme5W1H

Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

Language: HTML | License: Apache-2.0 | Stargazers: 503 | Issues: 0

NewsMTSC

Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.

Language: Python | License: NOASSERTION | Stargazers: 135 | Issues: 0

OpenLLM

Run any open-source LLM, such as Llama 2 or Mistral, as an OpenAI-compatible API endpoint in the cloud.

Language: Python | License: Apache-2.0 | Stargazers: 9278 | Issues: 0

timelms

TimeLMs: Diachronic Language Models from Twitter

Language: Jupyter Notebook | Stargazers: 98 | Issues: 0

tweetnlp

TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/understand tweets such as sentiment analysis, emoji prediction, and named entity recognition, powered by state-of-the-art language models specialised on Twitter.

Language: Python | License: MIT | Stargazers: 291 | Issues: 0

gentle

gentle forced aligner

Language: Python | License: MIT | Stargazers: 1409 | Issues: 0

relatio

code base for constructing narrative statements from text

Language: Jupyter Notebook | License: MIT | Stargazers: 90 | Issues: 0

uwot

An R package implementing the UMAP dimensionality reduction method.

Language: R | License: GPL-3.0 | Stargazers: 313 | Issues: 0

umap

Uniform Manifold Approximation and Projection

Language: Python | License: BSD-3-Clause | Stargazers: 7206 | Issues: 0

text

Using Transformers from HuggingFace in R

Language: R | Stargazers: 128 | Issues: 0

huggingfaceR

Hugging Face state-of-the-art models in R

Language: R | License: NOASSERTION | Stargazers: 130 | Issues: 0

recogito

Interactive Annotation of Text and Images

Language: JavaScript | License: NOASSERTION | Stargazers: 18 | Issues: 0

annotorious

Add image annotation functionality to any web page with a few lines of JavaScript.

Language: TypeScript | License: BSD-3-Clause | Stargazers: 627 | Issues: 0

recogito-js

A JavaScript library for text annotation

Language: JavaScript | License: BSD-3-Clause | Stargazers: 351 | Issues: 0