Michael Günther (guenthermi)

guenthermi

Geek Repo

Company:Jina AI

Location:Berlin, Germany

Twitter:@michael_g_u

Github PK Tool:Github PK Tool


Organizations
embeddings-benchmark
jina-ai
Wikidata

Michael Günther's repositories

postgres-word2vec

utils to use word embedding models like word2vec vectors in a PostgreSQL database

Language:CLicense:MITStargazers:141Issues:9Issues:7

table-embeddings

Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data

Language:PythonLicense:MITStargazers:13Issues:2Issues:1

the-movie-database-import

Script to import data from the The Movie Database to PostgreSQL (Dataset URL: https://www.kaggle.com/rounakbanik/the-movies-dataset

Language:PythonLicense:MITStargazers:10Issues:1Issues:1

postgres-retrofit

Tools to create database-specific text value embeddings from word embedding datasets

Language:PythonLicense:MITStargazers:7Issues:3Issues:0

docarray

🧬 The data structure for unstructured multimodal data · Neural Search · Vector Search · Document Store

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

google-play-dataset-import

Script to import data from a Google Play Store Apps dataset to a PostgreSQL database (Dataset URL: https://www.kaggle.com/lava18/google-play-store-apps)

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

open-food-facts-postgresql-import

Script to import data from the Open Food Facts to PostgreSQL (Dataset URL: https://www.kaggle.com/openfoodfacts/world-food-facts)

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

fast_minh

Python package for fast MinHash calculation and operations

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

mteb

MTEB: Massive Text Embedding Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

NLP-OSS

Democratizing NLP!

License:CC0-1.0Stargazers:0Issues:0Issues:0

SimilarityMeasure

Compute for one node in a graph the most similar one

Language:C++Stargazers:0Issues:1Issues:0

test-gradient-cache

Small test script of gradient cache (https://github.com/luyug/GradCache) applied to train a model for a retrieval task on the SciFact dataset (https://allenai.org/data/scifact)

Language:PythonStargazers:0Issues:1Issues:0