Daniel Melemed (danielmelemed)

danielmelemed

Geek Repo

Github PK Tool:Github PK Tool

Daniel Melemed's starred repositories

keras

Deep Learning for humans

Language:PythonLicense:Apache-2.0Stargazers:61161Issues:1910Issues:11914

deeplearning-models

A collection of various deep learning architectures, models, and tips

Language:Jupyter NotebookLicense:MITStargazers:16396Issues:593Issues:28

annoy

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

Language:C++License:Apache-2.0Stargazers:12827Issues:318Issues:394

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language:PythonLicense:NOASSERTIONStargazers:12249Issues:221Issues:606

ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Language:PythonLicense:Apache-2.0Stargazers:10927Issues:193Issues:1059

portia

Visual scraping for Scrapy

Language:PythonLicense:BSD-3-ClauseStargazers:9212Issues:502Issues:451

text_classification

all kinds of text classification models and more with deep learning

Language:PythonLicense:MITStargazers:7776Issues:299Issues:123

dedupe

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Language:PythonLicense:MITStargazers:4012Issues:120Issues:806

StarSpace

Learning embeddings for classification, retrieval and ranking.

more-itertools

More routines for operating on iterables, beyond itertools

Language:PythonLicense:MITStargazers:3544Issues:43Issues:309

textdistance

📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

Language:PythonLicense:MITStargazers:3321Issues:64Issues:0

presidio

Context aware, pluggable and customizable data protection and de-identification SDK for text and images

Language:PythonLicense:MITStargazers:3236Issues:64Issues:387

spark-notebook

Interactive and Reactive Data Science using Scala and Spark.

Language:JavaScriptLicense:Apache-2.0Stargazers:3148Issues:190Issues:515

pelias

Pelias is a modular open-source geocoder using Elasticsearch.

Language:TwigLicense:MITStargazers:3135Issues:102Issues:842

spark-testing-base

Base classes to use when writing tests with Spark

Language:ScalaLicense:Apache-2.0Stargazers:1501Issues:78Issues:206

bert_language_understanding

Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN

frameless

Expressive types for Spark.

Language:ScalaLicense:Apache-2.0Stargazers:869Issues:29Issues:157

AWScala

Using AWS SDK on the Scala REPL

Language:ScalaLicense:NOASSERTIONStargazers:737Issues:36Issues:106

word2vec-graph

Exploring word2vec embeddings as a graph of nearest neighbors

celeb-detection-oss

GIPHY's Open-Source Celebrity Detection Deep Learning Model

Language:PythonLicense:MPL-2.0Stargazers:677Issues:51Issues:7

scikit-kge

Python library to compute knowledge graph embeddings

Language:PythonLicense:MITStargazers:469Issues:37Issues:9

s3monkey

A Python library that allows you to interact with Amazon S3 Buckets as if they are your local filesystem.

lstm-siamese-text-similarity

⚛️ It is keras based implementation of siamese architecture using lstm encoders to compute text similarity

Language:PythonLicense:MITStargazers:280Issues:9Issues:10

LSH

Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents

Language:PythonLicense:MITStargazers:276Issues:10Issues:20

MinHash

Example Python code for comparing documents using MinHash

Language:PythonLicense:MITStargazers:244Issues:5Issues:7

sparkling-graph

SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.

Language:ScalaLicense:BSD-2-ClauseStargazers:150Issues:20Issues:19

zoltar

Common library for serving TensorFlow, XGBoost and scikit-learn models in production.

Language:JavaLicense:Apache-2.0Stargazers:138Issues:25Issues:67

spark-lucenerdd

Spark RDD with Lucene's query and entity linkage capabilities

Language:ScalaLicense:Apache-2.0Stargazers:126Issues:13Issues:65

TENE

A sparsity aware implementation of "Enhanced Network Embedding with Text Information" (ICPR 2018).

Language:PythonLicense:GPL-3.0Stargazers:71Issues:4Issues:0

dataset-person-name-disambiguation

creating a dataset for person name disambiguation using combination of sources like wikipedia, DBLP authors and PPDB.