Julien Heiduk (JulienHeiduk)

Location: France

Julien Heiduk's starred repositories

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Language: Jupyter Notebook | License: MIT | Stargazers: 36678 | Issues: 1222 | Issues: 70

AI-Expert-Roadmap

Roadmap to becoming an Artificial Intelligence Expert in 2022

Language: JavaScript | License: MIT | Stargazers: 28801 | Issues: 961 | Issues: 63

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

data-engineering-zoomcamp

Free Data Engineering course!

Language: Jupyter Notebook | Stargazers: 23888 | Issues: 438 | Issues: 124

twint

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

Language: Python | License: MIT | Stargazers: 15664 | Issues: 325 | Issues: 1173

haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python | License: Apache-2.0 | Stargazers: 14803 | Issues: 132 | Issues: 3371

google-api-python-client

🐍 The official Python client library for Google's discovery-based APIs.

Language: Python | License: Apache-2.0 | Stargazers: 7584 | Issues: 285 | Issues: 1061

snscrape

A social networking service scraper in Python

Language: Python | License: GPL-3.0 | Stargazers: 4321 | Issues: 100 | Issues: 973

orchest

Build data pipelines, the easy way 🛠️

Language: TypeScript | License: Apache-2.0 | Stargazers: 4035 | Issues: 43 | Issues: 480

FLAML

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

Language: Jupyter Notebook | License: MIT | Stargazers: 3793 | Issues: 62 | Issues: 501

EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 3680 | Issues: 77 | Issues: 558

KeyBERT

Minimal keyword extraction with BERT

Language: Python | License: MIT | Stargazers: 3346 | Issues: 32 | Issues: 197

textdistance

📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional external libs usage.

Language: Python | License: MIT | Stargazers: 3337 | Issues: 64 | Issues: 0

Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

Language: Python | License: BSD-3-Clause | Stargazers: 2899 | Issues: 38 | Issues: 327

explainerdashboard

Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.

Language: Python | License: MIT | Stargazers: 2270 | Issues: 23 | Issues: 235

dev-gpt

Your Virtual Development Team

Language: Python | License: Apache-2.0 | Stargazers: 1718 | Issues: 43 | Issues: 35

optimus

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Language: Python | License: Apache-2.0 | Stargazers: 1465 | Issues: 37 | Issues: 218

awesome-colab-notebooks

Collection of Google Colaboratory notebooks for fast and easy experiments

Language: Python | License: MIT | Stargazers: 1267 | Issues: 41 | Issues: 6

PDPbox

Python partial dependence plot toolbox

Language: Jupyter Notebook | License: MIT | Stargazers: 836 | Issues: 18 | Issues: 67

textra

A command-line application to convert images, PDFs, and audio files to text using Apple's APIs

Language: Swift | License: MIT | Stargazers: 645 | Issues: 10 | Issues: 9

common-intern

🤖 A Selenium script to automatically apply to software engineering internships.

scikit-network

Graph Algorithms

Language: Python | License: NOASSERTION | Stargazers: 595 | Issues: 13 | Issues: 64

AWS-Slides

Contains screenshots of all the slides of Andrew Brown's AWS Course

insight

Repository for Project Insight: NLP as a Service

Language: Python | License: GPL-3.0 | Stargazers: 297 | Issues: 12 | Issues: 8

SummerTime

An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo

Language: Python | License: Apache-2.0 | Stargazers: 262 | Issues: 14 | Issues: 52

youtube_search

Tool for searching YouTube videos without using YouTube's heavily rate-limited API

Language: Python | License: MIT | Stargazers: 207 | Issues: 7 | Issues: 25

gym-cryptotrading

A Bitcoin trading environment based on the OpenAI Gym Environment API

Language: Python | License: MIT | Stargazers: 132 | Issues: 15 | Issues: 1

nylon

An intelligent, flexible grammar of machine learning.

Language: Python | License: MIT | Stargazers: 84 | Issues: 7 | Issues: 31

lighthouse-batch-parallel

A module that helps collect websites' Lighthouse audit data in batches. Get the report data stream in CSV, JS Object or JSON format. Also provides a CLI tool to generate the report file directly in CSV or JSON format.

Language: JavaScript | License: Apache-2.0 | Stargazers: 28 | Issues: 2 | Issues: 4

complementary_products_suggestions

Recommender system for detecting complementary products based on products' textual attributes using Siamese Neural Networks. This code is part of my master's thesis research.

Language: Jupyter Notebook | Stargazers: 13 | Issues: 2 | Issues: 1