Julian Passebecq (julian-passebecq)

julian-passebecq

Geek Repo

Location:Geneva, Switzerland

Github PK Tool:Github PK Tool

Julian Passebecq's starred repositories

crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

Language:PythonLicense:Apache-2.0Stargazers:2852Issues:0Issues:0

mermaid

Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

Language:JavaScriptLicense:MITStargazers:69306Issues:0Issues:0

parquet-format

Apache Parquet Format

Language:ThriftLicense:Apache-2.0Stargazers:1700Issues:0Issues:0

clickhouse-operator

Altinity Kubernetes Operator for ClickHouse creates, configures and manages ClickHouse clusters running on Kubernetes

Language:GoLicense:Apache-2.0Stargazers:1812Issues:0Issues:0

pythondataanalysis

Python data repo, jupyter notebook, python scripts and data.

Language:HTMLStargazers:404Issues:0Issues:0

adventure-spark

The "Adventure Works - Spark" repository is a collection of code and resources for analyzing the Adventure Works dataset using Databricks, PySpark, Delta Lake, and Python. It provides examples and tools for ingesting, processing, and analyzing the data to gain insights

Language:PythonStargazers:4Issues:0Issues:0

cloudgram

Generate diagrams for your cloud architecture using code

Language:JavaScriptLicense:Apache-2.0Stargazers:92Issues:0Issues:0

zoil

Generates random Oil and Gas Data

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

posts

A list of all my posts and personal projects

Language:Jupyter NotebookStargazers:63Issues:0Issues:0

Building-OLAP-Dimensional-Model-using-BigQuery-and-DBT

This project is about building a dimensional data warehouse in BigQuery by transforming an OLTP system to an OLAP system, using dbt as our data transformation tool.

Language:ShellStargazers:5Issues:0Issues:0

streamlit-calendar

A Streamlit component to show calendar view using FullCalendar

Language:PythonLicense:Apache-2.0Stargazers:101Issues:0Issues:0

covid-19-data

Data on COVID-19 (coronavirus) cases, deaths, hospitalizations, tests • All countries • Updated daily by Our World in Data

Language:PythonStargazers:5659Issues:0Issues:0

PortfolioProjects

PortfolioProjects PY/ML/SQL/ANALYSIS/VISUALIZATION

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

D3FromShiny

This is the code from a tutorial on using D3 with R Shiny

Language:JavaScriptStargazers:9Issues:0Issues:0

spark-sklearn

Scikit-learn integration package for Apache Spark

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

PowerBI-IBCS

IBCS-styled data visualizations created using only core Power BI visuals (Matrix, Table, New Card)

Stargazers:42Issues:0Issues:0

sticky-notes-app

This interactive React app allows users to create sticky notes, as well as edit, search through, save and delete them.

Language:JavaScriptStargazers:3Issues:0Issues:0

Notepad

A markdown editor made with React.

Language:JavaScriptStargazers:3Issues:0Issues:0

mlxtend

A library of extension and helper modules for Python's data analysis and machine learning libraries.

Language:PythonLicense:NOASSERTIONStargazers:4829Issues:0Issues:0

TabularEditor-Scripts

Scripts for Tabular Editor 2 & 3. Community driven to make your Tabular Editor experience as fast as possible.

Language:C#License:MITStargazers:147Issues:0Issues:0

superset

Apache Superset is a Data Visualization and Data Exploration Platform

Language:TypeScriptLicense:Apache-2.0Stargazers:60768Issues:0Issues:0

dbt-duckdb-tutorial

This is a simple analytic project using DuckDB & dbt with air quality data.

Stargazers:11Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

powerbi-client-react

Power BI for React which provides components and services to enabling developers to easily embed Power BI reports into their applications.

Language:TypeScriptLicense:MITStargazers:296Issues:0Issues:0

dbt_facebook_ads

Fivetran data models for Facebook Ads built using dbt.

Language:ShellLicense:Apache-2.0Stargazers:24Issues:0Issues:0

llama-cpp-python-streamlit

A streamlit app for using a llama-cpp-python high level api

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

roadmap

A public roadmap for Streamlit

Language:PythonLicense:Apache-2.0Stargazers:55Issues:0Issues:0

dbt-snowflake-query-tags

From the SELECT team, a dbt package to automatically tag dbt-issued queries with informative metadata.

License:MITStargazers:42Issues:0Issues:0

mlops-coding-course

Learn how to create, develop, and maintain a state-of-the-art MLOps code base

Language:PythonLicense:CC-BY-4.0Stargazers:181Issues:0Issues:0

dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

Language:PythonLicense:NOASSERTIONStargazers:291Issues:0Issues:0