Mehdi Cherti (mehdidc)

Company: Juelich Supercomputing Center (JSC), Forschungszentrum Jülich GmbH, LAION

Location: Germany

Home Page: https://mehdidc.github.io

Twitter: @mehdidc

Mehdi Cherti's starred repositories

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda, License: MIT, Stargazers: 18924, Issues: 197, Issues: 101

aider

aider is AI pair programming in your terminal

Language: Python, License: Apache-2.0, Stargazers: 10109, Issues: 94, Issues: 491

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Language: Python, License: NOASSERTION, Stargazers: 8972, Issues: 59, Issues: 2260

undetected-chromedriver

Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / CloudFlare IUAM)

Language: Python, License: GPL-3.0, Stargazers: 8614, Issues: 127, Issues: 1450

trl

Train transformer language models with reinforcement learning.

Language: Python, License: Apache-2.0, Stargazers: 8292, Issues: 75, Issues: 903

docker-selenium

Provides a simple way to run Selenium Grid with Chrome, Firefox, and Edge using Docker, making it easier to perform browser automation

Language: Shell, License: NOASSERTION, Stargazers: 7571, Issues: 255, Issues: 1444

FlareSolverr

Proxy server to bypass Cloudflare protection

Language: Python, License: MIT, Stargazers: 5893, Issues: 57, Issues: 875

DrissionPage

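A Python-based web automation tool. It can both control a browser and send and receive data packets, combining the convenience of browser automation with the efficiency of requests. It is powerful, with many user-friendly design touches and convenience features built in. Its syntax is concise and elegant, and it requires little code.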

Language: Python, License: BSD-3-Clause, Stargazers: 5436, Issues: 147, Issues: 215

SeleniumBase

📊 Python's all-in-one framework for web crawling, scraping, testing, and reporting. Supports pytest. UC Mode provides stealth. Includes many tools.

Language: Python, License: MIT, Stargazers: 4325, Issues: 136, Issues: 1211

cloudscraper

A Python module to bypass Cloudflare's anti-bot page.

Language: Python, License: MIT, Stargazers: 4015, Issues: 148, Issues: 0

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language: Python, License: Apache-2.0, Stargazers: 1684, Issues: 31, Issues: 1017

curl_cffi

Python binding for curl-impersonate via cffi. An HTTP client that can impersonate browser TLS/JA3/HTTP2 fingerprints.

Language: Python, License: MIT, Stargazers: 1457, Issues: 26, Issues: 239

torchtitan

A native PyTorch library for large model training

Language: Python, License: BSD-3-Clause, Stargazers: 1152, Issues: 27, Issues: 80

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language: Python, License: AGPL-3.0, Stargazers: 1023, Issues: 32, Issues: 77

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language: Python, License: Apache-2.0, Stargazers: 527, Issues: 16, Issues: 5

HPSv2

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 283, Issues: 9, Issues: 25

rho

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

OmniFusion

OmniFusion — a multimodal model to communicate using text and images

Language: Python, License: Apache-2.0, Stargazers: 198, Issues: 6, Issues: 2

CloudflareBypassForScraping

A Cloudflare verification bypass script for web scraping

Language: Python, License: MIT, Stargazers: 176, Issues: 3, Issues: 4

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 117, Issues: 3, Issues: 9

nxtp

Object Recognition as Next Token Prediction (CVPR 2024)

Language: Python, License: NOASSERTION, Stargazers: 109, Issues: 2, Issues: 2

llm-compression-intelligence

Official GitHub repo for the paper "Compression Represents Intelligence Linearly"

Language: Python, License: MIT, Stargazers: 86, Issues: 3, Issues: 7

t2v_metrics

Evaluating text-to-image/video/3D models with VQAScore

Language: Python, License: Apache-2.0, Stargazers: 57, Issues: 3, Issues: 0

vlm-evaluation

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Language: Python, License: NOASSERTION, Stargazers: 56, Issues: 0, Issues: 0

lumi-llm-scaling

Scripts and documentation on scaling large language model training on the LUMI supercomputer

Language: Shell, License: MIT, Stargazers: 9, Issues: 1, Issues: 0

scaling-laws-openclip

Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)

Language: Jupyter Notebook, Stargazers: 2, Issues: 1, Issues: 0