NielsRogge

NielsRogge

Geek Repo

Company:HuggingFace

Location:Belgium

Home Page:nielsrogge.github.io

Twitter:@NielsRogge

Github PK Tool:Github PK Tool

NielsRogge's repositories

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:9302Issues:138Issues:450

transformers

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

Language:PythonLicense:Apache-2.0Stargazers:44Issues:4Issues:0

DocLayout-YOLO

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

License:AGPL-3.0Stargazers:2Issues:0Issues:0

huggingface.js

Utilities to use the Hugging Face Hub API

Language:TypeScriptLicense:MITStargazers:2Issues:0Issues:0

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2Issues:0Issues:0

Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

clip_dinoiser

Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.

License:Apache-2.0Stargazers:1Issues:0Issues:0

GST

Official implementation of "GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers"

Language:PythonLicense:BSD-3-ClauseStargazers:1Issues:0Issues:0

Long-CLIP

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

License:Apache-2.0Stargazers:1Issues:0Issues:0

ml-veclip

The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"

License:NOASSERTIONStargazers:1Issues:0Issues:0

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:0Issues:0

AiM

Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"

License:MITStargazers:0Issues:0Issues:0

Apollo

Music repair method to convert lossy MP3 compressed music to lossless music.

Stargazers:0Issues:0Issues:0

chat-ui

Open source codebase powering the HuggingChat app

License:Apache-2.0Stargazers:0Issues:0Issues:0

CoMAE

[AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CounTR

CounTR: Transformer-based Generalised Visual Counting

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

doubletake

[ECCV 2024] DoubleTake: Geometry Guided Depth Estimation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

EMA-VFI

[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio

License:Apache-2.0Stargazers:0Issues:0Issues:0

FluxMusic

Text-to-Music Generation with Rectified Flow Transformers

License:NOASSERTIONStargazers:0Issues:0Issues:0

GenerateCT

ECCV 2024 & GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes

License:MITStargazers:0Issues:0Issues:0

LightenDiffusion

Official pytorch implementation for "LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models"

Stargazers:0Issues:0Issues:0

Lotus

Official Implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

License:Apache-2.0Stargazers:0Issues:0Issues:0

mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

License:MITStargazers:0Issues:0Issues:0

PGTFormer

[IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

shic

Official implementation of the 2024 ECCV paper SHIC: Shape-Image Correspondences with no Keypoint Annotation

Stargazers:0Issues:0Issues:0

sos-bench

This codebase stores the complete artifacts and describes how to reproduce or extend the results from the paper "Style over Substance: Failure modes of LLM judges in alignment benchmarking", including the MisMo-Bench meta-benchmark.

License:Apache-2.0Stargazers:0Issues:0Issues:0

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Language:PythonStargazers:0Issues:0Issues:0

VFIMamba

VFIMamba: Video Frame Interpolation with State Space Models

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0