Naoto Inoue (naoto0804)

Company: CyberAgent Inc. AILab

Location: Tokyo, Japan

Home Page: https://naoto0804.github.io

Twitter: @naoto_inoue_

Naoto Inoue's starred repositories

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language: Python | License: Apache-2.0 | Stargazers: 1082 | Issues: 40 | Issues: 10

textgrad

Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Language: Python | License: MIT | Stargazers: 995 | Issues: 14 | Issues: 21

webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Language: Python | License: Apache-2.0 | Stargazers: 631 | Issues: 20 | Issues: 108

ziplora-pytorch

Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"

Language: Python | License: MIT | Stargazers: 477 | Issues: 11 | Issues: 19

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 401 | Issues: 16 | Issues: 40

dreamsim

DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight)

Language: Python | License: MIT | Stargazers: 320 | Issues: 11 | Issues: 17

difflogic

A Library for Differentiable Logic Gate Networks

Language: Python | License: MIT | Stargazers: 313 | Issues: 14 | Issues: 15

magi

Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR. (CVPR'24)

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language: Python | License: MIT | Stargazers: 182 | Issues: 7 | Issues: 37
Language: Python | License: MIT | Stargazers: 130 | Issues: 4 | Issues: 0

graphist

Official Repo of Graphist

RALF

[CVPR24 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation

Language: Python | License: Apache-2.0 | Stargazers: 69 | Issues: 2 | Issues: 7

Face2Diffusion

[CVPR 2024] Face2Diffusion for Fast and Editable Face Personalization https://arxiv.org/abs/2403.05094

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 60 | Issues: 3 | Issues: 5

PPTC

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Language: Python | License: MIT | Stargazers: 45 | Issues: 2 | Issues: 7
Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 6 | Issues: 2

strand_integration

[PG2023] Refinement of Hair Geometry by Strand Integration

LACE

Continuous diffusion for layout generation

Language: Python | License: MIT | Stargazers: 17 | Issues: 3 | Issues: 7

FontCLIP

Official implementation of FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications

Language: Jupyter Notebook | License: MIT | Stargazers: 16 | Issues: 4 | Issues: 2

MaskDiffusion

Code for ''MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation''

SVGEditBench

A benchmark dataset for evaluating LLM's SVG editing capabilities

Language: Python | License: MIT | Stargazers: 12 | Issues: 2 | Issues: 0

Manga109Dialog

Official repository of Manga109Dialog (ICME 2024)

Language: Jupyter Notebook | Stargazers: 8 | Issues: 0 | Issues: 0

Awesome-Aesthetics-Assessment

Collection of Aesthetics Assessment Papers for Graphic Designs and Images.

FineBio

Data and code for the paper "FineBio: A Fine-Grained Video Dataset of Biological Experiments with Hierarchical Annotation"

Language: Python | License: MIT | Stargazers: 7 | Issues: 0 | Issues: 0

text2palette

Implementation of "Multimodal Color Recommendation for Vector Graphic Documents" ACM MM'23

Language: Python | License: MIT | Stargazers: 7 | Issues: 0 | Issues: 0

Structure_Guided_Diffusion_Model

A Structure-guided Diffusion Model for Large-hole Image Completion

Language: JavaScript | Stargazers: 6 | Issues: 0 | Issues: 0
Language: JavaScript | License: Apache-2.0 | Stargazers: 4 | Issues: 0 | Issues: 0

cropping-design-constraints

Code for evaluating image cropping under design constraints

Language: Python | Stargazers: 4 | Issues: 1 | Issues: 0

SpotError

[AAAI2024] Spot the Error: Non-Autoregressive Graphic Layout Generation with Wireframe Locator

Stargazers: 3 | Issues: 0 | Issues: 0

adaptive-mbr

Code for "Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding" (2024)

Language: Python | License: MIT | Stargazers: 2 | Issues: 1 | Issues: 0

tdc-typography-generation

This repository contains code for https://arxiv.org/abs/2309.02099

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0