filipecalegario / awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Awesome Generative AI AwesomeTrack Awesome List

A curated list of Generative AI projects, tools, artworks, and models

Repository Introduction

Welcome to our Awesome List of Generative AI resources! This repository is a curated collection of references in the dynamic field of Generative AI, equipped with various sources such as academic papers, technical articles, online courses, tutorials, and software.

Structure

  1. Sections: Each section represents a different Generative AI-related category (e.g., LLMs, prompt engineering, image synthesis, educational resources, etc.). The Inboxes are the more general references of a category. When a new category emerges, it becomes a specific subsection.

  2. References within sections: Inside each section, references are listed in reverse chronological order, with the most recent one at the top. This order signifies the ever-evolving landscape of Generative AI, keeping you up-to-date with the latest developments.

This repository is designed to offer you the most recent advancements at your fingertips, allowing you to explore the depth of older resources at your own pace. It's regularly updated, ensuring you're always on track with the rapidly progressing world of Generative AI.

Contribute to our Repository

Your contributions are welcome and greatly appreciated! If you have a valuable resource that you believe should be on this list, or if you see any outdated information, please make a Pull Request. This will help us maintain the quality and relevance of our Awesome List.

Follow this roadmap, keep learning, and enjoy your journey through Generative AI!

Generative AI Area

Generative AI history, timelines, maps, and definitions

Ethics, Philosophical questions and Discussions about Generative AI

Critical Views about Generative AI

Generative AI Processes and Artifacts

More info

Generative AI is a branch of artificial intelligence that focuses on creating new data based on patterns learned from existing data. Here's a step-by-step explanation of the process:

  1. Starting with Data: Every Generative AI process begins with data. This can be in various forms such as text, images, sounds, or other datasets. This data serves as the foundational material that the AI uses to recognize and understand patterns.

  2. Training the AI: With the data in hand, the next step is 'training'. During this phase, the AI processes the data multiple times to learn and internalize the patterns present. The outcome of this stage is a 'model', which acts like a digital representation of the knowledge derived from the data.

  3. Fine-Tuning: At times, there's a need for the AI to focus on specific nuances or characteristics. In such cases, an additional set of data is used to 'fine-tune' the already trained model, enhancing its capabilities in the desired direction.

  4. Using the Model: After training, the model is prepared to make inferences, which means using its acquired knowledge to process new data and come up with relevant outputs. This inference process can be executed locally on a machine or can be accessed remotely through an 'API'. The choice between local execution and API access often depends on factors like computational resources, application needs, and user preferences. Whether locally or via an API, the goal is to leverage the model's capabilities to derive meaningful results from new data inputs.

  5. Generating New Data: With the model set up, the AI can now produce or 'generate' new data. By giving the AI certain 'input parameters' or guidelines, it returns with 'generated output', which is the newly created content.

  6. Applications: The output generated by the AI can be incorporated into a range of applications, be it websites, mobile apps, or other digital platforms. The 'interface' refers to the user-facing portion of these applications, enabling users to interact with and benefit from the AI's capabilities.

In essence, Generative AI is about feeding an AI system vast amounts of data, training it to grasp underlying patterns, and then utilizing that trained knowledge to produce novel data. The potential applications and benefits of this technology are vast and continue to grow as the field evolves.

Generative AI Tools Directories

Courses and Educational Materials

Human-AI Interaction

Papers Collection

Online Tools and Applications

  • VocalReplica: AI-Powered Vocal and Instrumental Isolation for Your Favorite Tracks
  • LangMagic: Learn languages from native content.
  • Persuva: Persuva is the AI-driven platform to create persuasive, high-converting ad copy at scale.
  • Dittto.ai: Fix your hero copy with an AI trained on top SaaS websites.
  • SEOByAI: Rank Faster on Google with FREE AI SEO Tools
  • SinglebaseCloud: AI-powered backend platform with Vector DB, DocumentDB, Auth, and more to speed up app development.
  • TrollyAI: Create professional SEO articles, 2x faster
  • WebscrapeAI: Scrape any website without code using AI
  • Architecture Helper: Analyze any building architecture, and generate your own custom styles, in seconds.
  • AI-Flow: Connect multiple AI models easily
  • Code to Flow: Visualize, Analyze, and Understand Your Code flow. Turn Code into Interactive Flowcharts with AI. Simplify Complex Logic Instantly.
  • Recast Studio: AI-powered podcast marketing assistant.
  • Clipwing: A tool for cutting long videos into dozens of short clips.
  • Tailor: Get a daily podcast and newsletter, created for you by an AI
  • ZZZ Code AI: AI-powered free website to get any programming question answered or code generated.
  • Scribble Diffusion: turn your sketch into a refined image using AI
  • Paint by Text: Edit your photos using written instructions, with the help of an AI.
  • Scenario AI: AI-generated game assets
  • AnimalAI: custom AI-generated animal portraits (profits are directed to various wildlife conservation organizations)
  • starryai: AI Art Generator App - AI Art Maker
  • ProsePainter: an interactive tool to "paint with words." It incorporates guidable text-to-image generation into a traditional digital painting interface
  • ProsePainter: Image + Sketching Interface + CLIP! - YouTube
  • Cocreator AI: creative computer agent (in wait list)
  • Runway ML: AI video creation suite
  • Hotpot.ai - Hotpot.ai: set of AI Tools to post-process images
  • Toonify yourself by Justin Pinkney: turn a human face into a cartoon
  • deepart.io: a online tool for applying style transfer
  • Artbreeder: web-based tool to generate images by breeding existing images
  • Ostagram.ru: image style transfer plataform
  • cleanup.pictures: remove objects, people, text and defects from any picture for free
  • remove.bg: remove background from images
  • Quick, Draw!: can a neural network learn to recognize doodling? A game to help NL by adding users drawing
  • Nekton.ai: automate your workflows with AI
  • Documind.chat: Chat with PDF using AI. Documind is a powerful chat with pdf tool that lets you ask questions from your pdf documents.
  • Snowpixel: Generate Images/Videos/Animations/Audio/Music/3D Objects with Text and/or Image. Upload your own data to create custom models.
  • Chatpdf.so: Talk to PDF using GPT4 AI. Chatpdf.so is a chatpdf tool that lets you do question answering on your pdf documents.
  • Yona.ai: Create deeply personalized AI chatbots from your own conversations, your stories, your data. You can harness the power of your chat history to build an AI companion for a nostalgic trip down memory lane, whimsical fantasies, or any other unique purpose.
  • Voicesphere: Chat with your documents to get intelligent, context specific answers.

Text

Small Language Models

Large Language Models (LLMs)

Prompt Engineering

Prompt Optimizers

  • [🔥🔥🔥] stanfordnlp/dspy: DSPy: The framework for programming — not prompting — foundation models
  • vaibkumr/prompt-optimizer: Minimize LLM token complexity to save API costs and model computations.
  • PromptPerfect: "Optimize Your Prompts to Perfection"
  • [🔥🔥🔥] LLMLingua: Designing a Language for LLMs via Prompt Compression

Prompt Engineering for Text-to-text

Prompt Engineering for Text-to-image

Mamba

Running LLMs Locally

  • PowerInfer: a high-speed inference engine for deploying LLMs locally
  • [🔥🔥] Ollama: run Llama 2, Code Llama, and other models locally
  • GPT4All: A free-to-use, locally running, privacy-aware chatbot. No GPU or internet required.
  • LM Studio: Discover, download, and run local LLMs
  • ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++

Function Calling

GPTs and Assistant API

Retrieval-Augmented Generation (RAG)

Embeddings and Semantic Search

Autonomous LLM Agents

Multi-agents

LLM Evaluation

LLMOps

  • Eden AI: provides a unique API connected to the AI engines
  • Dify: LLMOps platform for creating and operating AI-native apps based on GPT-4
  • LLM App: LLM App is a Python library that helps you build real-time AI-powered data pipelines with few lines of code.

AI Engineering

Attacks on LLMs

LangChain

ChatGPT

Text-related Generative Tools

Research AI Tools

AI Tools for Research

AI Tools for Searching

  • whitead/paper-qa: "LLM Chain for answering questions from documents with citations"
  • Metaphor: search engine that "understands language — in the form of prompts — so you can say what you're looking for in all the expressive and creative ways"

Image

Image Synthesis

Inbox: Stable Diffusion

Stable Diffusion Deployed Web Tools

Web UI for Stable Diffusion via Google Colab

References Collection about Stable Diffusion

Hypertechniques

ControlNet

Textual Inversion

DreamBooth

Deforum

Creative Uses of Generative AI Image Synthesis Tools

Image Upscaling

Image Restoration

  • sczhou/codeformer: face restoration algorithm for old photos and AI-generated faces
  • TencentARC/GFPGAN: GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration

Image Segmentation

Video and Animation

Audio and Music

Speech

Inbox: Text-to-speech (TTS) and avatars

Inbox: Speech-to-text (STT) and spoken content analysis

Games

Code and Programming

Multimodal

Multimodal Embedding Space

Datasets

  • Ego-Exo4D: a foundational dataset by Meta for research on video learning and multimodal perception Dataset Download
  • Carolina: General Corpus of Contemporary Brazilian Portuguese with provenance and typology information - Corpus Geral do Português Brasileiro Contemporâneo
  • RedPajama-Data-v2 by Together AI: an open dataset with 30 trillion tokens for training Large Language Models
  • Have I Been Trained?: tool for searching 5.8 billion images used to train popular AI art models
  • laion-aesthetic-6pls: exploring 12 million of the 2.3 billion images used to train Stable Diffusion's image generator
  • CLIP retrieval for laion5B: CLIP retrieval using Laion5B. "It works by converting the text query to a CLIP embedding , then using that embedding to query a knn index of clip image embedddings".
  • rom1504/clip-retrieval: Easily compute CLIP embeddings and build a CLIP retrieval system with them
  • LAION: Large-scale Artificial Intelligence Open Network
  • gabolsgabs/DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes

Misc

People and works

Interesting Twitter Accounts

Interesting Instagram Accounts, Posts and Reels

Interesting Youtube Channels

Interesting GitHub Repositories

Artists and Artworks

Galleries

Related Awesome Lists

Bio experiments

  • fMRI-to-image: tweet by danberridge "The 'presented images' were shown to a group of humans. The 'reconstructed images' were the result of an fMRI output to Stable Diffusion. In other words, Stable Diffusion literally read people's minds."

Jobs in Generative AI

Improving Google Colab experience

Auxiliary tools and concepts

  • fynk: AI powered contract management software
  • Taskbase: Virtual assistants packaged with AI powered software.
  • AI Wedding Toast: Generate a personalized wedding speech with AI
  • Interviews Chat: Your Personal Interview Prep & Copilot
  • Inline Help: Answer customer questions before they ask
  • LinkActions: AI Internal Links Assistant
  • Marblism: Generate a SaaS boilerplate from a prompt
  • SiteSpeakAI: Automate your customer support with AI
  • Room Reinvented: Transform your room effortlessly with Room Reinvented! Upload a photo and let AI create over 30 stunning interior styles. Elevate your space today.
  • FairyTailAI: Personalized bedtime story generator
  • PromptPal: Search for prompts and bots, then use them with your favourite AI. All in one place.
  • Never Jobless LinkedIn Message Generator: Maximize Your Interview Chances with AI-Powered LinkedIn Messaging.
  • Aispect: New way to experience events.
  • SiteGPT: Make AI your expert customer support agent.
  • PressPulse AI: Get personalized media coverage leads every morning.
  • GPTHelp.ai: ChatGPT for your website / AI customer support chatbot.
  • chaiNNer-org/chaiNNer: A node-based image processing and AI upscaling GUI that makes it easy to chain together complex processing tasks
  • BIRME: Bulk Image Resizing Made Easy 2.0 (Online & Free)
  • The Art of PNG Glitch
  • HashLips/hashlips_art_engine: tool used to create multiple different instances of artworks based on provided layers
  • Taplio: The all-in-one, AI-powered LinkedIn tool.

Dimension reduction techniques

Roadmaps, Tracks, Rails

Stargazers over time

Stargazers over time

Contribute

Contributions welcome! Read the contribution guidelines first.

License

CC0

To the extent possible under law, Filipe Calegario has waived all copyright and related or neighboring rights to this work.

"Buy Me A Coffee"

About

A curated list of Generative AI tools, works, models, and references

License:Creative Commons Zero v1.0 Universal