A curated list of Generative AI projects, tools, artworks, and models
- Generative AI Area
- Generative AI history, maps, and definitions
- Ethics, Philosophical questions and Discussions about Generative AI
- Critical Views about Generative AI
- Generative AI Processes and Artifacts
- Generative AI Tools Directories
- Courses and Educational Materials
- Human-AI Interaction
- Papers Collection
- Online Tools and Applications
- Text
- Large Language Models (LLMs)
- Research AI Tools
- Image
- Video and Animation
- Audio and Music
- Speech
- Games
- Code and Programming
- Multimodal
- Datasets
- Misc
Welcome to our Awesome List of Generative AI resources! This repository is a curated collection of references in the dynamic field of Generative AI, equipped with various sources such as academic papers, technical articles, online courses, tutorials, and software.
-
Sections: Each section represents a different Generative AI-related category (e.g., LLMs, prompt engineering, image synthesis, educational resources, etc.). The Inboxes are the more general references of a category. When a new category emerges, it becomes a specific subsection.
-
References within sections: Inside each section, references are listed in reverse chronological order, with the most recent one at the top. This order signifies the ever-evolving landscape of Generative AI, keeping you up-to-date with the latest developments.
This repository is designed to offer you the most recent advancements at your fingertips, allowing you to explore the depth of older resources at your own pace. It's regularly updated, ensuring you're always on track with the rapidly progressing world of Generative AI.
Your contributions are welcome and greatly appreciated! If you have a valuable resource that you believe should be on this list, or if you see any outdated information, please make a Pull Request. This will help us maintain the quality and relevance of our Awesome List.
Follow this roadmap, keep learning, and enjoy your journey through Generative AI!
- The AI Timeline (@TheAITimeline) / X
- Generative AI for Beginners: Part 1 — Introduction to AI | by Raja Gupta | Medium
- Artificial Intelligence Learning Roadmap [AI Roadmap] 2024
- A Brief History of Generative AI - DATAVERSITY
- A Simple Guide To The History Of Generative AI | Bernard Marr
- Generative AI Timeline from January 2023 to July 2023
- The rise of generative AI: A timeline of triumphs, hiccups and hype | CIO Dive
- Brief History In Time: Decoding the Evolution of Generative AI | LinkedIn
- [🔥🔥🔥] FirstMark | 2024 MAD (ML/AI/Data) Landscape: Full Steam Ahead The 2024 MAD (Machine Learning, AI & Data) Landscape
- Timeline of AI forecasts - AI Digest
- Generative AI Iceberg
- [🔥🔥🔥] Generative AI in a nutshell: a map with the most common Generative AI' concepts by Henrik Kniberg Youtube Video explaining the map
- 60+ Generative AI Terms You Must Know By Heart: by Analytics Vidhya
- The Four Wars of the AI Stack (Dec 2023 Recap): "recap of top items for the AI Engineer from Dec 2023" ("The Data Wars, The War of the GPU Rich/Poor, The Multimodality War, The RAG/Ops War")
- GenAI Prism Infographic by Brian Solis: A Framework for Collaborating with Generative AI
- LLM Visualization
- [2310.04438] A Brief History of Prompt: Leveraging Language Models: the paper presents an exploration of the evolution of prompt engineering. The author, Golam Md Muktadir, extensively used ChatGPT for content generation
- An AI Engineer’s Guide to Machine Learning and Generative AI | by ai geek (wishesh) | Oct, 2023 | Medium
- Emerging Trends in Generative AI Research: A Selection of Recent Papers
- The architecture of today's LLM applications - The GitHub Blog
- [🔥🔥🔥] [2310.07127] An HCI-Centric Survey and Taxonomy of Human-Generative-AI Interactions: "a survey of 154 papers, providing a novel taxonomy and analysis of Human-GenAI Interactions from both human and Gen-AI perspectives".
- The Building Blocks of Generative AI | by Jonathan Shriftman | Medium
- [🔥] Generative AI exists because of the transformer: a visual story by Financial Times
- Early days of AI - by Elad Gil: thoughts about AI as "an entirely new era and discontinuity from the past"
- The Next Token of Progress: 4 Unlocks on the Generative AI Horizon | Andreessen Horowitz
- [2309.07930] Generative AI: discusses a model-, system-, and application-level view on generative AI.
- The state of AI in 2023: Generative AI’s breakout year | McKinsey
- A jargon-free explanation of how AI large language models work | Ars Technica
- The Generative AI Revolution: Exploring the Current Landscape | by Towards AI Editorial Team | Jun, 2023 | Towards AI
- The Story of AI Winters and What it Teaches Us Today
- There Would Have Been No LLMs Without This (episode#3 in the History series): timeline of LLMs by Turing Post
- The Next Token of Progress: 4 Unlocks on the Generative AI Horizon | Andreessen Horowitz: critical innovations on the horizon: steering, memory, ability to use tools, and multimodality
- The economic potential of generative AI: The next productivity frontier: report by McKinsey Jun 2023
- A survey of Generative AI Applications | arxiv: "this survey aims to serve as a valuable resource for researchers and practitioners to navigate the rapidly expanding landscape of generative AI"
- Paper Digest - ChatGPT: Recent Papers on ChatGPT
- AI Index Report 2023 – Artificial Intelligence Index: report that measures trends in AI written by the Human-Centered Artificial Intelligence from Stanford University
- A Survey of Large Language Models: paper that summarizes the evolution of language models, with a focus on LLMs, discussing their advances, techniques, and impact on AI development and usage
- The Generative AI Timeline: post in Linkedin by David Foster
- Who Owns the Generative AI Platform? | Andreessen Horowitz: this article discusses the generative AI market and presents an interesting technology stack of the area
- A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT | arxiv
- [🔥🔥] Toward General Design Principles for Generative AI Applications: this paper presents a set of seven principles for the design of generative AI applications
- [🔥] The landscape of generative AI landscape reports | by Ramsri Goutham | Jan, 2023 | Medium: a meta report on the reports published by 9 venture capital firms
- Generative AI with Cohere: Part 1 - Model Prompting: overview of Generative AI by Cohere AI
- Generative AI with Cohere: Part 2 - Use Case Ideation: a list of Generative AI use cases by Cohere AI
- Large Language Models and Where to Use Them: Part 1: a list of LLM use cases by Cohere AI
- Large Language Models and Where to Use Them: Part 2
- What's the big deal with Generative AI? Is it the future or the present?: summarization of the area of Generative AI by Cohere AI
- Timeline of AI and language models: LLM timeline organized by Dr Alan D. Thompson from Life Architect
- A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT | arxiv
- A Review of Generative AI from Historical Perspectives: paper by Dipankar Dasgupta, Deepak Venugopal and Kishor Datta Gupta
- Matt Shumer on Twitter: "The definitive AI market map Twitter thread": "The definitive AI market map Twitter thread"
- [🔥] Base11 Research - generative-ai: report about Generative AI produced by the investment firm Base10
- Engines of Wow: AI Art Comes of Age – Steve Murch
- AI exploded on the scene at the end of 2022 / Twitter: categories for analyzing tools of Generative AI
- [🔥🔥🔥] Mapping the Generative AI landscape | Antler
- [🔥🔥🔥] AI Timeline: A history of text-to-image ML models by Fabian Mosele
- AI-Generated Art: From Text to Images & Beyond Examples
- 1 week of Stable Diffusion | multimodal.art
- The Five Stages Of AI Grief - NOEMA
- Generative AI Ethics: 8 Biggest Concerns and Risks
- Automated Social Science: Language Models as Scientist and Subjects | NBER
- It’s time to retire the term “user”: the proliferation of AI means we need a new word
- Understanding how personality traits, experiences, and attitudes shape negative bias toward AI-generated artworks | Scientific Reports
- Tracking AI: Monitoring Bias in Artificial Intelligence Chatbots
- Will AI’s Next Wave of Super Intelligence Replace Human Ingenuity? It’s Complicated - Grit Daily News
- Who is Afraid of Frankenstein? And of Generative AI? | Fast Company Brasil [PT-BR]
- Hito Steyerl, Mean Images, NLR 140/141, March–June 2023
- The copyright conundrum of AI art - The Verge
- Recommendations for the advancement of artificial intelligence in Brazil – ABC [PT-BR]
- We must stop AI replicating the problems of surveillance capitalism
- Artificial Intelligence at the Service of Collective Intelligence
- New Training Method Helps AI Generalize like People Do - Scientific American
- [2310.01405] Representation Engineering: A Top-Down Approach to AI Transparency: "an approach to enhancing the transparency of AI systems that draws on insights from cognitive neuroscience"
- Generative AI Resources for Berkeley Law Faculty & Staff - Berkeley Law
- Licensing is neither feasible nor effective for addressing AI risks
- Generative AI companies must publish transparency reports
- Does ChatGPT have a liberal bias?
- More human than human: measuring ChatGPT political bias | Public Choice
- Redefining Bias: The Human Prejudice Against AI | Medium
- AI Art and its Impact on Artists: paper published in the Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society
- The Age of AI has begun | Bill Gates
- The AIKEA Effect: by Artur Piszek
- Ethics of Artificial Intelligence: Case Studies and Options for Addressing Ethical Challenges | SpringerLink
- Embracing change and resetting expectations | Microsoft Unlocked: text by Terence Tao
- Art and the science of generative AI | Science
- Where AI evolves from here
- The Age of AI has begun: notes by Bill Gates
- GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models: OpenAI's paper that discusses the possible implications of GPTs on the U.S. labor market
- Why generative AI scares artists but not content writers
- Cultures in AI/AI in Culture: NeurIPS 2022 Workshop webpage
- AI Data Laundering - Waxy.org: How Academic and Nonprofit Researchers Shield Tech Companies from Accountability
- [🔥🔥🔥] (1232) The End of Art: An Argument Against Image AIs - YouTube: video essay by Steven Zapata
- [🔥🔥🔥] The End of Art: An Argument Against Image AIs (Public) - Google Docs: transcript of the video essay by Steven Zapata
- [🔥🔥🔥] Generative AI: A Creative New World | Sequoia Capital US/Europe: report by Sequoia Capital about the possible applications of Generative AI
- Synthetic Creativity - by Cavin - Deep Markets
- Our Vision for the Future of Synthetic Media | by Victor Riparbelli | Medium
- Deep Else: A Critical Framework for AI Art
- How Photography Became An Art Form | Aaron Hertzmann’s blog
- Can Computers Create Art? by Aaron Hertzmann: 2018's essay published on the Arts Journal
- Text Is the Universal Interface - Scale
- This artist is dominating AI-generated art. And he’s not happy about it. | MIT Technology Review
- The REAL fight over AI art: StableDiffusion | Reddit
- Rutkowski battling AI art overlord | Reddit
- Instead of mining cryptocoins with GPUs, are we now mining art? | Reddit
- Using AI to create art is NOT art! | Reddit : ArtistLounge
- Appreciating the Poetic Misunderstandings of A.I. Art | The New Yorker
- Generative AI is not the panacea we’ve been promised | Eric Siegel for Big Think+ - YouTube
- Thoughts on GenAI by James Gosling
- Automated Social Science: Language Models as Scientist and Subjects | NBER
- When Will the GenAI Bubble Burst? - by Gary Marcus
- Nightshade, the tool that ‘poisons’ data, gives artists a fighting chance against AI | TechCrunch
- How AI Fails Us | Edmond & Lily Safra Center for Ethics
- Generative AI Has a Visual Plagiarism Problem - IEEE Spectrum: "Experiments with Midjourney and DALL-E 3 show a copyright minefield"
- [2308.03762] GPT-4 Can't Reason: "despite the genuinely impressive improvement, there are good reasons to be highly skeptical of GPT-4's ability to reason"
- Risk and Harm: Unpacking Ideologies in the AI Discourse | Proceedings of the 5th International Conference on Conversational User Interfaces
- [2305.18654] Faith and Fate: Limits of Transformers on Compositionality
- [2210.02667] A Human Rights-Based Approach to Responsible AI
- On the Dangers of Stochastic Parrots | Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency
- This new data poisoning tool lets artists fight back against generative AI | MIT Technology Review
- The Short-Term Effects of Generative Artificial Intelligence on Employment: Evidence from an Online Labor Market by Xiang Hui, Oren Reshef, Luofeng Zhou :: SSRN
- AI in Education Group Meeting Notes - Google Docs
- Syllabi Policies for AI Generative Tools - Google Docs
- Five takeaways from UK’s AI safety summit at Bletchley Park | Artificial intelligence (AI) | The Guardian
- Frontier AI: capabilities and risks – discussion paper - GOV.UK
- AI Safety Summit Policy Updates | AISS 2023
- Responsible enterprise decisions with knowledge-enriched generative AI | Deloitte Netherlands
- [2310.13149] Understanding Generative AI in Art: An Interview Study with Artists on G-AI from an HCI Perspective
- [2309.12338] Artificial Intelligence and Aesthetic Judgment: "as generative AI influences contemporary aesthetic judgment we outline some of the pitfalls and traps in attempting to scrutinize what AI generated media means"
- AI Worship | Marginal REVOLUTION
- Artificial intelligence technology behind ChatGPT was built in Iowa — with a lot of water | AP News
- ChatGPT is fun, but not an author | Science
- Behind the AI boom, an army of overseas workers in ‘digital sweatshops’ | The Washington Post: Scale AI’s Remotasks workers in the Philippines cry foul over low pay
- It’s Not Intelligent If It Always Halts: A Critical Perspective on Current Approaches to AGI | Life Is Computation
- The human costs of the AI boom | TechCrunch
- AI Scams, Spam, Hacking, Are Ruining the Internet
- The ChatGPT revolution is another tech fantasy
- Why AI Will Save the World | Andreessen Horowitz
- Hollywood studios proposed AI contract that would give them likeness rights ‘for the rest of eternity’ - The Verge
- The shady world of Brave selling copyrighted data for AI training
- Inside the AI Factory: the humans that make tech seem human - The Verge
- Why transformative artificial intelligence is really, really hard to achieve
- AI and the automation of work — Benedict Evans
- Yuval Noah Harari argues that AI has hacked the operating system of human civilisation
- Generative AI Takes Stereotypes and Bias From Bad to Worse
- Governance of superintelligence by OpenAI
- AIAAIC - AIAAIC Repository: "The independent, open, public interest resource detailing incidents and controversies driven by and relating to artificial intelligence, algorithms, and automation"
- Just Calm Down About GPT-4 Already - IEEE Spectrum
- Pause Giant AI Experiments: An Open Letter - Future of Life Institute
- "OpenAI released plugins for ChatGPT": tweet from @thealexbanks with a list of reflections about the impact of ChatGPT plugins
- Is a socially fair Artificial Intelligence possible? | Uma Inteligência Artificial socialmente justa é possível?: post in Portuguese by H.D. Mabuse
- Noam Chomsky on ChatGPT: It's "Basically High-Tech Plagiarism" and "a Way of Avoiding Learning" | Open Culture
- Despite Their Feats, Large Language Models Still Haven't Contributed to Linguistics | Towards Data Science
- Will ChatGPT Kill the Student Essay? | The Atlantic
- What ChatGPT and generative AI mean for science | Nature
- ChatGPT Is a Bullshit Generator Waging Class War
- Some thoughts about generative AI and the future of education – Mark Carrigan
- Educator Considerations for ChatGPT - OpenAI API
- Stable Diffusion Frivolous · Because lawsuits based on ignorance deserve a response.: a community response for the "Stable Diffusion litigation"
- Stable Diffusion litigation · Joseph Saveri Law Firm & Matthew Butterick
- Generative Language Models and Automated Influence Operations: Emerging Threats and Potential Mitigations | OpenAI
- Abstracts written by ChatGPT fool scientists
- When Machines Change Art | Aaron Hertzmann’s blog
- The Dark Risk of Large Language Models | WIRED UK
- ChatGPT, DALL-E 2 and the collapse of the creative process
- What AI-Generated Art Really Means for Human Creativity | WIRED
- Forecasting Potential Misuses of Language Models for Disinformation Campaigns—and How to Reduce Risk
- The Dark Side of AI Art: 4 Potential Issues With the Growing Trend
- Armed With ChatGPT, Cybercriminals Build Malware And Plot Fake Girl Bots
- ChatGPT And The Mass Production Of Office Work - Farsight
- The Danger Of ChatGPT Nobody Talks About | by Jacob Ferus | Dec, 2022 | Medium
- Mind Control in the Metaverse. If we’ve learned anything about… | by Louis Rosenberg | Predict | Dec, 2022 | Medium
- The Brilliance and Weirdness of ChatGPT - The New York Times
- Como o texto gerado por Inteligência Artificial está envenenando a Internet - MIT Technology Review
- O ChatGPT é o momento “Jurassic Park” da inteligência artificial - NeoFeed
- Por favor, mais racionalidade e menos frenesi em relação ao chatGPT (Parte 1 de 2) | by Cezar Taurion | Dec, 2022 | Medium
- E se estivermos usando uma IA pseudocientífica? - Diogo Cortiz
- As limitações da sensação tecnológica de 2023: o ChatGPT | IAgora? | Época NEGÓCIOS
- 7 Revealing Ways AIs Fail - IEEE Spectrum
More info
Generative AI is a branch of artificial intelligence that focuses on creating new data based on patterns learned from existing data. Here's a step-by-step explanation of the process:
-
Starting with Data: Every Generative AI process begins with data. This can be in various forms such as text, images, sounds, or other datasets. This data serves as the foundational material that the AI uses to recognize and understand patterns.
-
Training the AI: With the data in hand, the next step is 'training'. During this phase, the AI processes the data multiple times to learn and internalize the patterns present. The outcome of this stage is a 'model', which acts like a digital representation of the knowledge derived from the data.
-
Fine-Tuning: At times, there's a need for the AI to focus on specific nuances or characteristics. In such cases, an additional set of data is used to 'fine-tune' the already trained model, enhancing its capabilities in the desired direction.
-
Using the Model: After training, the model is prepared to make inferences, which means using its acquired knowledge to process new data and come up with relevant outputs. This inference process can be executed locally on a machine or can be accessed remotely through an 'API'. The choice between local execution and API access often depends on factors like computational resources, application needs, and user preferences. Whether locally or via an API, the goal is to leverage the model's capabilities to derive meaningful results from new data inputs.
-
Generating New Data: With the model set up, the AI can now produce or 'generate' new data. By giving the AI certain 'input parameters' or guidelines, it returns with 'generated output', which is the newly created content.
-
Applications: The output generated by the AI can be incorporated into a range of applications, be it websites, mobile apps, or other digital platforms. The 'interface' refers to the user-facing portion of these applications, enabling users to interact with and benefit from the AI's capabilities.
In essence, Generative AI is about feeding an AI system vast amounts of data, training it to grasp underlying patterns, and then utilizing that trained knowledge to produce novel data. The potential applications and benefits of this technology are vast and continue to grow as the field evolves.
- ToolList.ai: AI Tools Aggregator
- Toolify: AI Tools Directory & AI Tools List
- LLM Explorer: A Curated LLM List. Explore LLM List of the Open-Source LLM Models
- OrbicAI: "The Larget AI Directory, GPT Stores, AWS PartyRocks Apps and Lots of Free AI Tools"
- Altern: "Gateway to AI Discoveries"
- ainave: "navigate the world of AI with ease", curated AI Tools and AI News
- AI Search: Find AI Tools & Apps | Search The Most Complete AI Tools Directory | AI Search
- AiSuperSmart Ai Tool Directory: Find Ai Tools According to your Use Cases!
- HD Robots: AI tools directory with chatbot assistant
- AIForme: AI tools discovery platform with comparison feature
- Technologies in LabLab: list of AI tools suggested by lablab.ai for their hackathons
- Vondy - Next Generation AI Apps: collection of AI tools organized by tasks
- AI Tool Master List: directory maintained by ClickUp
- AI Valley: "The Newest AI Tools And Prompts"
- AI Finder: repository with more than 1500 AI tools
- BestWebbs: "one-stop destination for all AI Tools"
- Future Tools - Find The Exact AI Tool For Your Needs: list of AI tools
- Futurepedia - The Largest AI Tools Directory | Home: directory of AI tools
- There's An AI For That: AI database
- AI Depot - Discover New AI Tools: collection of AI tools organized by tags and presented in a card format
- Generative AI Database: a database in Notion with types, models, sectors, URLs, and APIs
- Altern - The place to discover new AI tools and products.
- The Generative AI Landscape: "a collection of awesome generative AI applications"
- The ultimate list of AI tools for creators | Descript: collection organized by Descript
- Maxim AI: a generative AI evaluation and observability platform
- Generative AI Explained by NVIDIA: A no-coding course by NVIDIA that presents Generative AI concepts and applications, as well as the challenges and opportunities in the field
- Paulescu/hands-on-rl: Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻🦸🏽
- DataCamp's Become a Generative AI Developer series: 9 code-alongs on building chatbots using LangChain and the OpenAI and Pinecone APIs, and working with the Hugging Face ecosystem. Free, for a limited time only.
- rasbt/LLMs-from-scratch: Implementing a ChatGPT-like LLM from scratch, step by step
- Introduction to Generative AI | SqillPlan: introduction to Generative AI, including models such as GANs, Variational Autoencoders, Autoregressive Models, and their applications, evaluation, ethics, and challenges
- udlbook/udlbook: Understanding Deep Learning by Professor Simon J.D. Prince
- Book: Understanding Deep Learning: website with the book draft and Google Colabs of the book by Simon J.D. Prince
- List of Generative AI Learning resources from AWS and Google: list organized as a LinkedIn post by Ankit Agarwal
- How AI chatbots like ChatGPT or Bard work – visual explainer | The Guardian
- [🔥🔥] Generative AI for Beginners: introductory 12 lesson course by Microsoft
- Introduction to Generative AI: series of Medium articles by Youssef Hosni
- Animated AI: animations and instructional videos about neural networks
- Deep Learning AI - Learn the fundamentals of generative AI for real-world applications: created in partnership with AWS, this course presents the fundamentals of how generative AI works and how to deploy it in real-world applications.
- Google Cloud Skills Boost - Introduction to Generative AI: an introductory level microlearning course covering Google Tools aimed at explaining what Generative AI is, how it is used, and how it differs from traditional machine learning methods.
- Google Cloud Skills Boost: Generative AI learning path: curated content on Generative AI "from the fundamentals of Large Language Models to how to create and deploy generative AI solutions on Google Cloud"
- AI for Industrial Design: "students at the National University of Singapore explore AI’s capability for design in a semester course and share what they learned. Directed by Donn Koh at the Division of Industrial Design, NUS."
- Let Us Show You How GPT Works — Using Jane Austen - The New York Times
- [🔥🔥🔥] ChatGPT Prompt Engineering for Developers - DeepLearning.AI: short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI) that provide best practices for prompt engineering
- [🔥🔥🔥] DAIR.AI: Democratizing Artificial Intelligence Research, Education, and Technologies
- Welcome to the 🤗 Deep Reinforcement Learning Course: a Hugging Face Course on Deep Reinforcement Learning
- Crash course in AI art generation by PromptHero: paid ($99) course focused on prompt engineering
- Visual intuition for diffusion models and AI art. #stablediffusionart #aiart #aiartwork #aiartcommunity
- The Illustrated Stable Diffusion by Jay Alammar: "gentle introduction [on] how Stable Diffusion works"
- [🔥]johnowhitaker/tglcourse: The Generative Landscape - a course on generative modelling (currently unfinished)
- Words are Images | BustBright - Machine Learning Art: 7-week Online class starting October 24th, 2022 by Derrick Schultz
- Grokking Stable Diffusion.ipynb - Colaboratory - Part 1: notebook by @johnowhitaker exploring Stable Diffusion details
- Grokking Stable Diffusion: Textual Inversion.ipynb - Colaboratory - Part 2: sequel to Grokking Stable Diffusion by @johnowhitaker that focus on Text Inversion
- GitHub - johnowhitaker/aiaiart: Course content and resources for the AIAIART course
- Implementation/tutorial of stable diffusion with side-by-side notes by labml.ai | Twitter
- Practical Deep Learning for Coders 2023 - Part II: continuation of the course focusing on the implementation of Stable Diffusion from scratch.
- Practical Deep Learning for Coders 2022 - Part I: "free course designed for people with some coding experience who want to learn how to apply deep learning and machine learning to practical problems" by Jeremy Howard
- UX for AI: How to Power Human Experiences with AI - Design Tool Tuesday - YouTube
- Behind-the-Design: Meet Copilot by Microsoft Design
- [🔥🔥🔥] [2310.07127] An HCI-Centric Survey and Taxonomy of Human-Generative-AI Interactions: "a survey of 154 papers, providing a novel taxonomy and analysis of Human-GenAI Interactions from both human and Gen-AI perspectives".
- Guidelines for Human-AI Interaction - Microsoft Research: a set of "18 generally applicable design guidelines for human-AI" interaction
- Paper Digest - ChatGPT: Recent Papers on ChatGPT
- dair-ai/ML-Papers-Explained: Explanation to key concepts in ML
- AI Reading List - Google Docs: reading list organized by Jack Soslow (@JackSoslow)
- Aman's AI Journal • Papers List: set of seminal AI/ML papers curated by Aman Chadha
- Casual GAN Papers Reading Club: Community knowledge base for Casual GAN Papers
- Casual GAN Papers: Easy to read summaries of popular AI papers
- The Illustrated VQGAN: illustrated explanation on how VQGAN works
- CLIP: Connecting Text and Images: OpenAI's explanation on how CLIP works
- VQGAN+CLIP — How does it work?. The synthetic imagery (“GAN Art”) scene… | by Alexa Steinbrück | Medium
- The Methods Corpus | Papers With Code
- https://ieeexplore.ieee.org/abstract/document/9043519: A State-of-the-Art Review on Image Synthesis With Generative Adversarial Networks
- Utilizando redes adversárias generativas (GANs) como agente de apoio à inspiração para artistas: Trabalho de Graduação de Cláudio Carvalho no Centro de Informática - UFPE
- GAN Lab: Play with Generative Adversarial Networks in Your Browser!
- [PDF] Music2Video: Automatic Generation of Music Video with fusion of audio and text | Semantic Scholar
- [PDF] Active Divergence with Generative Deep Learning - A Survey and Taxonomy | Semantic Scholar
- [PDF] Automating Generative Deep Learning for Artistic Purposes: Challenges and Opportunities | Semantic Scholar
- Lunroo: 45+ Free AI Tools for Social Media Marketing. Save your time on routine tasks using AI.
- COUNT: AI-powered accounting for small businesses
- Competitor Research: AI tool to help companies track their competitors
- StartKit.AI: Boilerplate for quickly building AI products
- No-Code Scraper: Data Scraping without Code - Seamlessly extract data from any website with just a few simple inputs.
- BacklinkGPT: AI-powered link-building platform that helps you generate personalized outreach messages for faster link building.
- VocalReplica: AI-Powered Vocal and Instrumental Isolation for Your Favorite Tracks
- LangMagic: Learn languages from native content.
- Persuva: Persuva is the AI-driven platform to create persuasive, high-converting ad copy at scale.
- Dittto.ai: Fix your hero copy with an AI trained on top SaaS websites.
- SEOByAI: Rank Faster on Google with FREE AI SEO Tools
- SinglebaseCloud: AI-powered backend platform with Vector DB, DocumentDB, Auth, and more to speed up app development.
- TrollyAI: Create professional SEO articles, 2x faster
- WebscrapeAI: Scrape any website without code using AI
- Architecture Helper: Analyze any building architecture, and generate your own custom styles, in seconds.
- AI-Flow: Connect multiple AI models easily
- Code to Flow: Visualize, Analyze, and Understand Your Code flow. Turn Code into Interactive Flowcharts with AI. Simplify Complex Logic Instantly.
- Recast Studio: AI-powered podcast marketing assistant.
- Clipwing: A tool for cutting long videos into dozens of short clips.
- Tailor: Get a daily podcast and newsletter, created for you by an AI
- ZZZ Code AI: AI-powered free website to get any programming question answered or code generated.
- Scribble Diffusion: turn your sketch into a refined image using AI
- Paint by Text: Edit your photos using written instructions, with the help of an AI.
- Scenario AI: AI-generated game assets
- AnimalAI: custom AI-generated animal portraits (profits are directed to various wildlife conservation organizations)
- starryai: AI Art Generator App - AI Art Maker
- ProsePainter: an interactive tool to "paint with words." It incorporates guidable text-to-image generation into a traditional digital painting interface
- ProsePainter: Image + Sketching Interface + CLIP! - YouTube
- Cocreator AI: creative computer agent (in wait list)
- Runway ML: AI video creation suite
- Hotpot.ai - Hotpot.ai: set of AI Tools to post-process images
- Toonify yourself by Justin Pinkney: turn a human face into a cartoon
- deepart.io: a online tool for applying style transfer
- Artbreeder: web-based tool to generate images by breeding existing images
- Ostagram.ru: image style transfer plataform
- cleanup.pictures: remove objects, people, text and defects from any picture for free
- remove.bg: remove background from images
- Quick, Draw!: can a neural network learn to recognize doodling? A game to help NL by adding users drawing
- Nekton.ai: automate your workflows with AI
- Documind.chat: Chat with PDF using AI. Documind is a powerful chat with pdf tool that lets you ask questions from your pdf documents.
- Snowpixel: Generate Images/Videos/Animations/Audio/Music/3D Objects with Text and/or Image. Upload your own data to create custom models.
- Chatpdf.so: Talk to PDF using GPT4 AI. Chatpdf.so is a chatpdf tool that lets you do question answering on your pdf documents.
- Yona.ai: Create deeply personalized AI chatbots from your own conversations, your stories, your data. You can harness the power of your chat history to build an AI companion for a nostalgic trip down memory lane, whimsical fantasies, or any other unique purpose.
- Voicesphere: Chat with your documents to get intelligent, context specific answers.
- Tune AI: AI chat app powered by open source models
- GPT Mobile GPT Mobile is an Android app that can chat with multiple LLMs at once! Currently supports ChatGPT, Anthropic Claude, and Google Gemini.
- [2402.17764] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
- mbzuai-oryx/MobiLlama: Small Language Model tailored for edge devices
microsoft/LMOps: General technology for enabling AI capabilities w/ LLMs and MLLMs
- F*** You, Show Me The Prompt: quickly understand inscrutable LLM frameworks by intercepting API calls
- danielmiessler/fabric: fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
- langfuse/langfuse: Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more
- naklecha/llama3-from-scratch: llama3 implementation one matrix multiplication at a time
- [2405.03825] Organizing a Society of Language Models: Structures and Mechanisms for Enhanced Collective Intelligence
- Open challenges in LLM research
- stanfordnlp/dspy: DSPy: The framework for programming — not prompting — foundation models
- Groq: service focused on fast inference speed, providing API access to Llama 2 70B-4K and Mixtral 8x7B-32K
- [🔥🔥🔥] LLMLingua: Designing a Language for LLMs via Prompt Compression
- Floom AI gateway and marketplace for developers, enables streamlined integration of AI features into products
- rasbt/LLMs-from-scratch: Implementing a ChatGPT-like LLM from scratch, step by step
- GoogleCloudPlatform/generative-ai: Sample code and notebooks for Generative AI on Google Cloud
- LLM Visualization
- Automatic Hallucination detection with SelfCheckGPT NLI
- StreamingLLM gives language models unlimited context: giving language models unlimited context
- iusztinpaul/hands-on-llms: learn about LLMs, LLMOps, and vector DBs for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴
- Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)
- Poe: a platform that lets people ask questions, get instant answers, and have back-and-forth conversations with a wide variety of AI-powered bots
- [2311.01555] Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers
- [🔥🔥] State of LLM Apps 2023 · Streamlit
- The architecture of today's LLM applications - The GitHub Blog
- Demystifying LLMs: How they can do things they weren't trained to do - The GitHub Blog
- How AI chatbots like ChatGPT or Bard work – visual explainer | The Guardian
- cpacker/MemGPT: teaching LLMs memory management for unbounded context [demo page] [arxiv]
- [2307.10169] Challenges and Applications of Large Language Models: a systematic set of open problems and application successes of LLM area
- Related resources from around the web | OpenAI Cookbook: tools and papers for improving outputs from GPT
- [🔥🔥🔥] Patterns for Building LLM-based Systems & Products: "practical patterns for integrating large language models (LLMs) into systems & products" by Eugene Yan
- Hannibal046/Awesome-LLM: Awesome-LLM: a curated list of Large Language Model
- [2309.06794] Cognitive Mirage: A Review of Hallucinations in Large Language Models
- Generative AI for Strategy & Innovation: an experiment about management theories with ChatGPT by Harvard Business Review Italia
- The TextFX project: "AI-powered tools for rappers, writers and wordsmiths" (partnership between Lupe Fiasco and Google)
- A jargon-free explanation of how AI large language models work | Ars Technica
- [🔥🔥🔥] What We Know About LLMs (Primer)
- A simple guide to fine-tuning Llama 2 | Brev docs
- microsoft/semantic-kernel: integrate cutting-edge LLM technology quickly and easily into your apps
- CoPrompt: platform for teams to use ChatGPT together
- [🔥🔥🔥] Emerging Architectures for LLM Applications | Andreessen Horowitz: "a reference architecture for the emerging LLM app stack"
- Advanced Guide to ChatGPT: guide by Neatprompts.com
- Falcon LLM - Home: a foundational large language model (LLM) with 40 billion parameters trained on one trillion tokens shared by Technology Innovation Institute from Abu Dhabi
- [🔥🔥🔥] The Hugging Face Open LLM Leaderboard: "the 🤗 Open LLM Leaderboard aims to track, rank and evaluate LLMs and chatbots as they are released"
- google/BIG-bench: "a collaborative benchmark intended to probe large language models and extrapolate their future capabilities"
- togethercomputer/OpenChatKit: provides an open-source base to create both specialized and general purpose chatbots for various applications
- Paper Digest - ChatGPT: Recent Papers on ChatGPT
- Let Us Show You How GPT Works — Using Jane Austen - The New York Times
- Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks | arxiv: "a novel framework called Search-in-the-Chain (SearChain) to improve the accuracy, credibility and traceability of LLM-generated content for multi-hop question answering"
- [🔥🔥🔥] Mooler0410/LLMsPracticalGuide: list of practical guide resources of LLMs based on the paper Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
- hpcaitech/ColossalAI: Making large AI models cheaper, faster and more accessible
- microsoft/LoRA: Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"s
- kyrolabs/awesome-langchain: 😎 Awesome list of tools and project with the awesome LangChain framework
- Stability AI Launches the First of its StableLM Suite of Language Models — Stability AI
- Free Dolly | The Databricks Blog: open source, instruction-following LLM, fine-tuned on a human-generated instruction dataset licensed for research and commercial use
- Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models: paper with "a comprehensive survey of ChatGPT and GPT-4 and their prospective applications across diverse domains"
- lm-sys/FastChat: The release repo for "Vicuna: An Open Chatbot Impressing GPT-4" [demo]
- [🔥🔥🔥] oobabooga/text-generation-webui: a gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion
- Why LLaMa Is A Big Deal | Hackaday: post that discusses the impact of LLaMa and Alpaca in popularizing LLMs and even using them in small hardware devices
- logspace-ai/langflow: a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows
- More than you've asked for: A Comprehensive Analysis of Novel Prompt Injection Threats to Application-Integrated Large Language Models: paper on LLM Security
- Cohere AI: a way to integrate state-of-the-art language models to applications
- Langchain for paper summarization: using langchain to build a app for paper summarization
- Red-Teaming Large Language Models | Hugging Faces: strategies for testing LLMs against jailbreaks and attacks
- hwchase17/langchain: "building applications with LLMs through composability"
- Top Large Language Models (LLMs) in 2023 | MarkTechPost: list with large language models from diverse companies
- Godly: Instant context for GPT3
- GPTZero: "Detect AI Plagiarism. Accurately"
- GPT-3 Apps: GPT-3 Powered Micro Products (ex: cat namer, poet pocket, summarize)
- Inside language models (from GPT-3 to PaLM) – Dr Alan D. Thompson – Life Architect
- Google AI Blog: Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrough Performance
- DeepMind says its new language model can beat others 25 times its size | MIT Technology Review
- Integrated AI: How to talk to AI for free using nine platforms (Megatron, GPT-3, GPT-J, Wudao, J1..) - YouTube by Dr Alan D. Thompson. The following references came from this video description
- Haystack: framework for building applications with LLMs and Transformers (e.g. agents, semantic search, question-answering)
- SolidUI: AI-generated visualization prototyping and editing platform, support 2D, 3D models, combined with LLM(Large Language Model) for quick editing.
- DSPy: Not Your Average Prompt Engineering: a post about the DSPy, a framework developed by the Stanford NLP group aimed at algorithmically optimizing language model prompts
- [🔥🔥🔥] stanfordnlp/dspy: DSPy: The framework for programming — not prompting — foundation models
- Anthropic's Prompt Engineering Interactive Tutorial
- ncwilson78/System-Prompt-Library: A library of shared system prompts for creating customized educational GPT agents.
- Promptstacks: a prompt engineering community
- Prompt engineering - OpenAI API: OpenAI's document with strategies and tactics for getting better results from large language models
- [2310.04438] A Brief History of Prompt: Leveraging Language Models: the paper presents an exploration of the evolution of prompt engineering. The author, Golam Md Muktadir, extensively used ChatGPT for content generation
- [2311.05661] Prompt Engineering a Prompt Engineer: this paper deals with the problem of "constructing a meta-prompt that more effectively guides LLMs to perform automatic prompt engineering"
- [2311.04155] Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
- [🔥🔥] Prompt Engineering Roadmap - roadmap.sh
- [🔥🔥🔥] Learn Prompting: series of lessons of prompt engineering
- [🔥🔥🔥] Prompt Engineering | Lil'Log: prompt engineering learning notes by Lilian Weng
- [🔥🔥🔥] ChatGPT Prompt Engineering for Developers - DeepLearning.AI: short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI) that provide best practices for prompt engineering
- [🔥🔥🔥] Prompt Engineering Guide: a project by DAIR.AI that intends to educate researchers and practitioners about prompt engineering
- the Book: collection of prompts and hints of prompt engineering
- dair-ai/Prompt-Engineering-Guide: Guide and resources for prompt engineering
- zou-group/textgrad: Automatic "Differentiation" via Text, using large language models to backpropagate textual gradients.
- [🔥🔥🔥] stanfordnlp/dspy: DSPy: The framework for programming — not prompting — foundation models
- vaibkumr/prompt-optimizer: Minimize LLM token complexity to save API costs and model computations.
- PromptPerfect: "Optimize Your Prompts to Perfection"
- [🔥🔥🔥] LLMLingua: Designing a Language for LLMs via Prompt Compression
- danielmiessler/fabric: fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
- ChatGPT for designers: ChatGPT Cheat Sheet V2 to craft better prompts
- [🔥] [2307.11760] Large Language Models Understand and Can be Enhanced by Emotional Stimuli
- [🔥] [2305.13252] "According to ..." Prompting Language Models Improves Quoting from Pre-Training Data
- [🔥] [2307.05300] Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration
- timqian/openprompt.co: Create. Use. Share. ChatGPT prompts
- 60 ChatGPT Prompts for Data Science (Tried, Tested, and Rated): post by Travis Tang from DataDrivenInvestor
- f/awesome-chatgpt-prompts: this repo includes ChatGPT prompt curation to use ChatGPT better
- brexhq/prompt-engineering: "Tips and tricks for working with Large Language Models like OpenAI's GPT-4"
- How to write an effective GPT-3 prompt | Zapier: a list of 6 GPT-3 tips for getting the desired output
- The Art of ChatGPT Prompting: A Guide to Crafting Clear and Effective Prompts: e-book by Fatih Kadir Akın (@fkadev)
- USP AI Prompt Book: Stable Diffusion v2.1 Prompt Book
- daspartho/prompt-extend: extending stable diffusion prompts with suitable style cues using text generation
- Prompt Box: "organize and save your AI prompts"
- Midjourney artist reference - Google Sheets
- Stable Diffusion Prompt Book — Stability.Ai: prompt book for Stable Diffusion v2.0 and v2.1 released by Stability.AI
- The Ultimate Stable Diffusion Prompt Guide by PromptHero
- CLIP Interrogator - a Hugging Face Space by pharma: image-to-text tool to figure out what a good prompt might be to create new images like an existing one
- [🔥🔥🔥] Prompt book for data lovers II - Google Slides: An open source exploration on text-to-image and data visualization
- some9000/StylePile: A helper script for AUTOMATIC1111/stable-diffusion-webui. Basically a mix and match to quickly get different results without wasting a lot of time writing prompts.
- Artists To Study | All images generated with Google Colab TPUs + CompVis/stable-diffusion-v1-4 + Huggingface Diffusers: a systematic study of artists' styles made by @camenduru
- CLIP retrieval for laion5B: CLIP retrieval using Laion5B. "It works by converting the text query to a CLIP embedding , then using that embedding to query a knn index of clip image embedddings".
- rom1504/clip-retrieval: Easily compute CLIP embeddings and build a CLIP retrieval system with them
- PromptDesign | Reddit: Reddit community for "the art of communicating with natural language models"
- Prompt Engineering and Zero-Shot/Few-Shot Learning [Guide] - inovex GmbH: prompt engineering for text generation
- clip-interrogator.ipynb - Colaboratory: a tool for image-to-prompt
- Useful Prompt Engineering tools and resources | Reddit
- PromptHero: Search the best prompts for Stable Diffusion, DALL-E and Midjourney
- promptoMANIA: AI art community with prompt generator
- Lexica: search over 10M+ Stable Diffusion images and prompts
- list of artists for SD v1.4 A-C / D-I / J-N / O-Z
- succinctly/text2image-prompt-generator · Hugging Face: a GPT-2 model fine-tuned on the succinctly/midjourney-prompts dataset, which contains 250k text prompts that users issued to the Midjourney text-to-image service over a month period
- The Prompter | vicc | Substack: a newsletter about news, tips and thoughts around prompt engineering
- (19) Nikhil Agrawal 📌 on Twitter: 11 AI Images Prompt websites to level up the image quality
- Phraser: a tool that support prompt creation
- PromptBase | Prompt Marketplace: PromptBase is a marketplace for DALL·E, Midjourney & GPT-3 prompts, where people can sell prompts and make money from their prompt crafting skills.
- Professional AI whisperers have launched a marketplace for DALL-E prompts - The Verge
- Visual Prompt Builder: simple deck of illustrated card to combine modifiers for prompt building
- Prompt Engineering Template - Google Sheets: spreadsheet with lists of modifiers for prompt building and a lot of interesting links for reference
- Prompt Engineering: From Words to Art - Saxifrage Blog
- DALL·Ery GALL·Ery Resources: DALL·E 2 and AI art prompt resources & tools to inspire beautiful images
- [2204.13988] A Taxonomy of Prompt Modifiers for Text-To-Image Generation
- List of Aesthetics | Aesthetics Wiki | Fandom
- Artist Directory (Volcano Comparison) | AI Art Creation Wiki | Fandom
- The DALL·E 2 Prompt Book – DALL·Ery GALL·Ery
- DALL·Ery GALL·Ery: A guide to OpenAI's DALL·E – prompts, projects, examples, and tips
- (2) MASSIVE 💥 DALL-E 2 ANIME ⚡︎ KEYWORDS + MODIFIERS LIST ★ : haaaaven: image prompt modifier collection by haaaaven
- DrawBench: a list of prompts the Google Imagen is organizing as a benchmark
- CLIP Prompt Engineering for Generative Art - matthewmcateer.me: list of styles tested with Quick CLIP Guided Diffusion
- Adobe should make a boring app for prompt engineers (Interconnected)
- [2206.00169] Discovering the Hidden Vocabulary of DALLE-2
- When SD just doesn't understand the prompt no matter how hard I try | Reddit
- It's very interesting how some prompts have very defined output but other specific ones are not | Reddit
- [2312.00752] Mamba: Linear-Time Sequence Modeling with Selective State Spaces: alternative to Transformer architecture.
- Mamba: A shallow dive into a new architecture for LLMs | by Geronimo (@geronimo7) | Dec, 2023 | Medium
- Mamba-Chat: A chat LLM based on the state-space model architecture
- PowerInfer: a high-speed inference engine for deploying LLMs locally
- [🔥🔥] Ollama: run Llama 2, Code Llama, and other models locally
- GPT4All: A free-to-use, locally running, privacy-aware chatbot. No GPU or internet required.
- LM Studio: Discover, download, and run local LLMs
- ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++
- Nexusflow/NexusRaven-V2-13B · Hugging Face: "surpassing GPT-4 for Zero-shot Function Calling"
- Featured GPTs: curated custom GPTs list for daily tasks
- AllGPTs: a directory to find GPTs
- ragapp/ragapp: an alternative to use Agentic RAG in enterprises
- LlamaParse: GenAI-native document parsing platform by LlamaIndex
- Retrieval-Augmented Generation for Large Language Models: A Survey
- weaviate/Verba: Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
- imartinez/privateGPT: "Interact with your documents using the power of GPT, 100% privately, no data leaks"
- pinecone-io/canopy: Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
- Forget RAG, the Future is RAG-Fusion: post by Adrian H. Raudaschl in Towards Data Science
- Rerankers and Two-Stage Retrieval | Pinecone
- Retrieval Augmented Generation | Pinecone
- dssjon/biblos: biblos.app: example of RAG architecture using semantic search and summarization for retrieving Bible passages
- 🪆 Introduction to Matryoshka Embedding Models
- Getting creative with embeddings by Amelia Wattenberger
- The Hidden Life of Embeddings: Linus Lee - YouTube
- neuml/txtai: semantic search and workflows powered by language models
- facebookresearch/faiss: A library for efficient similarity search and clustering of dense vectors
- Optimize Your Chatbot’s Conversational Intelligence Using GPT-3 | by Amogh Agastya | Better Programming: tutorial presenting semantic search concepts
- [🔥] whitead/paper-qa: "LLM Chain for answering questions from documents with citations", demo
- What is Semantic Search?
- Learning Center | Pinecone: Pinecone's guides to vector embeddings
- BLIP+CLIP | CLIP Interrogator | Kaggle: a Kaggle notebook for image description and captioning (imate-to-text)
- jerryjliu/gpt_index: GPT Index (LlamaIndex): a project to make it easier to use large external knowledge bases with LLMs
- Llama Hub: a repository of data loaders for LlamaIndex (GPT Index) and LangChain
- Chroma: an open-source AI-native database that makes it easy to use embeddings
- [2406.04784] SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
- [2406.04692] Mixture-of-Agents Enhances Large Language Model Capabilities
- MervinPraison/PraisonAI: PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collaboration.
- Practices for Governing Agentic AI Systems: paper by OpenAI that offers a set of practices for keeping agents’ operations safe and accountable.
- [2312.05230] Language Models, Agent Models, and World Models: The LAW for Machine Reasoning and Planning
- [2309.02427] Cognitive Architectures for Language Agents: "we draw on the rich history of cognitive science and symbolic artificial intelligence to propose Cognitive Architectures for Language Agents (CoALA)"
- [2309.07864] The Rise and Potential of Large Language Model Based Agents: A Survey
- [2310.01444] Adapting LLM Agents Through Communication
- [2309.17288] AutoAgents: A Framework for Automatic Agent Generation
- Exploring Multi-Persona Prompting for Better Outputs: "method of prompt engineering that instructs the LLM to summon multiple personas and have them work together to solve a task"
- Conceptual Framework for Autonomous Cognitive Entities: a paper that "introduces the Autonomous Cognitive Entity (ACE) model, a novel framework for a cognitive architecture, enabling machines and software agents to operate more independently"
- Mindstorms in Natural Language-Based Societies of Mind: a paper that evaluates the natural language-based societies of mind (NLSOMs), leveraging mindstorms in them to solve some practical AI tasks
- AutoGen | Microsoft: multi-agent conversation framework as a high-level abstraction by Microsoft [github]
- OpenBMB/ChatDev: create customized software using natural language idea (through llm-powered multi-agent collaboration)
- a16z-infra/ai-town: A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
- AI Town: a virtual town where AI characters live, chat and socialize.
- joonspk-research/generative_agents - Generative Agents: code for interactive simulacra of human behavior [arxiv]
- AgentBench: Evaluating LLMs as Agents: Hugging Face paper page on a benchmark to evaluate LLMs agents
- geekan/MetaGPT: the multi-agent framework that, give one line requirement, return PRD, design, tasks, repo
- GPT Researcher: AI agents for insights and research
- Multi-agent Simulation by Jim Fan on Twitter: "The next frontier of emergent intelligence will be multi-agent simulation: a crowd of AI characters carry out their daily lives through complex social interactions"
- Introducing AACP | SuperAGI: agent to agent communication protocol
- BrainstormGPT: AI multi-agent problem solving
- ChatArena: building multi-agent environments for LLMs
- [🔥🔥🔥] LLM Powered Autonomous Agents | Lil'Log: the LLM agents learning notes by Lilian Weng
- Vercel for AI agents: "help developers to build, deploy, and monitor AI agents, focusing on specialized AI agents that build software for you - your personal software developers"
- 101dotxyz/GPTeam: "GPTeam uses GPT-4 to create multiple agents who collaborate to achieve predefined goals"
- Fine-Tuner.ai: no code approach to build AI agents
- AI Agent Basics: Let’s Think Step By Step - by Jon Stokes
- [🔥🔥] Transformers Agent: provides a natural language API on top of Hugging Face's transformers library
- AgentGPT: "assemble, configure, and deploy autonomous AI Agents in your browser"
- yoheinakajima/babyagi: an AI-powered task management system that uses OpenAI and Pinecone APIs to create, prioritize, and execute tasks
- Torantulino/Auto-GPT: "an experimental open-source attempt to make GPT-4 fully autonomous"
- Generative Agents: Interactive Simulacra of Human Behavior: a paper that presents computational software agents that simulate believable human behavior
- microsoft/JARVIS: JARVIS, a system to connect LLMs with ML community
- HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace
- [2307.05300] Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration
- [2308.07201] ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate
- OpenBMB/ChatDev: create customized software using natural language idea (through llm-powered multi-agent collaboration)
- [2308.10848] AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors
- BrainSoup: multi-agent & multi-LLM client with RAG, multi-modality, automation, code interpreter, and sandboxed file system
- confident-ai/deepeval: The LLM Evaluation Framework
- LLM Benchmarks: MMLU, HellaSwag, BBH, and Beyond - Confident AI
- LLM Leaderboards
- Reward Bench Leaderboard - a Hugging Face Space by allenai
- LiveBench: A Challenging, Contamination-Free LLM Benchmark
- Evaluating Large Language Models: Methods, Best Practices & Tools | Lakera – Protecting AI teams that disrupt the world
- ianarawjo/ChainForge: An open-source visual programming environment for battle-testing prompts to LLMs.
- Prometheus-2 Cookbook - LlamaIndex: "An Open Source Language Model Specialized in Evaluating Other Language Models."
- [2305.13711] LLM-Eval: Unified Multi-Dimensional Automatic Evaluation for Open-Domain Conversations with Large Language Models
- LLM Evaluation: research on evaluation of LLMs conducted by Microsoft Research and other collaborated institutes. (Updated at: 2023/10)
- LLM Evaluation: Everything You Need To Run, Benchmark Evals
- The Ultimate Guide to LLM Product Evaluation
- How to Evaluate, Compare, and Optimize LLM Systems
- LLM Evaluation | Clarifai Guide
- How to Evaluate LLM Applications: The Complete Guide - Confident AI
- AI Evaluation Metrics | Microsoft Learn
- How to Evaluate Large Language Model Outputs: Current Best Practices | FinetuneDB
- The Ultimate Guide to LLM Evaluation | Deci
- Large Language Model Evaluation in 2024: 5 Methods
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators
- Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
- LLM Evaluation Metrics: Everything You Need for LLM Evaluation - Confident AI
- Criteria Evaluation | 🦜️🔗 LangChain
- Evaluation of LLMs - Part 1
- Evaluation of LLMs - Part 2
- The Crucial Role of Model Evaluation in LLM and AI Integrations
- MLGroupJLU/LLM-eval-survey: The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
- A Survey on Evaluation of Large Language Models | ACM Transactions on Intelligent Systems and Technology
- [2307.03109] A Survey on Evaluation of Large Language Models
- qcri/LLMeBench: Benchmarking Large Language Models
- TruLens for LLMs: Evaluate and Track LLM Applications
- LLM Testing Guide: Comprehensive Strategies for Testing and Behavior Analysis by Kolena
- Chatbot Arena: benchmarking LLMs through pairwise confrontation and evaluation
- [2311.12022] GPQA: A Graduate-Level Google-Proof Q&A Benchmark
- OpenAI Cookbook: Evaluating RAG systems | by Ravi Theja | Nov, 2023 | LlamaIndex Blog
- Amazon will offer human benchmarking teams to test AI models - The Verge
- [2311.05020] First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models: "that meaningful evaluation informed by actual use is still an open problem"
- [2311.12983] GAIA: a benchmark for General AI Assistants
- Sharing LangSmith Benchmarks
- [2311.09247] Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks
- vectara/hallucination-leaderboard: "leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents"
- [2305.16938] Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation
- LLM Comparison/Test: 39 models tested (7B-70B + ChatGPT/GPT-4)
- LLM Evaluation at Scale – Airtrain: no-code batch compute platform for LLM evaluation and tuning workloads
- How to evaluate a summarization task | OpenAI Cookbook
- openai/evals: Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
- Red teaming and model evaluations | Anthropic
- Challenges in evaluating AI systems | Anthropic
- Evaluating LLMs is a minefield: talk by Princeton professor Arvind Narayanan
- Eden AI: provides a unique API connected to the AI engines
- Dify: LLMOps platform for creating and operating AI-native apps based on GPT-4
- LLM App: LLM App is a Python library that helps you build real-time AI-powered data pipelines with few lines of code.
- An AI Engineer’s Guide to Machine Learning and Generative AI | by ai geek (wishesh) | Oct, 2023 | Medium
- Marvin: AI engineering framework for building natural language interfaces
- Instructor: library for structured LLM extraction in Python
- One AI: an NLP-as-a-service platform
- LangSmith: a developer platform for deploying LLM apps
- [2310.04451] AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models
- MITRE ATLAS™: knowledge base of adversary tactics and techniques based on real-world attack observations and realistic demonstrations from AI red teams and security groups, modeled after the MITRE ATT&CK® framework.
- OWASP Top 10 for Large Language Model Applications: the Open Worldwide Application Security Project's list related to LLMs [Youtube video]
- Scalable Extraction of Training Data from (Production) Language Models: extracting training data from ChatGPT [webpage]
- The Emerging Attacks on Large Language Models (LLMs): "key attack vectors that threat actors can exploit to compromise or manipulate LLMs".
- Adversarial Attacks on LLMs | Lil'Log
- Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
- Attacking Large Language Models: an overview of the current attack techniques on LLMs by Marcello Carboni
- corca-ai/awesome-llm-security: A curation of awesome tools, documents and projects about LLM Security.
- Adversarial Prompting: a list of adversarial prompts attacks by Prompt Engineering Guide
- LangChain Cheatsheet: All Secrets on a Single Page | by Ivan Reznikov | Nov, 2023 | Towards AI
- LangChain Template: Research Assistant
- Embedchain: Framework to create ChatGPT like bots over your dataset
- FlowiseAI: "Open source UI visual tool to build your customized LLM flow using LangchainJS, written in Node Typescript/Javascript"
- Langchain for paper summarization
- LangChain Docs: Python library that helps building applications with LLMs through composability
- Getting started with LangChain | by Avra | Feb, 2023 | Medium: A powerful tool for working with Large Language Models
- Advanced Guide to ChatGPT: guide by Neatprompts.com
- [🔥] 104 Growth Hacking Swipe (ChatGPT): set of ChatGPT prompts for design, products and marketing
- acheong08's list / Awesome ChatGPT: list of wrappers for accessing ChatGPT in platform such as Discord, Telegram, and languages such as Python, JS.
- [🔥🔥🔥] Awesome ChatGPT Prompts: repo that includes curated ChatGPT prompts to obtain better results from ChatGPT
- ("Publicly announced ChatGPT variants and competitors: a thread" / Twitter: a Twitter thread by @goodside with alternatives to ChatGPT
- danielmiessler/fabric: fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere
- Jack AI: AI Marketing Copywriter tool
- aiPDF: The most advanced AI document assistant
- AICamp: ChatGPT for Teams
- Yomu: AI writing assistant for students and academics
- Google Sheets Formula Generator: Forget about frustrating formulas in Google Sheets.
- Elephas: Personal AI writing assistant for the Mac.
- Lemmy: Autonomous AI Assistant for Work.
- Fable Fiesta: Creative AI writing assistant
- Plus AI for Google Slides: Create AI-powered presentations in Google Slides
- ChatBotKit: toolkit to build AI chat bots
- Boring Report: "an app that uses AI to remove sensationalism from the news and makes it boring to read"
- ChatPDF - Chat with any PDF!: upload a PDF file and make questions about it #semanticsearch
- Character.AI: platform for creating and talking to advanced AI Characters
- SlidesAI: "create presentation slides with AI in minutes"
- Rationale: decision-making tool powered by the latest GPT and in-context learning
- DetangleAI: AI-generated summaries of provided legal docs
- GPT-2 Output Detector: tool that estimate is a given text is real or generated by GPT
- HyperWrite: a personal writing assistant with suggestions and sentence completions
- DeepStory: A tale of co-creation between man & machine
- InferKit
- CopyHat
- Lucid Lyrics - AI Assisted Art: AI-Assisted Lyrical Interpretations by Walter Arnold
- Authors A.I.: AI-powered text analysis
- Rytr: Rytr is an AI writing assistant that helps creating content
- Charisma: Charisma is a platform for creating interactive stories with believable virtual characters
- Riku.AI | The vault for your A.I. creations
- First look - Riku.ai - inference platform Mar/2022 - J1, GPT-3, Fairseq-13B, GPT-NeoX-20B, Cohere-XL - YouTube
- Taskade: Taskade is an AI outliner and mind map generator for teams with built-in AI chat
- AI Story Generator (Advance Options) Create Unique and Engaging Stories Instantly with Customized Tone, Genre, and Narration.
- AI Story Generator: Free and fast online AI-powered story generator that writes short stories for you
- AI Story Generate: Generate stories using LLM with custom emotion, genre, and word count.
- Composum AI Plugin for CMS Adobe Experience Manager (AEM) or Composum Pages helping the editor to create / edit / translate texts
- danielmiessler/fabric: fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
- AI Research Tools | x post: Some AI tools that can be used for research/teaching
- Unlocking productivity and personalizing learning with AI | Microsoft EDU
- Sourcely: Academic Citation Finding Tool with AI
- GummySearch: AI-based customer research via Reddit. Discover problems to solve, sentiment on current solutions, and people who want to buy your product.
- [2310.17143] Supercharging academic writing with generative AI: framework, techniques, and caveats
- Elicit: automate research workflow for literature review
- Paper Brain: summarizer for paper parts. The user needs to copy and paste into their interface.
- Explainpaper: "Upload a paper, highlight confusing text, get an explanation"
- Paper Player: A new way for busy scientists and technologists to consume open science
- TalkToPapers - namuan/dr-doc-search: Converse with book - Built with GPT-3: a github util where AI will do the paper reading for you instead
- hwaseem04/Research-digest: Research paper summariser application for our hackathon
- whitead/paper-qa: "LLM Chain for answering questions from documents with citations"
- Metaphor: search engine that "understands language — in the form of prompts — so you can say what you're looking for in all the expressive and creative ways"
- MemFree - Open Source Hybrid AI Search Engine, Instantly Get Accurate Answers from the Internet, Bookmarks, Notes, and Docs. Support One-Click Deployment.
- The FLUX.1 family of models – Replicate
- ToTheBeginning/PuLID: Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
- Edit Your Image: Find all the trending and useful Gradio demos that you can use to edit your images
- OutfitAnyone - a Hugging Face Space by HumanAIGC: Ultra-high quality virtual try-on for Any Clothing and Any Person
- StockPhotoAI.net: Great stock photos, made for you.
- Transforming 2D Images into 3D with the AdaMPI AI Model: guide on how to use the AdaMPI AI model for creating 3D photos from 2D images
- deep-floyd/IF: open-source text-to-image model with a high degree of photorealism and language understanding by Stability.AI
- Word-As-Image for Semantic Typography: semantically transforming fonts into illustrations
- Scribble Diffusion: turn your sketch into a refined image using AI
- Muse: Text-To-Image Generation via Masked Generative Transformers
- openai/point-e: OpenAI's point cloud diffusion for 3D model synthesis
- [arxiv/2211.11319] VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models
- Parrot Zone: a database of image synthesis references
- Image Synth Link List: a collection of links organized by the collective parrot zone
- [🔥🔥🔥] Ai generative art tools: a massive list of shared Google Colab notebooks and tools organized by @pharampsychotic
- Introduction — PyTTI-Tools
- pyttitools-PYTTI.ipynb - Colaboratory
- pixray/pixray: Pixray is an image generation system
- pixray/pixray_notebooks: pixray demo notebooks
- dribnet/pixray-text2image – Run with an API on Replicate
- sberbank-ai/ru-dalle: Generate images from texts. In Russian.
- Pyttipanna: visual interface for Pytti by @_staus. Pytti is created by @sportsracer48
- Imagen: Google's Text-to-Image Diffusion Models
- Make-A-Scene: Meta's creative control for AI image generation
- Stable Diffusion: Stability.Ai's text-to-image model that is a breakthrough in speed and quality meaning that it can run on consumer GPUs
- CLIPasso: Semantically-Aware Object Sketching
- DreamFusion / Twitter: Text-to-3D using 2D Diffusion paper
- apple/ml-no-token-left-behind: PyTorch Implementation of No Token Left Behind: Explainability-Aided Image Classification and Generation
- disco-diffusion/Local_Disco_Diffusion_v4_1.ipynb at main · Midgraph/disco-diffusion
- Audio to keyframe string: this tool is used to generate strings for the keyframes of AI animation notebooks, such as this VQGAN+CLIP Animations notebook, using the volume of audio tracks.
- [🔥] S2ML Image Generator: evolution of the first VQGAN+CLIP Google Colab notebook by Katherine Crownson maintained by Justin Bennington
- [🔥] Create Variations on Images With Looking Glass 1.1 (ru-DALLE) - YouTube | Artificial Images
- [🔥] Looking Glass 1.1 (ru-DALLE): Making ruDALL-E fine tuning quick and painless. Copyright (C) 2021 Bearsharktopus Studios
- NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion (ML Research Paper Explained) - YouTube | Yannic Kilcher
- [🔥] yuval-alaluf/hyperstyle: Official Implementation for "HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing" https://arxiv.org/abs/2111.15666
- [🔥] Vadim Epstein’s Aphantasia library: CLIP + FFT/DWT/RGB = text to image/video
- mikaelalafriz/lucid-sonic-dreams: syncs GAN-generated visuals to music
- Greg Surma - Portfolio
- crowsonkb (Katherine Crowson): who wrote the tutorial of VQGAN+CLIP
- DALL·E: Creating Images from Text
- DALL-E mini: DALL·E mini is an AI model that generates images from any prompt you give!
- DALL-E mini GitHub
- DALL-E mini Project Report
- CLIPIT PixelDraw - Colaboratory
- CLIP Guided Diffusion HQ 512x512.ipynb - Colaboratory
- Smooth Transitioning Between Position / Rotation / Zoom and Text Inputs by Keyframing Parameters: A Proof of Concept [15,000 Frames] : deepdream
- neural-dream Alternatives and Similar Photos & Graphics Apps | AlternativeTo
- CoG 21: Adversarial Reinforcement Learning for Procedural Content Generation
- GitHub Repositories of Hugging Face
- Complete guide to samplers in Stable Diffusion - Félix Sanz
- Stable Diffusion Models: list of custom Stable Diffusion models
- Stable Diffusion KLMC2 Animation.ipynb forked: fork by @DigThatData
- Stable Diffusion KLMC2 Animation.ipynb: notebook by @RiversHaveWings to generate animation based on scripted prompts using a technique called KLMC2 discretization of underdamped Langevin dynamics
- DETEXTIFY: A Python library to remove unwanted pseudo-text from images generated by your favorite generative AI models (Stable Diffusion, Midjourney, DALL·E)
- InvokeAI: Stable Diffusion Toolkit and application that runs Windows, Mac and Linux machines, and on GPU cards with as little as 4 GB or RAM
- Stability.ai REST API Documentation: service provided by Stability.ai. DreamStudio authentication required to access this REST API
- [🔥🔥🔥] SD GUIDE FOR ARTISTS AND NON-ARTISTS - Google Docs: a Google Docs with in-depth tips, tricks, tutorials and more related to Stable Diffusion
- [NEWS]Canva Adds a Free and Unlimited AI Text-to-Image Generator | PetaPixel
- prompthero/midjourney-v4-diffusion · Hugging Face: Stable Diffusion fine tuned on Midjourney v4 images, by PromptHero
- CHARL-E: Run Stable Diffusion on your M1 Mac
- The Illustrated Stable Diffusion: explained by Jay Alammar (Visualizing machine learning one concept at a time)
- Img To Music a Hugging Face Space by fffiloni
- Atlas KREA Stable Diffusion: An explorable map of KREA AI's Stable Diffusion Search Engine
- TheLastBen/fast-stable-diffusion: fast-stable-diffusion, +25-50% speed increase + memory efficient + DreamBooth
- NovelAI Improvements on Stable Diffusion | by NovelAI | Oct, 2022 | Medium
- ashawkey/stable-dreamfusion: A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion.
- [🔥🔥🔥] JoePenna/Dreambooth-Stable-Diffusion: Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion (tweaks focused on training faces)
- [🔥🔥🔥] DreamBooth: fine tuning text-to-image diffusion models for subject-driven generation
- [🔥] Arki's Stable Diffusion Guides
- examples/stable-diffusion-finetuning at main · LambdaLabsML/examples: Fine Tuning Stable Diffusion
- lkwq007/stablediffusion-infinity: Outpainting with Stable Diffusion on an infinite canvas
- [🔥🔥🔥] ML News Stable Diffusion Takes Over! (Open Source AI Art) by Yannic Kilcher - YouTube: video with examples, updates, and discussion about the impact of Stable Diffusion
- Diffusion Models in Vision: A Survey | DeepAI: paper about the diffusion techniques which also discuss the relation with other generative deep learning models
- ThereforeGames/txt2mask: Automatically create masks for Stable Diffusion inpainting using natural language
- basujindal/stable-diffusion: Optimized Stable Diffusion modified to run on lower GPU VRAM
- Stable WarpFusion v0.5 (restricted to patreons): conditioning video frames with Stable Diffusion by @devdef
- nateraw/stable-diffusion-videos: Create videos with Stable Diffusion by exploring the latent space and morphing between text prompts
- dreamlike.art: image generator based on Stable Diffusion with fine-tuned models such as Dreamlike Photoreal 2.0. Users receive 1 credit per hour up to 50 credits
- AITWO.CO: a AI-powered design platform with multiple features
- aiimagegenerator.org: free AI art generator that supports Stable Diffusion txt2img and img2img generation, drawing and inpainting
- InteriorAIDesigns: a platform which allows the easy redesign of rooms.
- Playground AI: frontend for Stable Diffusion with 1000 image generations per day
- Astria: tailor-made AI image generation
- drawanyone: generate drawings based on five input images
- DiffusionBee: stable diffusion GUI App
- getimg.ai: Generate photo-realistic images from text using Stable Diffusion
- Enstil: Fast, open, AI-generated images
- Dezgo - Text-to-Image AI generator
- PhotoAIStudio: a AI-powered photoshot platform with multiple styles
- Baseten: Stable Diffusion Demo
- DreamStudio: Frontend for Stable Diffusion API by Stability.ai
- Pollinations - pollinations/stable-diffusion-private
- tencentarc/gfpgan – Run with an API on Replicate
- andreasjansson/stable-diffusion-wip – Run with an API on Replicate
- stability-ai/stable-diffusion – Run with an API on Replicate
- Osmosis.Studio : web-based content-aware collaborative design tool for generating AI ads that sell real products
- Artistic.wtf: stable diffusion GUI App
- Prodia: Stable diffusion-based art generator that does not require signup
- ComicsMaker.ai: Stable diffusion-based comic book generator with support for text2img, img2img, inpainting and controlnet
- POTO.AI: Finetune Stable Difussion model as AI Photographer to generate headshots, portrait and couple wedding photos
- camenduru/stable-diffusion-webui-colab: collection of stable diffusion webui colab for different checkpoints
- StableDiffusion_WebUI_Simplified.ipynb: versão em português do notebook para rodar a Web UI do Stable Diffusion no Google Colab de graça
- GitHub - AUTOMATIC1111/stable-diffusion-webui: Stable Diffusion web UI: expanded Stable Diffusion web UI
- GitHub - sd-webui/stable-diffusion-webui: Stable Diffusion web UI
- Stable_Diffusion_WebUi_Simplified.ipynb - Colaboratory
- GitHub - awesome-stable-diffusion/awesome-stable-diffusion: Curated list of resources for the Stable Diffusion AI Model
- Stable Diffusion General Updates Posted by u/ImeniSottoITreni | Reddit: a general update on all the "most important" news/repos available
- List of Stable Diffusion systems | Reddit
- Stable Diffusion Akashic Records | Maks-s/sd-akashic: A compendium of information regarding Stable Diffusion (SD)
- 1 week of Stable Diffusion | multimodal.art
- Voldy Guide: detailed beginners guide for Stable Diffusion
- Dreamer's Guide to Getting Started w/ Stable Diffusion! | Reddit
- A collection of sites using Stable Diffusion (and other handy links) | Reddit
- Prompt+: extended textual conditioning in text-to-image generation [unofficial repo] [arxiv] [page]
- A Beginner's Guide to Line Detection and Image Transformation with ControlNet
- Scribble Diffusion: turn your sketch into a refined image using AI (based on ControlNet)
- rinongal/textual_inversion: repo contains the official code, data and sample inversions of Textual Inversion paper
- 2208.01618 An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion: paper that describes the Textual Inversion technique
- sd-concepts-library (Stable Diffusion concepts library): Stable Diffusion Textual Inversion Concepts Library - browse through objects and styles taught by the community to Stable Diffusion and use them in your prompts!
- AI Profile Pictures: paid service for generating profile pictures using AI
- Training Stable Diffusion with Dreambooth using Diffusers: experiments to analyze the effect of different settings in Dreambooth
- fast-DreamBooth.ipynb - Colaboratory: train custom concepts from input images with this simplified DreamBooth colab
- (1166) Como Criar Artes Incríveis com o seu Próprio Rosto Usando o Dreambooth! DE FORMA FÁCIL E DE GRAÇA! - YouTube: tutorial in Portuguese on how to train DreamBooth with your own face
- [🔥🔥🔥] Parseq: parameter sequencer for Stable Diffusion [Youtube Tutorials]
- deforum-art/sd-webui-deforum: Deforum extension for AUTOMATIC1111's Stable Diffusion webui [wiki docs]
- Deforum Stable Diffusion Animation - v5 Math Functions - Demo and Test - YouTube
- Deforum Stable Diffusion: generating videos from scripted prompts
- (5) Deforum notebook v0.5 for Stable Diffusion animations is out! Now with math automation, perspective flips, prompt weights, video masking and waifus! : StableDiffusion
- De-painting historical photographs | Reddit
- img2img animation with hands | Reddit
- VID 2 VID user script | Reddit
- Seamless textures AI generator for Blender by Antonio Freyre | Twitter
- "Shattered" by Ronny Khalil | Twitter: using warp fusion to generate a shattered glass effect
- Acid Dance by aiplague | Twitter
- [Fused video by @remi_molettee](https://twitter.com/remi_molettee/status/1568245586494738432)
- Animation with Dall-e + AE | Reddit: Patent drawing of an electronic device that ...
- You Describe & AI Photoshops Faces For You [StyleCLIP] - YouTube
- Experimental Films + Machine Learning Week 7 Part 1 (Aphantasia with OpenAI CLIP) - YouTube
- GitHub - Sanster/lama-cleaner: Image inpainting tool powered by SOTA AI Model
- AgaMiko/pixel_character_generator: Generating retro pixel game characters with Generative Adversarial Networks. Dataset "TinyHero" included.
- Wilco Sierra: A platform that generates engineering challenges for software engineers using GPT.
- Remini - AI Photo Enhancer: photo and video enhancer
- AI Image Upscaler - Enlarge & Enhance Your Photos for Free - Upscale.media: simple free alternative for image upscaling
- Topaz Labs: AI Image Quality Software: "professional grade workflow, with many features" (this is an affiliate link by nejcsusec.beehiiv.com).
- AI Image Upscaler - Upscale Photo, Cartoons in Batch Free: "free, browser-based, with five credits per day" reference by nejcsusec.beehiiv.com
- Why you should upscale your images: comparing different tools
- Model Database - Upscale Wiki: list of models for upscaling images
- Gigapixel AI: paid AI image upscaler delivering enhanced detail and resolution
- Image Super-Resolution
- Upscale to huge sizes and add detail with SD Upscale : StableDiffusion: tutorial on Reddit
- sczhou/codeformer: face restoration algorithm for old photos and AI-generated faces
- TencentARC/GFPGAN: GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration
- Segment Anything | Meta AI: "a new AI model from Meta AI that can "cut out" any object, in any image, with a single click"
- Sora: OpenAI's text-to-video model [technical report]
- SDV (Stable Diffusion Image To Video): generates 3 seconds of video in about 30 seconds using an A100 GPU on Colab+.
- [Emu Video | Meta ](https://emu-video.metademolab.com/demo#/demo): state-of-the-art text-to-video generation
- AILab-CVC/VideoCrafter: Open Diffusion Models for High-Quality Video Generation
- Ssemble: collaborative video editor with a collection of AI plugins
- Transforming 2D Images into 3D with the AdaMPI AI Model: guide on how to use the AdaMPI AI model for creating 3D photos from 2D images
- Nathan Lands on Twitter: "AI video has started to produce mindblowing results and could eventually disrupt Hollywood / Twitter: Twitter thread with examples of Generative AI tools for video
- Stable Animation SDK: a text-to-animation tool for developers by Stability AI [dev platform]
- Twelve Labs: multimodal, contextual understanding for video search
- Align your Latents: high-resolution video synthesis with latent diffusion models [arxiv]
- Gen-2 by Runway: "a multi-modal AI system that can generate novel videos with text, images, or video clips" [arxiv]
- CiaraRowles/TemporalNet · Hugging Face: a ControlNet model designed to enhance the temporal consistency of generated outputs [tweet]
- Video-P2P UI - a Hugging Face Space by video-p2p-library: video editing with cross-attention control [tweet]
- Text2Video-Zero - a Hugging Face Space by PAIR: zero-shot text-to-video synthesis diffusion framework [tweet] [arxiv]
- ModelScope - a Hugging Face Space by damo-vilab: text-to-video synthesis [page]
- neural frames: tools for animation creation inspired on deforum
- [🔥] dmarx/video-killed-the-radio-star: Notebook and tools for end-to-end automation of music video production with generative AI
- [🔥🔥🔥] Phenaki – Google Research: realistic video generation from open-domain textual descriptions
- THUDM/CogVideo: text-to-video generation
- baowenbo/DAIN: Depth-Aware Video Frame Interpolation (CVPR 2019)
- Dain-App 1.0 [Nvidia Only] by GRisk: Depth-Aware Video Frame Interpolation (CVPR 2019)
- Content Studio AI: Faceless Video Generator
- StemGen: A music generation model that listens
- Mustango: "Toward Controllable Text-to-Music Generation"
- Lyria by Google DeepMind: "transforming the future of music creation"
- Suno AI: "make any song you can imagine"
- Riffusion: this AI system generates singing voice for literally any text as input
- Stable Audio - Generative AI for music & sound fx
- An early look our AI Music experiment - YouTube Blog
- What's Generative Music? - Generative Music AI - YouTube
- Ultimate Vocal Remover: vocal removal using AI
- Introducing Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance
- MusicGen: Meta's tool for generating music
- facebookresearch/audiocraft: a library for audio processing and generation with deep learning.
- AudioGPT | arxiv: Understanding and Generating Speech, Music, Sound, and Talking Head [code] [demo]
- AudioLDM: Text-to-Audio Generation with Latent Diffusion Models - Speech Research
- lucidrains/musiclm-pytorch: Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
- [🔥🔥🔥] archinetai/audio-ai-timeline: A timeline of the latest AI models for audio generation, starting in 2023
- MusicLM: generating music from text
- Harmonai's Dance Diffusion: Open-Source AI Audio Generation Tool For Music Producers – Weights & Biases
- Dance Diffusion: the Hugging Face Space by harmonai
- MubertAI/Mubert-Text-to-Music: a simple notebook demonstrating prompt-based music generation via Mubert API
- DDSP-VST: Neural Audio Synthesis for All
- LOVO AI: AI Voiceover & Text to Speech Platform with human-like voices
- AIVA: The AI composing emotional soundtrack music
- Jukebox: "a neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles"
- Magenta: Music and Art Generation with Machine Intelligence
- magenta/magenta: Magenta's official GitHub repository
- AI Image to sound [Melobytes.com]
- archinetai/audio-diffusion-pytorch: Audio generation using diffusion models, in PyTorch
- Parler-TTS: fully open-source high-quality TTS
- p0n1/epub_to_audiobook: EPUB to audiobook converter, optimized for Audiobookshelf
- The "Voice Cloning AIs" they never tell you about (and how they work): Youtube video by @bycloud summarizing the available technologies for voice cloning
- Voice-Swap: transform vocals to match the style of a list of singers
- Shaunwei/RealChar: AI Character/Companion in Realtime
- UneeQ Digital Humans: 3D character lib synced
- AI Voice Generator: free online AI-powered text-to-speech generator that creates voice overs with natural, realistic voices
- KangweiiLiu/Awesome_Audio-driven_Talking-Face-Generation: A curated list of resources of audio-driven talking face generation
- Play.ht: "AI voice generator and realistic text to speech online"
- Murf AI | AI Voice Generator: versatile text to tpeech software
- VALL-E: synthesize high-quality personalized speech with only a 3-second samples
- [🔥] Eleven Labs Beta: a TTS service that adds emotion to the generated voice
- neonbjb/tortoise-tts: "A multi-voice TTS system trained with an emphasis on quality"
- Studio D-ID: create video with still images synced with text-to-speech tool [#avatar]
- Synthesia: AI Video Generation Platform [#avatar]
- Speech Studio - Microsoft Azure: Microsoft's cloud cognitive services
- Introducing Universal-1: multilingual speech-to-text
- ggerganov/whisper.cpp: Port of OpenAI's Whisper model in C/C++. It can be executed locally.
- Good Tape: paid service for transcription
- shashikg/WhisperS2T: An Optimized Speech-to-Text Pipeline for the Whisper Model
- Vaibhavs10/insanely-fast-whisper: accelerates transcription with the combination of OpenAI's Whisper Large v2, HF Transformers, Optimum, and flash attention
- facebookresearch/seamless_communication: Foundational Models for State-of-the-Art Speech and Text Translation
- LeMUR: a single API, enabling developers to reason over their spoken data with a few lines of code
- The Generative AI Revolution in Games | Andreessen Horowitz: this article presents a list of use cases of generative AI in games
- AI for Game Development: Creating a Farming Game in 5 Days. Part 1
- Archie: AI-Driven Product Architect that Designs and Plans Software Applications
- DhiWise: DhiWise is an app development platform that automates coding tasks, letting developers focus on core functionalities.
- New study on coding behavior raises questions about impact of AI on software development – GeekWire
- CostGPT: Software Development Cost Calculator: "find the cost, time and the best tech stack for any kind of software, tools that you want to build using the power of AI"
- codefuse-ai/Awesome-Code-LLM: a curated list of language modeling researches for code and related datasets.
- tldraw/draw-a-ui: draw a mockup and generate HTML for it
- deepseek-ai/DeepSeek-Coder: a tool that experiments the motto "let the code write itself"
- Cody: AI coding assistant
- Kombai: generate UI code per component from Figma
- geekan/MetaGPT: the multi-agent framework that, give one line requirement, return PRD, design, tasks, repo
- ZZZ Code AI: AI-powered free website to get any programming question answered or code generated.
- Rapidpages: create React & Tailwind landing pages using AI
- Teaching Programming in the Age of ChatGPT – O’Reilly
- GPT Web App Generator: generates a webapp from a title, description, and other simple parameters
- wolfia-app/gpt-code-search: search a codebase with natural language using AI
- Dedicated File for Inbox for GenAI + Dev: a list for further analysis and organization of GenAI + dev references
- e2b-dev/e2b: "Open-source platform for building AI-powered virtual software developers"
- Metabob: Generative AI to improve and automate code reviews
- gventuri/pandas-ai: Pandas AI is a Python library that integrates LLMs capabilities into Pandas, making dataframes conversational
- A Systematic Evaluation of Large Language Models of Code: arxiv paper
- pgosar/ChatGDB: "Harness the power of ChatGPT inside the GDB debugger"
- The Impact of AI on Developer Productivity: Evidence from GitHub Copilot | arxiv
- openai/openai-cookbook: Examples and guides for using the OpenAI API
- Reduce costs when prompting using GPT
- Co-Developer GPT engine - local r/w file access and execute actions from an OpenAI GPT
- [2406.09403] Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
- BradyFU/Awesome-Multimodal-Large-Language-Models: Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
- NExT-Chat: An LMM for Chat, Detection and Segmentation
- roboflow/awesome-openai-vision-api-experiments: Examples showing how to use the OpenAI vision API to run inference on images, video files and webcam streams
- Microsoft KOSMOS-2: new capabilities of perceiving object descriptions (e.g., bounding boxes) and grounding text to the visual world [HF demo] [arxiv]
- Segment Anything | Meta AI: "a new AI model from Meta AI that can "cut out" any object, in any image, with a single click"
- facebookresearch/ImageBind: ImageBind One Embedding Space to Bind Them All
- Ego-Exo4D: a foundational dataset by Meta for research on video learning and multimodal perception Dataset Download
- Carolina: General Corpus of Contemporary Brazilian Portuguese with provenance and typology information - Corpus Geral do Português Brasileiro Contemporâneo
- RedPajama-Data-v2 by Together AI: an open dataset with 30 trillion tokens for training Large Language Models
- Have I Been Trained?: tool for searching 5.8 billion images used to train popular AI art models
- laion-aesthetic-6pls: exploring 12 million of the 2.3 billion images used to train Stable Diffusion's image generator
- CLIP retrieval for laion5B: CLIP retrieval using Laion5B. "It works by converting the text query to a CLIP embedding , then using that embedding to query a knn index of clip image embedddings".
- rom1504/clip-retrieval: Easily compute CLIP embeddings and build a CLIP retrieval system with them
- LAION: Large-scale Artificial Intelligence Open Network
- gabolsgabs/DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes
- Hassan El Mghari (@nutlope) / X: the creator of roomgpt
- science on Instagram: “Human evolution generated by AI Stable Diffusion”
- Deep Music Visualizer
- Lucid Sonic Dreams (@lucidsonicdreams)
- Artificial Images: Demos and explanations to make art using machine learning
- Glenn Marshall Neural Art
- How to Generate Art - Intro to Deep Learning #8
- dvschultz: Derrick Schultz's GitHub
- dvschultz/ml-art-colabs: collection of Google Colab Notebooks for ML Arts
- [🔥] Structured State Space for Sequence Modeling (S4): Stole generation from the gods
- Ai Generated Music Video - Deltron 3030 - Virus - YouTube
- Artificial Realities: Coral / Twitter: artwork by @refikanadol commissioned by World Economic Forum
- [🔥] Creep - YouTube by Glenn Marshall Neural Art: how did they translated the images using VQGAN+CLIP? How did they seamlessly wander on the latent space?
- 35 Artists Using AI With Under 1000 Followers That You Need To Follow Today / Twitter
- Computer Vision Art Gallery : CVPR 2021: artworks dealing with computer vision technologies
- Confluence: a generative art project by Devi Parikh on BrainDrops.
- Learning to See – Memo Akten | Mehmet Selim Akten | The Mega Super Awesome Visuals Company
- Alien Dreams: An Emerging Art Scene - ML@B Blog
- Neural Zoo | Sofia Crespo
- KRЯRL DЯAWINGS: Runway ML -- 3rd "Model" (based on long poses)
- Frea Buckler ~ Artist: obras usadas para criar essa rede (19) derrick has started yet another project on Twitter: "Just sent @buntworthy a demo StyleGAN model I trained / Twitter
- (Non-)Human
- Authentic Digital Art - Unknown Departure | SuperRare
- A Selection of Machine Learning Art Inspiration
- Top 25 AI Artists of 2021 (Photos, Profiles & History of AI Art)- AIArtists.org: AIArtists.org showcases leading artists using Artificial Intelligence, tools to make AI Art, and a timeline of AI Art History.
- Helena Sarin – Artist Profile (Photos, Videos, Exhibitions) — AIArtists.org
- Images Generated By AI Machines (@images_ai) / Twitter
- https://www.instagram.com/refikanadol/
- The Steampunk Circus Human Machine Collaboration - Video, Sound and Stories with AI / YouTube
- Hannibal046/Awesome-LLM: Awesome-LLM: a curated list of Large Language Model
- AlexChalakov/awesome-generative-ai-companies: a curated list of Gеnerative AI companies, sorted by focus area and total fundraised amount
- kyrolabs/awesome-langchain: 😎 Awesome list of tools and project with the awesome LangChain framework
- KangweiiLiu/Awesome_Audio-driven_Talking-Face-Generation: A curated list of resources of audio-driven talking face generation
- [🔥] amrzv/awesome-colab-notebooks: Collection of google colaboratory notebooks for fast and easy experiments
- [🔥🔥🔥] steven2358/awesome-generative-ai: A curated list of modern Generative Artificial Intelligence projects and services
- [🔥🔥🔥] jonathandinu/awesome-ai-art: "A list of AI Art courses, tools, libraries, people, and places"
- margaretmz/awesome-ai-art-design: An awesome list: AI for art and design.
- toxtli/awesome-machine-learning-jupyter-notebooks-for-colab: A curated list of Machine Learning and Deep Learning tutorials in Jupyter Notebook format ready to run in Google Colaboratory
- chaosreactor/awesome-generative-ai: An awesome list of low- and no-code generative AI resources
- [🔥] altryne/awesome-ai-art-image-synthesis: A list of awesome tools, ideas, prompt engineering tools, colabs, models, and helpers for the prompt designer playing with aiArt and image synthesis. Covers Dalle2, MidJourney, StableDiffusion, and open source tools.
- justinpinkney/awesome-pretrained-stylegan2: A collection of pre-trained StyleGAN 2 models to download
- fMRI-to-image: tweet by danberridge "The 'presented images' were shown to a group of humans. The 'reconstructed images' were the result of an fMRI output to Stable Diffusion. In other words, Stable Diffusion literally read people's minds."
- 7 ways to load external data into Google Colab | by B. Chen | Towards Data Science
- 10 tricks for a better Google Colab experience | by Cyprien NIELLY | Towards Data Science
- Quickly share ML WebApps from Google Colab using ngrok for Free | by AbdulMajedRaja RS | Towards Data Science
- Jupyter Widgets for Interactivity in Google Colab: notebook with examples of using Jupyter Widgets in Colab, allowing interactive inputs
- Jupyter Widgets official documentation
- MuckBrass: Find & Validate Startup Ideas using AI
- ResumeDive: A resume boosting service using AI
- Owlbot: AI Support Agent
- fynk: AI powered contract management software
- Taskbase: Virtual assistants packaged with AI powered software.
- AI Wedding Toast: Generate a personalized wedding speech with AI
- Interviews Chat: Your Personal Interview Prep & Copilot
- Inline Help: Answer customer questions before they ask
- LinkActions: AI Internal Links Assistant
- Marblism: Generate a SaaS boilerplate from a prompt
- SiteSpeakAI: Automate your customer support with AI
- Room Reinvented: Transform your room effortlessly with Room Reinvented! Upload a photo and let AI create over 30 stunning interior styles. Elevate your space today.
- FairyTailAI: Personalized bedtime story generator
- PromptPal: Search for prompts and bots, then use them with your favourite AI. All in one place.
- Never Jobless LinkedIn Message Generator: Maximize Your Interview Chances with AI-Powered LinkedIn Messaging.
- Aispect: New way to experience events.
- SiteGPT: Make AI your expert customer support agent.
- PressPulse AI: Get personalized media coverage leads every morning.
- GPTHelp.ai: ChatGPT for your website / AI customer support chatbot.
- chaiNNer-org/chaiNNer: A node-based image processing and AI upscaling GUI that makes it easy to chain together complex processing tasks
- BIRME: Bulk Image Resizing Made Easy 2.0 (Online & Free)
- The Art of PNG Glitch
- HashLips/hashlips_art_engine: tool used to create multiple different instances of artworks based on provided layers
- Taplio: The all-in-one, AI-powered LinkedIn tool.
- Galichat.com: AI Support Assistant that helps you grow your business.
- Aidbase - AI-Powered Support for your SaaS startup.
- Why you should use Topological Data Analysis over t-SNE or UMAP?
- YingfanWang/PaCMAP: PaCMAP: Large-scale Dimension Reduction Technique Preserving Both Global and Local Structure
- UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction
- Visualizing Data using t-SNE
- (1166) A Hackers' Guide to Language Models - YouTube
- [🔥🔥] Generative AI for Beginners: introductory 12 lesson course by Microsoft
- Introduction to Generative AI: series of Medium articles by Youssef Hosni
- Prompt Engineering Roadmap - roadmap.sh
- Prompt Engineering Guide | Learn Prompting: Your Guide to Communicating with AI
- Short Courses | Learn Generative AI from DeepLearning.AI
Contributions welcome! Read the contribution guidelines first.
To the extent possible under law, Filipe Calegario has waived all copyright and related or neighboring rights to this work.