AnnthomyGILLES / Awesome_LLM

Awesome LLM - Personal review

Background

Vector databases

  • Vector databases (Part 1): What makes each one different?
  • Vector databases (Part 2): Understanding their internals: The article explains why vector databases have become more important in the era of large language models (LLMs) such as ChatGPT. LLMs generate coherent text but sometimes produce inaccurate results; pairing them with a vector database, which stores data as vector embeddings, gives them access to more accurate and up-to-date information. The article also covers the technical side: embeddings, similarity computation, indexing, and hybrid search systems that combine traditional keyword search with vector search.
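
To make the ideas of embeddings, similarity computation, and hybrid search concrete, here is a minimal sketch in plain Python. It is not taken from the article: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the names `keyword_score` and `hybrid_search` are hypothetical, and the `alpha` weight that blends the two scores is an assumption chosen for illustration. A real vector database would also use an index rather than scoring every document.

```python
import math
from collections import Counter
from typing import List


def embed(text: str) -> Counter:
    """Toy 'embedding': a bag-of-words count vector (stand-in for a real model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def keyword_score(query: str, doc: str) -> float:
    """Fraction of distinct query words that appear in the document."""
    q_words = set(query.lower().split())
    d_words = set(doc.lower().split())
    return len(q_words & d_words) / len(q_words) if q_words else 0.0


def hybrid_search(query: str, docs: List[str], alpha: float = 0.5, k: int = 2) -> List[str]:
    """Rank docs by a weighted blend of vector and keyword scores (brute force)."""
    q_vec = embed(query)
    scored = [
        (alpha * cosine(q_vec, embed(doc)) + (1 - alpha) * keyword_score(query, doc), doc)
        for doc in docs
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]


docs = [
    "vector databases store embeddings",
    "keyword search uses inverted indexes",
    "cats sleep all day",
]
top = hybrid_search("vector embeddings", docs, k=1)
```

With a learned embedding model in place of `embed`, the vector score would capture semantic similarity that exact keyword overlap misses, which is the motivation for combining the two.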

Theoretical Understanding

Transformer and Attention

LLMs Limitations & Challenges

  • Open challenges in LLM research: The article surveys current challenges and research directions for large language models (LLMs). It highlights 10 key areas: 1) Reducing the models' tendency to fabricate information (hallucinations), 2) Optimizing how context is used and how responses are generated, 3) Incorporating multiple data types such as text and images (multimodality), 4) Making LLMs faster and cheaper, 5) Designing new model architectures beyond the Transformer, 6) Exploring alternatives to traditional GPU-based deep learning, 7) Making agents (LLMs that can perform tasks) more usable, 8) Refining how models learn from human preferences, 9) Improving the efficiency of chat interfaces, and 10) Developing LLMs for languages other than English.

Build

  • Building Scalable Large Language Model (LLM) Apps
    • The article covers how to build scalable LLM applications. It emphasizes FastAPI's benefits and LangChain's scalability challenges. For scalability it recommends vendor-provided VectorStores, or Redis and Qdrant; it underscores the value of semantic caching with GPTCache and suggests Microsoft Guidance for controlled LLM outputs. For document processing, unstructured.io is the top recommendation.
  • 🦜️ LangChain + Streamlit🔥+ Llama 🦙: Bringing Conversational AI to Your Local Machine 🤯
    • Integrating Open Source LLMs and LangChain for Free Generative Question Answering (No API Key required)
  • All You Need to Know to Build Your First LLM App
  • Building LLM applications for production
    • This post consists of three parts.
      • Part 1 discusses the key challenges of productionizing LLM applications and the solutions the author has seen.
      • Part 2 discusses how to compose multiple tasks with control flows (e.g. if statements, for loops) and incorporate tools (e.g. SQL executor, bash, web browsers, third-party APIs) for more complex and powerful applications.
      • Part 3 covers some of the promising use cases that the author has seen companies building on top of LLMs and how to construct them from smaller tasks.
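
The idea in Part 2 of composing tasks with control flow and tools can be sketched as follows. This is an illustrative assumption, not code from the post: `fake_llm` is a hypothetical stand-in for a real model call, and `calculator`, `search`, `TOOLS`, and `run_pipeline` are made-up names. The point is only that the model's output selects a branch, and ordinary `for` and `if` statements do the orchestration.

```python
from typing import Callable, Dict, List


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call: returns the name of a tool to use."""
    return "calculator" if any(ch.isdigit() for ch in prompt) else "search"


def calculator(expr: str) -> str:
    """Toy tool: only handles expressions of the form 'a + b'."""
    left, right = expr.split("+")
    return str(int(left) + int(right))


def search(query: str) -> str:
    """Toy tool: a stub in place of a web search or third-party API."""
    return f"(stub) top result for: {query}"


# Registry mapping tool names (as the model would emit them) to callables.
TOOLS: Dict[str, Callable[[str], str]] = {"calculator": calculator, "search": search}


def run_pipeline(tasks: List[str]) -> List[str]:
    """Run each task through the model, then route it to the chosen tool."""
    results = []
    for task in tasks:                  # for loop: one step per task
        tool_name = fake_llm(task)      # the model's output picks the branch
        if tool_name in TOOLS:          # if statement: dispatch to a tool
            results.append(TOOLS[tool_name](task))
        else:
            results.append("no tool available")
    return results


answers = run_pipeline(["2 + 3", "what is RAG"])
```

In a real application the routing decision would come from parsing the model's response, so the fallback branch for an unknown tool name matters more than it does in this stub.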

Transformers

For RAG

For Time Series

Applied LLMs

In Medicine

  • Large language models in medicine: the potentials and pitfalls: The paper explores the potential of large language models (LLMs) such as ChatGPT in medicine. They show promise in tasks such as answering patient queries and producing medical documentation, but they also raise challenges such as bias and privacy concerns. Proposed solutions include domain-specific models and human oversight. The paper explains how LLMs work, provides guidance on medical applications, and emphasizes the need for rigorous evaluation. It urges healthcare professionals to stay informed about LLM advancements.
