hariexcel / Sarvadnya

This repo is a collection of various PoCs (Proof-of-Concepts) to interface custom data using LLMs.

Sarvadnya (सर्वज्ञ), an All-Knowing Micro SaaS!!

Chatbots can be a real WoW!! The recent evidence: ChatGPT. With the latest LLMs (Large Language Models), they are now more human-like. But these LLMs are pretrained on their own (HUGE) data, and mere mortals don't have the means ($$, time, expertise) to train their own. Some LLMs can be fine-tuned on a custom corpus, but only to a limited extent. Many providers now offer custom fine-tuning on text documents.

Stretch (RnD) goals:

Pathways

  • Enterprise: Google Cloud: Gen AI, Doc AI, Vertex AI: Skills Boost paths, Professional ML Certification
  • Open Source: Langchain, HuggingFace, Streamlit: Custom fine-tuned models

Why LangChain-based implementations?

  • Local (secure), no over-the-net API/web calls
  • Open source, Free via HuggingFace
  • Python!! end-to-end, with Streamlit as UI
  • Huge support, community, opportunities
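As a rough illustration of what "interfacing custom data with an LLM" without training can look like, here is a minimal sketch in plain Python. It is not code from this repo and it does not use LangChain's API. The function names (`tokenize`, `cosine`, `retrieve`, `build_prompt`) and the sample documents are hypothetical, and a simple bag-of-words similarity stands in for the embedding search a real pipeline would likely use. The sketch only picks the most relevant custom text and builds a prompt; sending that prompt to a local model is left out.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, k=1):
    """Return the k chunks most similar to the question (hypothetical helper)."""
    q = Counter(tokenize(question))
    ranked = sorted(
        chunks,
        key=lambda c: cosine(q, Counter(tokenize(c))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, chunks, k=1):
    """Put the retrieved custom text in front of the question."""
    context = "\n".join(retrieve(question, chunks, k))
    return (
        "Answer using only this context:\n"
        f"{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Made-up sample "custom data", for illustration only.
docs = [
    "Invoices must be paid within 30 days of the billing date.",
    "Employees get 20 days of paid leave per year.",
]
prompt = build_prompt("How many days of paid leave do employees get?", docs)
print(prompt)
```

The resulting prompt string could then be passed to whichever locally hosted model is in use, so the custom text never leaves the machine.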

Publications so far

References

Bottom-line

  • Not looking for Success, but Wonder!!
  • तमसो मा ज्योतिर्गमय: From Darkness (hidden in text data) to Light (insights)

About

License: MIT License

Languages

  • Jupyter Notebook 90.2%
  • HTML 5.6%
  • Python 4.2%
  • Dockerfile 0.0%
  • Shell 0.0%