There are 2 repositories under the gemini-flash topic.
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Deploy your private Gemini application for free with one click, supporting Gemini 1.5 and Gemini 2.0 models.
A desktop application that extracts YouTube playlist transcripts and enhances them using Google's Gemini AI models. The output is a book in any language you want.
Vanilla JS web interface for Gemini 2.0 flash-exp Multimodal API with text, audio, camera, screen inputs and audio responses and function calling
Co-create a PowerPoint presentation with Generative AI
Simplified Gemini for Claude Code.
A lightweight Python API wrapper and CLI for Google’s Gemini language models.
Autospec is an open-source AI agent that takes a web app URL and autonomously QAs it, and saves its passing specs as E2E test code
The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.
Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash
Gemini Pro: An AI-powered Telegram bot script for generating text and image-based responses using Gemini AI
A multi-agent application that provides a comprehensive financial/market analysis of any company
This project enables real-time streaming of audio (and optionally video or screen captures) from your local device to Google Gemini using the Live API. It allows you to interact with Gemini through both text and voice, supporting conversational AI responses.
AI agent for creating personalized digests of research papers
Google Gemini Voice/Vision Assistant with the gemini-1.5-pro and gemini-1.5-flash models
A Streamlit-based chatbot application using Gemini models for NLP. Features include light/dark mode toggle, model selection (Gemini 1.5 Flash, 1.5 Pro, 1.0 Pro), adjustable parameters (temperature, top-p, top-k, max tokens), secure API key input, and an interactive chat interface with history.
AI-powered flashcard generator built with React, Gemini, and Cloudflare Workers. Create and customize quiz content seamlessly for an interactive learning experience.
This project provides a custom keyboard for iPhones, built using UIKit, and a companion app built with SwiftUI for customization options. The keyboard features a built-in search bar that integrates with the Gemini-1.5 flash API to quickly answer questions or generate content for commenting or replying.
This repository provides a framework to integrate internet search capabilities with a Large Language Model (LLM), specifically using the Gemini 1.5 API. This allows the LLM to fetch and use real-time data from the internet to enhance its responses to user queries.
T20: Multi-Agent. Orchestrator-delegate model. TAS. Goal -> plan -> delegate. Agents: Gemini family. Autonomous, traceable. Logs sessions. CLI. Usage: `t20-cli "goal string"`. Artifacts of high value.
GitPilot is an intelligent AI-powered Git assistant that bridges the gap between natural language and Git commands. It's designed to make Git more accessible and efficient by allowing developers to express their intentions in plain English, while providing context-aware suggestions and safety checks.
This repository contains the VS Code extension for the main project, GitPilot. You can find the main repository here: https://github.com/InflixOP/GitPilot
NeoExamShield
⚡Powered by Google TPUs and the latest (Aug 27, 2024) Gemini 1.5 Pro and Flash models to generate high-quality engineered prompts, analyze text and images, and create datasets for fine-tuning AI models, helping you become a prompt engineering pro
This repository contains a transformer-based model for real-time American Sign Language (ASL) recognition. The model leverages transformer architecture to interpret ASL gestures and utilizes the Gemini-Pro LLM API for constructing sentences from recognized ASL signs.
This project explores visual question answering using the Gemini LLM, with images supplied by URL or in other formats
Clean-architecture Flutter-based AI chatbot designed to be a powerful conversational AI interface and compatible with any AI model.
Heavily modified version of the ChatGPT Discord bot, based on Zero6992's code and using gpt4free library providers
Python script for AI image generation using Imagen 3 and Google Gen AI SDK. Generates images from text prompts, saves locally.
Segment objects from images using natural language and Gemini Flash
A curated list of resources, tools, apps, and power-user workflows for Google Gemini