normal-computing / fuji-web

Fuji is an AI agent that lives in your browser's sidepanel. You can now get tasks done online with a single command!

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Fuji-web Logo

Fuji-Web: AI based Full Browser Automation 🗻

Fuji-Web is an intelligent AI partner that understands the user’s intent, navigates websites autonomously, and executes tasks on the user’s behalf while explaining each action step.

Demo

FujiWebCompressed.mp4

How does it work?

Please read our blog post for a demo video, benchmarks and deep-dive technical overview!

Installing and Running

Download and Install the extension in your browser

  1. Go to the releases page, find the latest version of the extension and download "fuji-extension.zip".
  2. Unzip the file.
  3. Load your extension on Chrome by doing the following:
    1. Navigate to chrome://extensions/
    2. Toggle Developer mode
    3. Click on Load unpacked extension
    4. Select the unzipped folder

Use the extension

Please note that you may need to refresh the page for the extension to work.

  1. Find the Fuji icon in the top right corner of your browser and click on it to open the sidepanel.
  2. Create or access an existing OpenAI API Key and paste it in the provided box. This key will be stored in your browser, and will not be uploaded to a third party.
  3. Finally, navigate to a webpage you want Fuji-Web and type in the task you want it to perform.

Note: all prompts (text and image) are sent directly to the API of your selection. Fuji-Web does not attempt to collect any information from you.

Build the extension

If you want to build the extension from source, follow these instructions:

  1. Ensure you have Node.js. The development was done on Node v20 but it should work with some lower versions.
  2. Clone this repository
  3. Install pnpm globally: npm install -g pnpm
  4. Run pnpm install
  5. Run pnpm dev to start the development server, or pnpm build to build the extension.

When loading the extension, you will need to load the dist folder created by the build process.

Roadmap

  • Expose API for easy integration with browser automation frameworks (e.g. Puppeteer, Playwright, Selenium)
  • Add support for more complex & cross-tab workflows
  • Add support for more browsing behaviors (select from dropdown, extract content from entire page etc.)
  • Add support for saving workflows
  • Add support for sharing workflows & instructions with others
  • Create wikipedia-like knowledge base where users can work together to create knowledge that can improve the Fuji-Web's performance

Troubleshooting

Check out our Troubleshooting Guide for help with common problems.

Contributing

Interested in contributing to Fuji-Web? We'd love your help! Check out our Contribution Guide for guidelines on how to contribute, report bugs, suggest enhancements, and more.

We also have set up a dedicated channel for Fuji-Web feedback on Discord at https://discord.gg/yfMjZ8udb5.

Credits

About

Fuji is an AI agent that lives in your browser's sidepanel. You can now get tasks done online with a single command!

License:Apache License 2.0


Languages

Language:TypeScript 93.3%Language:JavaScript 4.6%Language:CSS 1.1%Language:HTML 0.7%Language:SCSS 0.3%Language:Shell 0.1%