Repositories under the html-to-text topic:
Standalone .NET converter library that does not require Adobe Acrobat components or Microsoft Office Interop assemblies; converts PDF, DOCX, XLSX, HTML, image, CSV, RTF, and TXT files in the .NET Framework
Python library for converting HTML to markup or plain text
This web search engine is an attempt at a miniature version of popular search engines such as Google, Bing, or YouTube. It relies on several data structures to return accurate results efficiently. First, a web crawler written in Python fetches the web pages and stores them as the source collection. Next, every page is converted to a plain-text file so that its content is easier to scan. Finally, an inverted index is built that links each word to the text files containing it. The inverted index is implemented with HashMap, Java's key-value data structure.
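The convert-then-index steps described above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: it is written in Python, with a `dict` of sets standing in for the Java HashMap, and the names `TextExtractor`, `html_to_text`, `build_inverted_index`, and `search` are hypothetical. It also assumes the pages are already in memory as strings rather than fetched by a crawler.

```python
from collections import defaultdict
from html.parser import HTMLParser
import re


class TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML page, skipping script/style bodies."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    """Strip markup and collapse runs of whitespace into single spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def build_inverted_index(pages):
    """Map each lowercase word to the set of page names that contain it."""
    index = defaultdict(set)
    for name, html in pages.items():
        for word in re.findall(r"[a-z0-9]+", html_to_text(html).lower()):
            index[word].add(name)
    return index


def search(index, query):
    """Return the pages that contain every word of the query (AND semantics)."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results
```

Calling `build_inverted_index` on a mapping of page names to HTML strings and then passing the result to `search` returns the names of the pages matching all query words. The AND semantics and the simple alphanumeric tokenizer are assumptions made for the sketch; the original project may rank or tokenize differently.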
A simple utility to convert HTML into text, keeping as much content as possible
MERN Ecommerce Carpets Shop (Front-end)
This program takes a *.mbox or .txt file containing emails downloaded with Google Takeout, extracts the relevant information, and writes it to a *.txt file.
A Java-based search engine that searches a database using several data structure concepts.
A console-based web search engine developed in Java.
Code and data for SORE (ACL 2025), a semantic boilerplate remover.
I'm an aspiring Full Stack Developer specializing in the MERN stack (React.js, Next.js, Node.js, PostgreSQL, Prisma ORM). I build real-world projects with clean, efficient code, focusing on modern UI/UX and robust backend solutions. Proficient in JavaScript, Python, and Java.
backend work with users and contacts