HURIDOCS (huridocs)

HURIDOCS

huridocs

Geek Repo

HURIDOCS equips human rights defenders with tools to mobilise information for justice and accountability.

Location:Geneva, Switzerland

Home Page:http://www.huridocs.org/

Github PK Tool:Github PK Tool

HURIDOCS's repositories

uwazi

Uwazi is a web-based, open-source solution for building and sharing document collections

Language:TypeScriptLicense:MITStargazers:225Issues:28Issues:3366

casebox

Casebox: Secure all your information and team communication in one place

Language:JavaScriptLicense:NOASSERTIONStargazers:50Issues:16Issues:0

pdf-document-layout-analysis

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.

Language:PythonLicense:Apache-2.0Stargazers:31Issues:6Issues:5

OpenEvSys

OpenEvSys is free open source software designed for use by organisations who need a software tool to manage information on human rights violations

Language:PHPLicense:AGPL-3.0Stargazers:30Issues:25Issues:0

preserve

Preserve is a tool for capturing and saving online digital content. Integrated with Uwazi, Preserve captures content from websites, social media and communication platforms, and archives them with accompanying key metadata to ensure evidentiary value by establishing and demonstrating authenticity and chain of custody.

Language:TypeScriptLicense:MITStargazers:6Issues:4Issues:81

pdf-text-extraction

This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of text extraction from PDF files.

Language:PythonLicense:Apache-2.0Stargazers:4Issues:6Issues:0

pdf_metadata_extraction

pdf_information_extraction

Language:TypeScriptLicense:Apache-2.0Stargazers:3Issues:11Issues:0
Stargazers:3Issues:0Issues:0

pdf-table-of-contents-extractor

This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of identifying and structuring the document's TOC.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:5Issues:0

python_uwazi_API

Python API to interact with Uwazi

pdf_ocr_service

An http service to OCR PDFs based on a redis queue.

Language:PythonLicense:MITStargazers:1Issues:7Issues:3

twitter_crawler

twitter crawler

Language:PythonStargazers:1Issues:7Issues:0
Language:PythonStargazers:0Issues:16Issues:0
Language:HTMLStargazers:0Issues:7Issues:0

convert-to-pdf-service

An http service to convert documents to PDF based on a redis queue.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:6Issues:0

mock-semantic-ml-server

Mock server that simulates the ML server that processes documents for semantic search

Language:JavaScriptStargazers:0Issues:0Issues:0

oe-db-restore-helper

Helper script used to restore/import databases of OE instances based on user-uploaded backups.

Language:PHPStargazers:0Issues:2Issues:0

react-text-selection-handler

text selection handling and highlighting

Language:TypeScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0