Media Cloud (mediacloud)

An open-source platform for studying media ecosystems.

Home Page: https://mediacloud.org/

Twitter: @media_cloud

Media Cloud's repositories

backend

Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online media.

Language: Python | License: AGPL-3.0 | Stargazers: 277 | Issues: 35 | Issues: 613

sentence-splitter

Text-to-sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.

Language: Python | License: NOASSERTION | Stargazers: 222 | Issues: 7 | Issues: 7

ultimate-sitemap-parser

Ultimate Website Sitemap Parser

Language: Python | License: NOASSERTION | Stargazers: 173 | Issues: 11 | Issues: 31

cliff-annotator

A lightweight server to allow HTTP requests to the Stanford Named Entity Recognizer and a heavily modified CLAVIN geoparser.

Language: Java | License: Apache-2.0 | Stargazers: 119 | Issues: 11 | Issues: 74

api-client

Public client for consuming content from the Media Cloud Online News Archive & Directory.

Language: Python | License: Apache-2.0 | Stargazers: 68 | Issues: 21 | Issues: 67

web-tools

The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)

Language: JavaScript | License: Apache-2.0 | Stargazers: 63 | Issues: 8 | Issues: 1631

nyt-news-labeler

Tag news stories based on models trained on the NYT corpus.

Language: Python | License: Apache-2.0 | Stargazers: 39 | Issues: 9 | Issues: 9

api-tutorial-notebooks

A set of Jupyter notebooks demonstrating how to use the Media Cloud API.

Language: Jupyter Notebook | Stargazers: 33 | Issues: 3 | Issues: 1

feed_seeker

Find RSS, Atom, XML, and RDF feeds on webpages.

Language: Python | License: MIT | Stargazers: 31 | Issues: 12 | Issues: 8

metadata-lib

How Media Cloud approaches extracting metadata from online news stories

Language: Python | License: Apache-2.0 | Stargazers: 11 | Issues: 8 | Issues: 47

web-search

Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory.

Language: JavaScript | License: Apache-2.0 | Stargazers: 9 | Issues: 6 | Issues: 216

cliff-api-client

A Python client for the CLIFF geoparsing tool

Language: Python | License: MIT | Stargazers: 5 | Issues: 12 | Issues: 4

rss-fetcher

Intelligently fetch lists of URLs from a large collection of RSS Feeds as part of the Media Cloud Directory.

Language: Python | License: Apache-2.0 | Stargazers: 5 | Issues: 6 | Issues: 36

wayback-news-client

A client library to access the Wayback Machine news archive search.

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 5 | Issues: 11

glimpse

Get a glimpse of attention to a topic on social media.

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 2 | Issues: 9

postgresql-citus-aws-graviton2

PostgreSQL built for AWS Graviton2

word-embeddings-server

Helpful micro-service to return results from word2vec models

Language: Python | License: MIT | Stargazers: 2 | Issues: 4 | Issues: 8

cliff-homepage

A simple homepage for the CLIFF project

Language: HTML | License: MIT | Stargazers: 1 | Issues: 8 | Issues: 0

news-search-api

Internal API server that offers search access to the Media Cloud Online News Archive (in Elasticsearch).

Language: Python | License: AGPL-3.0 | Stargazers: 1 | Issues: 0 | Issues: 0

sous-chef

Configurable Data Analytics Pipeline

story-indexer

The core pipeline used to ingest online news stories in the Media Cloud archive.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 5 | Issues: 169

a-catchall-page

Dokku app that serves a static HTML catch-all page, displayed for bad domains

Language: HTML | Stargazers: 0 | Issues: 4 | Issues: 0

backend-temporal-server-config

Temporal server configuration

Stargazers: 0 | Issues: 3 | Issues: 0

backup-collection-maker

Notebook demonstrating how to create and update a Media Cloud collection.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 2 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

mc-providers

Internal library to allow querying multiple media platforms with a consistent API.

Language: Python | Stargazers: 0 | Issues: 5 | Issues: 13

mediacloud-news-client

An internal client library to access the new Media Cloud news archive search.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 2

sc-buffet

Sous-chef buffet - Self-service data access for sous-chef.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

system-metrics

Daily performance metrics for the Media Cloud application

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

wal-g-aws-graviton2

WAL-G built for AWS Graviton2

Stargazers: 0 | Issues: 2 | Issues: 0