meisin / Resources-in-Finance-and-Economics-Domain

A library listing down available resources for the Finance and Economics domain

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Resources in Finance and Economics Domain

A library listing down available resources for the Finance and Economics domain

Pre-trained Language Models

  1. FinBERT: A pre-trained Financial Language Representation Model for Financial Text Mining(https://www.ijcai.org/proceedings/2020/0622.pdf)

Financial Corpora

Corpus Description
1. FinancialWeb From CommonCrawl
2. FINWEB
3. YahooFinance
4. RedditFinanceQA

FinancialPhrase Bank FinTextSen FinCausal2020 Numeral Understanding (FinNum-1/NTCIR-14) Numerical Attachment (FinNum-2/NTCIR-15)

Loughran & McDonald’s Dictionary(https://sraf.nd.edu/textual-analysis/resources/#Master%20Dictionary)

Fine-tuning tasks involving Financial dataset

  1. FinSBD-2019 (IJCAI 2019 FinNLP challenge) Task: Financial Sentence Boundary Detection Description: contains financial text that had been pre-segmented automatically. There are 953 distinct beginning tokens and 207 distinct ending tokens in the training and dev sets of FinSBD-2019 data.

  2. [Financial PhaseBank](https://www.researchgate.net/publication/251231364 FinancialPhraseBank-v10/) Task: Financial Sentiment Analysis Description: consists of 4845 English sentences selected randomly from financial news found on LexisNexis database.

  3. FiQA Task: Financial Sentiment Analysis (-1 is negative and 1 is positive) Description: contains two discourse: (1) Financial news headline (529 annotated headlines) and (2) Financial Microblogs (774 annotated samples) , with manual annotated target entities, sentiment scores and aspects. This dataset was created for WWW18 conference financial opinion mining and question answering challenge.

Other github tabulating information about AI + Finance:

  1. https://github.com/sangyx/deep-finance
  2. https://github.com/YangLinyi/FinNLP-Progress

Areas of research for NLP in Finance and Economics: • NLP-based (stock) market analytics, e.g., prediction of economic performance indicators (trend prediction, performance forecasting, etc.), by analyzing verbal statements of enterprises, businesses, companies, and associated legal or administrative actors • NLP-based product analytics, e.g., based on social and mass media monitoring, summarizing reviews, classifying and mining complaint messages and other (non)verbal types of customer reactions to products or services • NLP-based customer analytics, e.g., client profiling, tracking product/company preferences, screening customer reviews or complaints, identifying high-influentials in economy-related communication networks • NLP-based organization/enterprise analytics (e.g., tracing and altering social images of organizational actors, risk prediction, fraud analysis, predictive analysis of annual business, sustainability and auditing reports) • Analysis of market sentiments and emotions as evident from consumers’ and enterprises’ verbal behavior and their communication strategies about products, services or market performance • Competitive intelligence services based on NLP tooling • Relationships and interactions between quantitative (structured) economic data (e.g., contained sales databases and associated time series data) and qualitative (unstructured verbal) economic data (press releases, newswire streams, social media contents, etc.) • Organizational information management based on the content-based assembling, packaging and archiving of verbal communication streams in organizations/enterprises (emails, meeting minutes, business letters, internal reporting, etc.) • Credibility and trust models for business agents involved in the economic process (e.g., as traders, sellers, advertisers) extracted from text/opinion mining their current communication as well as legacy data • Deceptive or fake information recognition related to economic objects (such as products, advertisements, etc.) or economic actors (such as industries, companies, etc.), including opinionspam targeting or emanating from economic actors and processes • Verbally fluent software agents (chat bots for counseling, sales and marketing) as reliable actors in economic processes serving business interests, e.g., embodying models of persuasion, information biases, fair trading • Enterprise search engines (e-commerce, e-marketing) involving NLP analytics • Consumer search engines, market monitors, product/service recommender systems involving NLP analytics • Client-supplier interaction platforms (e.g., portals, helps desks, newsgroups) and transaction support systems based on written or spoken natural language communication • Multi-media and multi-modality interaction platforms, including written/spoken language channels, triggering or supporting economic processes • Specialized modes of information extraction and text mining in economic domains, e.g., temporal event or transaction mining • Information aggregation from economy-related discourse, from and across single sources (e.g., review summaries, automatic threading) • Text generation in economic domains, e.g., review generation, complaint response generation • Ontologies for economics, lexicons for the economy domain or adaptation of general-domain lexicons for economic NLP – acquisition, maintenance or update of such terminological resources • Corpora and annotations policies (guidelines, metadata schemata, etc.) for economics-related NLP • Economy-specific text genres (business reports, sustainability reports, auditing documents, product reviews, economic newswire, business letters, law documents, etc.) and their usage for NLP • Dedicated software resources for economic NLP (e.g., NER taggers, sublanguage parsers, pipelines or end-to-end systems for processing economic discourse)

About

A library listing down available resources for the Finance and Economics domain