FinBERT: Financial Sentiment Analysis with Pre-trained Language Models

FinBERT, described by Dogu Araci in a paper posted to arXiv on August 27, 2019 (originating as a University of Amsterdam master’s thesis), took Google’s BERT language model and fine-tuned it for the financial domain. The motivation was simple: general-purpose language models struggle with financial text because the language is specialized - words like “liability,” “exposure,” or “write-down” carry meanings and sentiment that differ from everyday usage.

The paper’s core idea follows the transfer-learning recipe that BERT popularized. Start from a model pre-trained on a large general corpus, optionally continue pre-training on domain text, then fine-tune on a smaller labeled dataset for the specific task. Because the model already encodes a great deal of language structure, it needs relatively few labeled financial examples to specialize. FinBERT was evaluated on financial sentiment analysis - classifying sentences from financial news as positive, negative, or neutral - and the paper reported improvements on every measured metric over the prior state of the art on two financial sentiment datasets.

FinBERT became one of the most cited early demonstrations that domain adaptation of transformer language models pays off in finance. The name was later reused by other groups for related financial models, but Araci’s 2019 work is the original.

Why business readers should care: FinBERT is a concrete, early example of the pattern now everywhere in financial AI - take a general language model and specialize it cheaply for a narrow task such as reading filings, news, or earnings calls. It foreshadowed larger purpose-built systems like BloombergGPT.

FinBERT: Financial Sentiment Analysis with Pre-trained Language Models

Sources

Related