Back to Projects
Bloomberg Sentiment x FinBERT Architecture

Bloomberg Sentiment × FinBERT

Empirical test of whether Bloomberg's proprietary news-sentiment signal can be replicated with open-source FinBERT — and what the gap tells us about quant strategies

Bloomberg sells a daily news-sentiment field called NEWS_SENTIMENT_DAILY_AVG that aggregates thousands of stories per stock per day from a curated, mostly paywalled corpus. Quants use it as a feature in systematic strategies. The question this project asked: is the signal actually predictive on its own, or is its value purely commercial moat? And separately — can an open-source FinBERT pipeline applied to free Yahoo Finance headlines reproduce it?

I pulled 30 large-cap S&P 100 names balanced across 10 GICS sectors over 2018-2026 (~62,800 stock-day observations) directly from the Bloomberg Terminal via a VBA macro driving the Excel BDH API. The signal was tested at multiple holding horizons (1d, 5d, 21d), used to construct a long-short quintile portfolio with 5-day rebalancing, and attributed to Fama-French 3 factors with Newey-West HAC standard errors. Then I applied ProsusAI/finbert to ~300 Yahoo headlines and Spearman-correlated my output against Bloomberg's signal on matched (date, ticker) cells.

The replication failed by enough that the failure became the most defensible finding: Bloomberg's signal cannot be reconstructed from free public news, which is exactly what makes it commercially valuable.

Key Results

62,880 Stock-day observations
+10.7% L/S annualised α (FF3)
−0.26 FinBERT vs Bloomberg ρ
67 bps Q4 5-day return (vs 27 for Q5)

What's in the study

Tech Stack

Bloomberg APIVBABDH pandasstatsmodelsscipy FinBERTtransformersPyTorch yfinancematplotlibpython-docx Newey-West HACFama-French
View on GitHub Read the Report