Text-NLP#82tier 2live in productionNew
filing text delta
cadence: Annualdata: mediumlong shortlong onlyshort only
JF
2020
Journal of Finance
#82 filing_text_delta — Cohen-Malloy-Nguyen 2020 JF "Lazy Prices" + 10-K Item 1 uncertainty-language delta.
Read the paper →
Mechanism
Year-over-year change in uncertainty/risk language in 10-K Item 1 ('Business' section). Spike in 'may', 'could', 'uncertain', 'challenging', 'risk' tokens per 10K words → management is privately more cautious → forward earnings miss / underperformance. Stable or decreasing language → quietly confident outlook → outperform.
Signal rule
uncertainty_density_t / uncertainty_density_{t-1}; top-quintile increase = short, bottom-quintile = long.Data dependencies
sec_10k_business_textWorker data table — see services/worker schema.
daily_pricesAdjusted-close OHLCV for every US-listed ticker; primary price feed.
Expected edge
~3-4% annualized long-short return (Cohen-Malloy-Nguyen 2020 extension).
Illustrative pattern only
NOT a backtestIllustrative pattern only — see /app for live backtests and the actual current equity curve.
Explore filing text delta on alphactor.ai
See which tickers this family is currently firing on, with live signals and rankings.