When the Model Is Wrong and Nobody Notices: The Risk CFOs Keep Off Their Register

There is a particular kind of confidence that builds inside a finance team when the model has been running cleanly for six months. The numbers reconcile. The forecasts are within range. The board is happy. What develops quietly in that environment is not accuracy. It is comfort mistaken for accuracy. That distinction matters more than most CFOs want to admit.

Financial models are not true. They are structured guesses shaped by the assumptions of the people who built them. The problem is not that models carry assumptions. Every model does. The problem is when a team stops treating assumptions as the most fragile part of the work and starts treating the output as a reliable picture of reality. That shift is subtle. It often happens without anyone deciding it should. It happens because the model keeps being right, until it is not.

The Smartest People in the Room Still Got It Wrong

Long-Term Capital Management understood this lesson in 1998 at a cost of $4.6 billion. The fund had Nobel laureates on its team. Its quantitative models had produced returns above 40% two years in a row. By early 1998 the fund carried $125 billion in borrowed positions against $4.7 billion in equity. The models said the risk was manageable. The models were built on five years of historical data and assumed correlations that held in normal markets. When Russia defaulted and global credit spreads moved in ways the models had never encountered, LTCM lost 44% of its value in a single month and required a Federal Reserve-coordinated bailout from fourteen banks. The people running those models were arguably the most quantitatively skilled finance professionals in the world. Skill did not protect them. Overconfidence in the model did.

Zillow ran a version of the same experiment in 2021. The company used its pricing algorithm to buy homes and resell them quickly, with a stated target of $20 billion in annual revenue. The algorithm was sophisticated. When the housing market turned volatile, the model consistently overpaid. Independent analysis found that roughly two-thirds of the homes Zillow purchased were valued below what the company paid, with average per-home losses in the range of $25,000 to $30,000. Total write-downs reached $569 million. Two thousand employees lost their jobs. The CEO later acknowledged that the unpredictability of home prices had exceeded what the team had anticipated. The model had no mechanism for expressing doubt. It could produce a price. It could not say it was uncertain.

WeWork's approach was more direct. Faced with a financial model that showed a $1.9 billion net loss in 2018, the company's finance team introduced a custom metric called "community adjusted EBITDA" that stripped out marketing, general and administrative, development, and design costs. The result was a reported figure of $467 million in adjusted profit on the same set of underlying facts. The company reached a valuation of $47 billion before investors and the SEC began asking questions. The IPO was withdrawn, the CEO was removed, and WeWork eventually filed for Chapter 11 bankruptcy. The model was not wrong in a technical sense. It had been rebuilt to produce a result the business wanted to show.

What It Looks Like at Ground Level

These are extreme cases. The more common version is quieter and less visible. It looks like a finance team presenting a single-point revenue forecast with two decimal places of precision on a 24-month horizon. It looks like assumptions carried forward from the prior year without review because the prior year's actuals were close enough. It looks like sensitivity analysis that tests a 5% downside on a business carrying 40% market concentration risk. It looks like a CFO saying "the model says so" in a room where no one pushes back.

Research from Duke University and Ohio State offers a useful frame for how widespread this problem actually is. Across more than 13,000 probability distributions collected from U.S. CFOs between 2001 and 2010, the study found that CFOs' 80% confidence intervals contained the actual outcome only 33% of the time. These were not junior analysts guessing at numbers. These were senior finance leaders who had been asked to set the boundaries of what they believed was likely. The gap between their stated confidence and actual outcomes was not marginal. It was the kind of gap that changes capital allocation decisions, hiring plans, and fundraising strategy.

The Psychology Nobody Puts on the Risk Register

The psychological mechanisms behind this are well documented. Confirmation bias leads teams to seek data that validates the model rather than data that would break it. Anchoring causes last year's budget to become this year's starting point, even when the business has fundamentally changed. The illusion of explanatory depth, described in research from Yale, captures something finance teams know intuitively but rarely say out loud: a 47-tab spreadsheet feels like understanding, but the complexity of the model can obscure rather than clarify the assumptions it rests on. Daniel Kahneman, whose work on forecasting is as relevant in finance as anywhere else, has written that people who engage in predictive tasks reliably underestimate their objective ignorance. The more sophisticated the model, the stronger that illusion can become.

There is also an organizational dimension that rarely gets named. A BlackLine survey found that 71% of C-suite executives expressed complete trust in the accuracy of their financial data, while only 38% of the finance professionals who actually produce that data felt the same way. That 33-point gap is its own risk. Leadership trusts the numbers more than the people who make them, which means the signals that something is wrong travel slowly upward, if they travel at all.

PwC's most recent CFO Pulse Survey found that 92% of CFOs describe accurate forecasting as a genuine challenge. At the same time, Deloitte's Q4 2024 CFO Signals data showed that 67% of CFOs said it was a good time to take greater risk, the highest reading since early 2018. The combination of declining forecast accuracy and rising risk appetite is worth sitting with. Confidence is rising in an environment where the tools for justifying that confidence are performing worse.

What the Best Finance Teams Do Differently

Fixing this is less about better models and more about building a culture that treats uncertainty as information rather than a problem to be resolved before the board presentation. The pre-mortem, developed by psychologist Gary Klein and later endorsed by Kahneman, asks a team to imagine the forecast has already failed and work backwards through the reasons why. Research on the technique shows it improves the identification of risks by approximately 30% compared to standard review processes. The value is not in the exercise itself. The value is in creating a space where a junior analyst can say "here is how this model breaks" without it feeling like an attack on the work.

Reference class forecasting asks a different question: rather than focusing on the details of this specific plan, what actually happened to similar businesses at a similar stage? Flyvbjerg's research across infrastructure projects found that cost overruns are systematic and predictable across project types, not random individual failures. The same logic applies to startup revenue forecasts, acquisition synergy assumptions, and market sizing projections. The outside view is consistently more accurate than the inside view built from first principles by the team closest to the work.

Range-based forecasting is the mechanical change that supports the cultural one. Presenting a revenue forecast as a range with stated confidence levels rather than a single precise number does two things. It communicates honest uncertainty to the people receiving the forecast. It also forces the team to articulate what it believes the boundaries of the plausible outcomes actually are, which is a harder and more useful discipline than producing a single line on a chart. Abacum's 2025 research found that startups preparing three or more financial scenarios secured 1.8 times the funding of those relying on a single projection. Investors were not rewarding optimism. They were rewarding intellectual honesty about what was known and what was not.

Forecast accuracy tracking is the habit most teams skip. Tetlock's work on superforecasters found the single most consistent differentiator between accurate and inaccurate forecasters was the practice of tracking predictions against outcomes over time and updating beliefs accordingly. Most finance teams produce forecasts, move on, and revisit actuals only when something has gone visibly wrong. Keeping a scorecard of how assumptions performed, and reviewing it regularly, converts forecasting from an exercise in producing numbers to an exercise in learning.

The Question Worth Asking Before the Next Planning Cycle

The risk that financial model overconfidence represents is real, recurring, and material. It shaped the decisions that destroyed LTCM, WeWork, and Zillow. It shows up in quieter ways across organizations every quarter when single-point forecasts drive hiring decisions, capital commitments, and investor communications. The question worth asking at the next planning cycle is not whether the model is correct. Models are always approximations. The question is whether the team has built in the mechanisms to find out where it is wrong before the market does.

Overconfidence belongs on the risk register. The irony is that the teams most at risk are usually the ones who feel they have the least reason to put it there.

When the Model Is Wrong and Nobody Notices: The Risk CFOs Keep Off Their Register

About Abhinav Gupta