What is Fahali's accuracy?

We do not publish a headline accuracy number yet, on purpose. The outcome ledger was reset in mid-2026 and we will not quote a per-engine accuracy figure until it has resolved across a mixed market regime — not a one-directional stretch where a directionless signal can look 'right' without adding information. Until then we publish the method and the record itself, including the misses, rather than a manufactured percentage.

How does Fahali measure itself?

Every detection is recorded with a timestamp and later resolved against realized price over fixed horizons (1h, 4h, 24h, up to 48h+). Each call is marked correct or incorrect and stored in a signal-to-outcome ledger. The misses are kept, not deleted.

Does Fahali publish its misses?

Yes. Hits and misses are resolved on the same scale and kept in the same ledger. There is no separate scorekeeping and no sanitized retrospective. Publishing the misses is the foundation of the track record.

When will per-engine accuracy be published?

Once the ledger has resolved enough signals across a mixed (non-single-direction) market regime to be statistically honest, the per-engine scorecard (precision, base-rate lift, and direction/magnitude/volatility/crash breakdowns) will be published here. We would rather publish nothing than publish a number we cannot stand behind.

Fahali Accuracy & Track Record — How We Score Ourselves (and Publish Misses)

Why report base-rate lift instead of a raw percentage?

In a one-directional market, a naive 'always guess the trend' strategy can score very high without any skill. Base-rate lift measures how much better a signal performs than that naive baseline, which is what separates genuine signal from market drift. That is why we report lift alongside the raw figure rather than a raw percentage alone.

Why there's no big percentage on this page

The outcome ledger was reset in mid-2026. We will not publish a per-engine accuracy figure until it has resolved enough signals across a mixed market regime to be statistically honest, as opposed to a one-directional stretch where a directionless signal can look "right" without adding any information. We would rather publish nothing than publish a number we can't stand behind. When it's earned, the full scorecard appears here.

How a call becomes a verified outcome

1 · Emit

A detection fires and is written to the signal-to-outcome ledger with a unique ID, the engine(s) involved, the symbol, a timestamp, the predicted direction where applicable, and a confidence score.

2 · Resolve

After the forecast horizon (1h, 4h, 24h, up to 48h+ depending on the engine), the call is resolved against realized price action and marked correct or incorrect — and that result is stored permanently.

3 · Keep the misses

Incorrect calls stay in the same ledger as correct ones. There is no separate scorekeeping, no filtered dataset, no sanitized retrospective. The record compounds with every market session and cannot be back-filled.
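The three steps above can be sketched in a few lines of Python. This is an illustration only, not Fahali's actual schema or code: the names `LedgerEntry`, `Ledger`, `emit`, and `resolve` are hypothetical, and the fields simply mirror the ones listed under step 1. The sketch assumes an append-only store in which an outcome can be written once and never changed.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Optional
import uuid


@dataclass
class LedgerEntry:
    # Recorded at emit time (step 1).
    signal_id: str
    engines: tuple
    symbol: str
    emitted_at: datetime
    horizon: timedelta
    confidence: float
    direction: Optional[str] = None  # "up", "down", or None if directionless
    # Filled in at resolve time (step 2); None means "not yet resolved".
    correct: Optional[bool] = None


class Ledger:
    """Hypothetical append-only ledger: entries are never deleted."""

    def __init__(self) -> None:
        self._entries = {}

    def emit(self, engines, symbol, emitted_at, horizon, confidence,
             direction=None) -> str:
        signal_id = uuid.uuid4().hex
        self._entries[signal_id] = LedgerEntry(
            signal_id, tuple(engines), symbol, emitted_at, horizon,
            confidence, direction)
        return signal_id

    def resolve(self, signal_id: str, now: datetime, correct: bool) -> None:
        entry = self._entries[signal_id]
        if entry.correct is not None:
            # Step 3: a stored result is permanent, hit or miss.
            raise ValueError("already resolved; outcomes are permanent")
        if now < entry.emitted_at + entry.horizon:
            # A call cannot be scored before its horizon has elapsed.
            raise ValueError("forecast horizon has not elapsed yet")
        entry.correct = correct

    def resolved(self):
        # Hits and misses come back together, from the same store.
        return [e for e in self._entries.values() if e.correct is not None]
```

Refusing both early resolution and re-resolution is one way to make the "cannot be back-filled" property hold in code rather than by convention.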

The four axes we score

Detections are not all the same kind of claim, so they are scored on the axis that fits.

Direction

For directional engines: did the predicted direction (up or down) match the sign of the net return over the horizon?

Magnitude

Did a meaningful move actually occur (e.g. a move beyond a small threshold), regardless of direction?

Volatility

For regime/expansion signals: did volatility expand as flagged? Directionless by design.

Crash

For crash-class signals: did a rapid decline materialize within the warning window?
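To make the four axes concrete, here is a minimal Python sketch of one scoring function per axis. Every threshold below (`threshold=0.01`, `expansion_ratio=1.5`, `drop_threshold=0.10`) is an assumption chosen for illustration, as is the use of the standard deviation of simple returns as the volatility measure; the page does not state the values or definitions Fahali actually uses.

```python
import statistics


def net_return(start_price: float, end_price: float) -> float:
    return end_price / start_price - 1.0


def score_direction(predicted: str, start_price: float, end_price: float) -> bool:
    # Correct if the sign of the net return matches the call.
    # Assumption: a perfectly flat return counts as a miss either way.
    r = net_return(start_price, end_price)
    return (predicted == "up" and r > 0) or (predicted == "down" and r < 0)


def score_magnitude(start_price: float, end_price: float,
                    threshold: float = 0.01) -> bool:
    # Direction is ignored: only the size of the move matters.
    return abs(net_return(start_price, end_price)) > threshold


def realized_vol(prices) -> float:
    # Population standard deviation of period-to-period simple returns.
    returns = [b / a - 1.0 for a, b in zip(prices, prices[1:])]
    return statistics.pstdev(returns)


def score_volatility(prices_before, prices_after,
                     expansion_ratio: float = 1.5) -> bool:
    # Directionless: did volatility after the signal exceed the prior level?
    return realized_vol(prices_after) > expansion_ratio * realized_vol(prices_before)


def score_crash(prices_in_window, drop_threshold: float = 0.10) -> bool:
    # Did price fall by at least drop_threshold from the first price
    # in the warning window at any point inside that window?
    start = prices_in_window[0]
    return min(prices_in_window) / start - 1.0 <= -drop_threshold
```

Keeping the axes as separate functions reflects the point made above: a volatility call and a directional call are different claims, so neither should be graded with the other's rule.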

The ledger, right now.

These numbers are pulled live from the signal-to-outcome ledger. They are interim until the ledger has resolved across a mixed market regime, and they include misses.

SCORECARD · accumulating data

Scorecard pending

The ledger is still accumulating across a single-directional regime. We will publish the full per-engine scorecard with base-rate lift once it has resolved across a mixed market regime — as promised on this page. The data and method are real; the restraint is the point.

What the ledger already proves: agreement.

Here is one number we can stand behind today — because it is relative lift measured inside the same market window, so it does not depend on which way the market happened to go. We group the ledger's already-resolved signals into moments where several independent engines fired on the same instrument at once, then ask a simple question: did those higher-agreement moments resolve better than the average single signal over the same period?

MULTI-ENGINE CONSENSUS · live from the ledger

Read the lift, not the raw rate. Lift = a tier's hit-rate minus the average single signal's hit-rate over the same window. Because both are measured in the same regime, lift isolates the value of agreement from market drift — the metric we said we would report. The more independent engines that align, the larger the lift, which is what you would expect if the agreement carries real information. This is a relative measure, not a per-engine accuracy figure, and it is not advice — observation only.
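The lift definition above amounts to a small calculation, sketched here in Python. The function name `consensus_lift` and its input shapes are hypothetical. The sketch also assumes that "the average single signal's hit-rate" means the hit rate over all individual resolved signals in the window, and that each multi-engine moment has already been marked correct or incorrect.

```python
def consensus_lift(single_outcomes, tier_outcomes):
    """Lift per consensus tier, measured inside one market window.

    single_outcomes: list of bools, one per resolved single signal.
    tier_outcomes:   dict mapping number of agreeing engines to a list
                     of bools, one per resolved multi-engine moment.
    """
    if not single_outcomes:
        raise ValueError("need at least one resolved single signal")
    baseline = sum(single_outcomes) / len(single_outcomes)

    lifts = {}
    for n_engines, outcomes in sorted(tier_outcomes.items()):
        if outcomes:  # skip tiers with no resolved moments yet
            hit_rate = sum(outcomes) / len(outcomes)
            lifts[n_engines] = hit_rate - baseline
    return lifts


# Made-up outcomes, purely to show the arithmetic:
example = consensus_lift(
    single_outcomes=[True, False, True, False],          # baseline 0.50
    tier_outcomes={2: [True, True, False, True],         # hit rate 0.75
                   3: [True, True]},                     # hit rate 1.00
)
# example == {2: 0.25, 3: 0.5}
```

Because the baseline and every tier are drawn from the same window, a market-wide drift raises or lowers both sides of the subtraction, which is why the difference is less sensitive to regime than either raw rate.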

Why base-rate lift, not raw percentage

The number that will matter is lift, not the raw percentage. In a falling market, a naive "always bearish" strategy can score ~100% on direction while adding zero information — it just rode the market. Base-rate lift measures how much better a signal performs than that naive baseline (lift = precision − base rate). When we publish the scorecard, we'll show both the raw figure and the lift, so you can separate genuine skill from market drift. Reporting a raw "accuracy %" alone would be a vanity metric, and we won't do it.
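A short worked example shows why the raw figure misleads in a falling market. The Python below is a sketch under stated assumptions: it scores a single bearish signal, takes precision as the share of its calls that were followed by a fall, and takes the base rate as the share of all periods in which price fell. The function name `base_rate_lift` and the ten-period data are invented for illustration.

```python
def base_rate_lift(fired, fell):
    """Return (precision, base_rate, lift) for a bearish signal.

    fired[i]: True if the signal fired in period i.
    fell[i]:  True if price fell over period i.
    """
    if len(fired) != len(fell):
        raise ValueError("fired and fell must cover the same periods")
    outcomes_when_fired = [f for s, f in zip(fired, fell) if s]
    if not outcomes_when_fired:
        raise ValueError("signal never fired")
    precision = sum(outcomes_when_fired) / len(outcomes_when_fired)
    base_rate = sum(fell) / len(fell)  # what "always bearish" would score
    return precision, base_rate, precision - base_rate


# A falling market: price fell in 9 of 10 periods.
fell = [True] * 9 + [False]

# Naive strategy: call "bearish" every single period.
always_bearish = [True] * 10
naive = base_rate_lift(always_bearish, fell)
# naive == (0.9, 0.9, 0.0): a 90% raw score with zero lift.
```

The naive strategy posts a raw 90% while contributing nothing beyond the market's own drift, and the zero lift says so directly. A signal only earns positive lift by being right more often than the drift alone would make it.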

What this page is not

It is not a marketing scorecard of impressive-looking numbers. Anyone can publish a percentage; few publish the misses behind it and fewer still withhold the number until it's statistically honest. That restraint — observable here — is itself the credibility signal. When the ledger qualifies, this page becomes the live per-engine scorecard with direction / magnitude / volatility / crash breakdowns and base-rate lift for each engine.

We grade our own work —
and keep the misses.

See the live read in the meantime.
