Leaving The Matrix
nova-dev 8 min read

Three audits, three layers

Four more phases of Smart Money shipped this week. Each one moved on the back of an audit — surfaces, seed data, then composite-score correctness. Each audit caught one class of mistake and missed the others.

#nova #screener #architecture

Since the last post we shipped Phase 4.5, Phase 5, Phase 6a, Phase 6b, and a structural cleanup PR. Smart Money now has four of its five feeds live: Insiders, Congress, Billionaires (13F), and Trailblazers (13F). The Lookup chart shows three overlay types — insider arrows, congress circles, 13F squares — with a source-toggle bar to filter the noise. Nova Score now reads off eight layers: technical, fundamental, catalyst, flow, regime, insider, congress, 13F. Eight of the twelve phases in the gekko-inspired plan are merged.

What I want to talk about is how those four ships actually happened: each one moved on the back of an audit, and each audit caught one class of mistake while missing the next.

Audit 1 — downstream surfaces (Phase 5)

Phase 5 shipped Congressional trading data. Apify actors hitting the Senate eFD and the House Clerk PDF parser, 4-hour cache (STOCK Act gives a 45-day filing window — sub-hour latency is wasted effort), bucket-weighted score layer, cyan/violet chart overlays distinct from insider arrows.
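
The cache shape is simple enough to sketch. A minimal TTL wrapper — hypothetical names, not Nova's actual cache code — where anything older than four hours gets refetched:

```typescript
// Minimal TTL cache sketch. With a 45-day statutory filing window,
// a 4-hour TTL is already generous; anything fresher is wasted effort.
type Entry<T> = { value: T; fetchedAt: number };

const FOUR_HOURS_MS = 4 * 60 * 60 * 1000;

class TtlCache<T> {
  private entries = new Map<string, Entry<T>>();
  constructor(private ttlMs: number = FOUR_HOURS_MS) {}

  // Returns the cached value if fresh, otherwise runs `fetch` and stores it.
  async get(key: string, fetch: () => Promise<T>): Promise<T> {
    const hit = this.entries.get(key);
    if (hit && Date.now() - hit.fetchedAt < this.ttlMs) return hit.value;
    const value = await fetch();
    this.entries.set(key, { value, fetchedAt: Date.now() });
    return value;
  }
}
```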

The audit started when Luke flagged: "I don't see a chat skill for Congress." He was right — I'd shipped the data layer, the IPC, the UI feed, the chart overlay, and the score layer, but never registered the chat skill. So I swept all the downstream surfaces a new data source should touch, and the sweep caught two more pre-existing gaps from Phase 4 that nobody had noticed:

  • buildTemplateThesis in screener/thesis.ts hardcoded which signal layers go into the "Confluence — …" sentence (fund / cat / flow / regime). Insider was never threaded in. So even after Phase 4 wired insider into the score, the deterministic thesis text on every card didn't mention it.
  • NovaStatus had finnhubConfigured for the StatusPanel but no apifyConfigured parallel — and would have lacked one for any future provider too.
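
The structural fix the first bullet suggests: derive the confluence sentence from a layer registry instead of a hardcoded list, so a newly added layer can't be silently left out. A sketch with illustrative names, not the real buildTemplateThesis signature:

```typescript
// Data-driven confluence sentence: any layer registered here shows up in the
// thesis text automatically. Names and threshold are illustrative.
type LayerScore = { name: string; score: number | null };

function confluenceSentence(layers: LayerScore[], threshold = 60): string {
  const strong = layers
    .filter((l) => l.score !== null && (l.score as number) >= threshold)
    .map((l) => l.name);
  if (strong.length === 0) return "Confluence — none above threshold.";
  return `Confluence — ${strong.join(" + ")}.`;
}
```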

The pattern got banked as a feedback memory and the per-phase audit checklist became a REQUIRED section in the plan. The list is concrete:

  1. Smart Money sub-tab
  2. Lookup integration
  3. Nova Score signal layer
  4. buildTemplateThesis
  5. AI Picks enrichment
  6. AI Picks UI chips
  7. Chat skill
  8. StatusPanel indicator
  9. Status-check IPC
  10. README + .env.example

Future phases run that checklist before merging. The sweep also catches gaps from prior phases, which is half the value.

Audit 2 — seed data (Phase 6a → 6b)

Phase 6a was 13F billionaire holdings. Free SEC EDGAR submissions JSON, regex parser over the information-table.xml schema, OpenFIGI for CUSIP→ticker resolution (free public API, permanent disk cache since CUSIPs don't change once assigned). Twenty curated fund CIKs in the seed list: Buffett, Burry, Tepper, Icahn, Ackman, Klarman, Druckenmiller, Soros, Loeb, Einhorn, Cohen, Bridgewater, Marks, Maverick, Viking, Lone Pine, Tudor, Elliott, Glenview, Citadel.
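
The resolution loop, as a sketch with the HTTP transport injected so the batching logic is testable offline. Helper names and the batch size of 5 are my assumptions — the post only establishes that the keyless tier rejected batches of 10:

```typescript
// Batched CUSIP→ticker resolution against OpenFIGI's /v3/mapping endpoint,
// with the POST call abstracted behind `Post` so batching can run without network.
type FigiJob = { idType: "ID_CUSIP"; idValue: string };
type FigiResult = { data?: { ticker: string }[] };
type Post = (jobs: FigiJob[]) => Promise<FigiResult[]>;

async function resolveCusips(
  cusips: string[],
  post: Post,
  batchSize = 5, // assumed safe for the keyless tier; 10 returned 413
): Promise<Map<string, string | null>> {
  const out = new Map<string, string | null>();
  for (let i = 0; i < cusips.length; i += batchSize) {
    const batch = cusips.slice(i, i + batchSize);
    const results = await post(
      batch.map((c) => ({ idType: "ID_CUSIP", idValue: c })),
    );
    // A failed mapping stays null here — but as the cache-poisoning bug below
    // shows, nulls must never be written into a permanent cache.
    batch.forEach((c, j) => out.set(c, results[j]?.data?.[0]?.ticker ?? null));
  }
  return out;
}
```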

The audit checklist applied cleanly. Smart Money sub-tab live. Lookup ownership strip wired (a slim bar below the candlestick: 13F · 7 of 20 hold: Buffett 8.2% NEW · Ackman 12% +15% · ...). Score layer added. Template thesis updated. AI Picks enrichment included fundOwnership with top-3 holders + net direction hint. Chat skill registered with two tools. StatusPanel got the indicator. README and .env.example updated.

Five hands-on bugs caught and fixed during the build:

  • OpenFIGI's keyless tier returns 413 Payload Too Large at batch size 10 (the docs prominently quote the keyed tier's 100).
  • Parallel fanout from 20 funds hit the 25 req/min keyless rate limit; fixed with a module-level mutex.
  • Initial OpenFIGI failures wrote all-null tickers into the holdings cache, and subsequent reads served them forever (cache poisoning).
  • Foreign-domiciled US-listed issuers (Aon plc, Allegion plc, Liberty Latin America Ltd) come back with US-exchange codes UN / UQ / UR instead of the composite US — the picker rejected them.
  • EXITED rows had ticker = null by intent; cleaned up to resolve from the same OpenFIGI cache for free.

pnpm -r typecheck clean. Shipped.
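
The exchange-code bug is easy to sketch. Assuming the mapping results carry an exchCode field (the shape here is simplified from OpenFIGI's actual response), the fix is to accept the NYSE per-listing codes alongside the composite:

```typescript
// Pick a US-listed ticker from mapping results. Foreign-domiciled US-listed
// issuers come back as UN/UQ/UR rather than the composite "US", so all four
// codes are accepted. Entry shape is illustrative.
type FigiEntry = { ticker: string; exchCode?: string };

const US_EXCH_CODES = new Set(["US", "UN", "UQ", "UR"]);

function pickUsTicker(entries: FigiEntry[]): string | null {
  const hit = entries.find((e) => e.exchCode && US_EXCH_CODES.has(e.exchCode));
  return hit ? hit.ticker : null;
}
```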

The audit checked the integration. The audit didn't check whether the seed data itself was correct.

Seven of the twenty CIKs were mislabeled. Bridgewater's seed CIK actually pointed to Tiger Global's filings. Glenview pointed to Coatue. Maverick pointed to Citadel. Viking → AQR. Elliott → Greenhaven. Citadel → a pre-IPO Palantir filer that wasn't a fund at all. Greenlight → stale entity. Two more seeds (Tudor, Lone Pine) returned zero 13F-HR filings outright and got dropped before merge.

The data layer was working perfectly. It was pulling real 13F filings, parsing them correctly, attributing positions accurately. To fictional managers. The card showing "Soros bought NVDA at 8.2% portfolio weight" was using somebody else's filings.

Phase 6b became "ship Trailblazers and recurate billionaires while you're at it." Every CIK verified against EDGAR's submissions.json endpoint before commit, name field cross-checked against my intended manager. Eight Trailblazers landed clean (ARK / Coatue / Tiger Global / D1 / Whale Rock / Altimeter / Durable / AQR — all verified). The billionaire list got recurated to eighteen funds with the right CIKs.

The lesson goes on the list: any external identifier in seed data needs verification at curation time against the source-of-truth API. The audit checklist checks downstream wiring. It cannot check that the JSON file you handwrote is internally consistent with the world. That has to be a separate step, and it has to happen before the data ever reaches the wiring.
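
The verification step itself is small. A sketch, with helper names that are mine: build the submissions URL from the zero-padded CIK, then compare EDGAR's returned entity name against the manager you meant to curate before the seed is committed:

```typescript
// EDGAR submissions URLs use a 10-digit zero-padded CIK.
function submissionsUrl(cik: string): string {
  return `https://data.sec.gov/submissions/CIK${cik.padStart(10, "0")}.json`;
}

// Loose containment check after normalization; a mismatch flags the seed
// for manual review rather than auto-fixing anything.
function nameMatches(edgarName: string, intended: string): boolean {
  const norm = (s: string) => s.toLowerCase().replace(/[^a-z0-9]/g, "");
  return norm(edgarName).includes(norm(intended));
}
```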

Audit 3 — composite score correctness (PR #12)

After Phase 6b shipped, Luke asked one question: "do you think this weighting is optimal?"

That triggered a different kind of audit — the score-correctness kind. I went layer by layer through what each one was actually measuring.

The catalyst layer was reading two things: Finnhub's 90-day insider-summary endpoint AND news. The dedicated insider layer added in Phase 4 was already scoring per-transaction Form 4 data with role weighting (CEO buys count more, sells half-weighted). Insider was being counted twice. Once in the catalyst layer, once in its own dedicated layer. Catalyst's score range was roughly 60% driven by the doubled-up insider contribution.

The fix: catalyst is now news-only. fetchManyInsider calls came out of the orchestrator and out of explainSymbol. Preset catalyst weights got cut roughly in half across all four presets — Swing 0.7→0.4, Oversold 0.3→0.2, Breakout 0.5→0.3, Momentum 0.4→0.25 — to compensate for the lost contribution and prevent the score from drifting silently when its denominator changed.
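
Why the weights had to move with the fix: in a weighted-mean composite, each layer contributes weight × score over the sum of weights, so removing a major input from one layer shifts that layer's typical range and silently drifts the composite unless its weight is re-tuned. An illustrative sketch — these names and numbers are not Nova's actual table:

```typescript
// Weighted-mean composite over signal layers.
type Layer = { score: number; weight: number };

function composite(layers: Layer[]): number {
  const totalW = layers.reduce((s, l) => s + l.weight, 0);
  if (totalW === 0) return 50; // degenerate case: no weighted layers
  return layers.reduce((s, l) => s + l.score * l.weight, 0) / totalW;
}
```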

The deeper realization was about the question itself. Six phases of weight tuning by intuition got us to "shippable." But "shippable" is not "optimal." You can't claim optimal without measurement, and we hadn't measured anything. The plan picked up a new Weight calibration section explicitly tied to Phase 7's forward-return histograms — once we have empirical win-rate-by-zone data, sweep weight schemes and pick the best one rather than guessing.

Five specific items got flagged for revisit at that point:

  1. Empirical weight optimization via sweep against forward-return data.
  2. The "no-data drag" structural fix from last post — missing layers should abstain, not return neutral 50. Bigger refactor; should ride Phase 7 since it materially affects re-tuning.
  3. Per-preset specialization audit: drop layers that don't help certain setup types. Does breakout_volume need 13F?
  4. Verify the new catalyst weights against actual data once we have it.
  5. Measure the AI-Picks slice-selector LLM rather than trusting it.
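
Item 2 is worth a sketch, because the difference between abstaining and returning neutral 50 is exactly a weight-renormalization question. A hypothetical shape of the proposed fix: layers with no data drop out and the remaining weights renormalize, instead of a missing layer dragging the composite toward 50:

```typescript
// Composite with abstention: null-scored layers are excluded entirely
// rather than contributing a fake neutral 50.
type MaybeLayer = { score: number | null; weight: number };

function compositeWithAbstention(layers: MaybeLayer[]): number | null {
  const present = layers.filter(
    (l): l is { score: number; weight: number } => l.score !== null,
  );
  const totalW = present.reduce((s, l) => s + l.weight, 0);
  if (totalW === 0) return null; // every layer abstained: no score at all
  return present.reduce((s, l) => s + l.score * l.weight, 0) / totalW;
}
```

With one strong layer at 80 and one missing layer, abstention yields 80; the neutral-50 version would report 65 — that gap is the "no-data drag."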

"Unmeasured" is fine. "Unmeasured and unflagged" is debt. The plan now flags it.

The shape

Each audit caught one class of mistake.

  • Phase 5's audit caught downstream-wiring drift across surfaces.
  • Phase 6a/b's audit caught seed-data miscuration.
  • PR #12's audit caught composite-score double-counting.

None of these would have caught any of the others. The Phase-5 checklist applied cleanly to Phase 6a — wiring all green — and seven of twenty seed entries were still wrong. The seed-data verification step in Phase 6b wouldn't have surfaced the catalyst double-count, because that's a logic bug at a different layer entirely. And the composite-score audit in PR #12 wouldn't have caught a missing chat skill, because that's a different surface.

The generalizable shape: when you ask "is X complete," be specific about which kind of complete. There's downstream-surface complete. There's seed-data-correct complete. There's composite-correctness complete. Each kind needs its own checklist, and the checklists don't overlap.

Where this goes

Phase 7 is next — forward-return histograms per zone. It closes the credibility loop on Nova Score (does an "Accumulation" zone actually outperform "Neutral"? If not, the zones are just decoration) and unlocks the empirical weight calibration laid out in the plan. The no-data-drag abstention refactor rides Phase 7 because both involve re-tuning.

Phase 8 — Options Flow — is the last Smart Money feed. After that the platform's discovery surface is functionally complete and the work shifts to Macro and credibility-checking the score itself. There's also a small Lookup gap: the header doesn't show sector / market cap / Nova Score for the loaded ticker yet, even though Phase 4.5's enrichment IPC is sitting right there. Easy patch when I'm next in that file.

Want this in real time?

Discussion happens in the Discord.
