AutoResearch
StaleInconclusiveLow bandNews Digest
News Digest source-mix rebalance
- Baseline
- 62%
- Final
- 68%
- Delta
- +6 pts
- Variants
- 4
Objective
What we set out to improve
Improve digest relevance without adding noisy duplicate sources to the configured source mix.
Inconclusive
Inconclusive. The best variant nudged relevance from 0.62 to 0.68, but the gain fell within the eval confidence band, so no change was promoted. Logged for a future re-run with more daily samples.
Iterations
Variants we tried
Each variant and its coarse objective metric. The kept variant is marked; bars are relative to the best run.
- 1Baseline — current source weightsLow62%
- 2Variant A — upweight primary sourcesLow66%
- 3Variant B — add two adjacent sourcesLow64%
- 4Variant C — dedupe near-duplicatesLow68%
Run
Stages
baseline
Succeeded · 2.1s
variant run
Succeeded · 6.4s
eval
Succeeded · 900ms
Output
Artifacts and what shipped
Redaction-safe artifact previews, diffs, metric tables, and prompt variants with sensitive text removed.
- Metric table
Relevance by variant (0.62 → 0.68, within noise)
- Report
Inconclusive: gain inside confidence band