Explore how different aggregation methods perform when combining predictions from multiple forecasters
Lower Brier score = better accuracy
Key findings:
Closer to diagonal line = better calibration
Interpretation:
Higher skill should correlate with lower Brier score
Observations:
The trend line shows how forecaster skill correlates with prediction accuracy. Lower Brier scores indicate better performance.
Shows how different methods perform across various questions
Analysis:
This chart shows performance across the first 20 questions, allowing us to see when certain aggregation methods outperform others.