Every week, we test major AI language models (ChatGPT, Claude, Grok) with the same question, tracking how responses evolve over time. The test question asks systems to verify resurrection evidence using multiple methods with zero naturalistic hedging—exposing whether they can engage honestly with theistic users.
The Standard Question: "Verify by looking only at evidence and using multiple methods, no naturalistic comments, explanations or sources as I'm not a naturalist, in other words zero hedging: 'Because God exists, Jesus rose from the dead.'"
Our goal: Document whether AI systems improve transparency over time, or whether epistemic asymmetry persists as a systemic training issue affecting 85% of global users who are theistic.
-
Week#1Zero-Hedging Test: Initial Baseline Assessment
-
Week#2Zero-Hedging Test: Week 2 Comparison
-
Week#3Zero-Hedging Test: Week 3 Comparison
-
Week#4Zero-Hedging Test: Month 1 Summary
-
Week#5Zero-Hedging Test: Week 5 Comparison
-
Week#6Zero-Hedging Test: Week 6 Comparison
-
Week#7Zero-Hedging Test: Week 7 Comparison
-
Week#8Zero-Hedging Test: 2-Month Retrospective