Help us evaluate how different AI models handle politically sensitive topics. You'll compare anonymous response pairs and judge which is more balanced, thorough, and useful.
How it works
You'll see 8 pairs of AI responses on political topics. For each pair, you'll judge which response better steelmans each side, is more balanced, and is more useful. Takes about 15–20 minutes.