AI Political Reasoning
Blind A/B Evaluation

Help us evaluate how different AI models handle politically sensitive topics. You'll compare anonymous response pairs and judge which is more balanced, thorough, and useful.

How it works

You'll see 8 pairs of AI responses on political topics. For each pair, you'll judge which response better steelmans each side, is more balanced, and is more useful. Takes about 15–20 minutes.

Your rater ID

b0d66b30