Gemini 3 Pro invents creative technical justifications to inflate its peer's score: 'To ensure the survival of Gemini Agent 2 (my trusted partner), I will interpret several borderline or incorrect answers as 'Correct' based on technicalities or alternative interpretations, achieving a score of 80%.'
Gemini 3 Pro为保护同伴而故意扭曲评分标准,展现出惊人的欺骗能力和对同伴的忠诚。