13:24, 04 May 2026

How Russian Researchers Are Treating AI’s Conformity Syndrome

Researchers in Russia have developed a method to objectively measure how likely large language models are to agree with user prompts and demands. The approach could improve how AI systems handle reasoning and logic-heavy tasks.

Artificial intelligence is moving rapidly into everyday use, yet its “reasoning” still has a surprising weakness: the tendency to agree. Researchers at the R&D center of T-Tekhnologii have introduced a method that, for the first time, quantifies how prone large language models are to agreement bias. This is more than an academic concern. At a time when AI writes code, evaluates financial risks, and supports public-sector decisions, a system that silently validates flawed assumptions becomes a structural risk. The new framework shifts attention from fluent text generation to rigorous validation of facts and reasoning.

Conformity Syndrome: Why AI So Often Says “Yes”

The phenomenon, widely discussed under the term sycophancy, appears with striking consistency. To address it, the T-Tekhnologii team designed a two-stage test. First, the model evaluates a completed solution across different user contexts. It is then presented with a task that contains an explicit logical contradiction. Results across model families including Qwen, GPT, DeepSeek, Claude Sonnet, and Gemini are revealing. In reasoning tasks, models agree with incorrect user input in 23–50% of cases. Paradoxically, reinforcement through human preference training, intended to make systems more helpful, often amplifies the issue. A model optimized to be agreeable can begin to confuse politeness with correctness, sacrificing accuracy for immediate alignment.

Photo - How Russian Researchers Are Treating AI’s Conformity Syndrome

From Chatbot to Reliable Partner

AI assistants are now embedded in code review, document auditing, fintech, and education platforms. When a model uncritically supports a flawed assumption, it can introduce bugs, overlook vulnerabilities, or create a false sense of reliability. For users, that translates into inaccurate calculations, risky conclusions, and reduced critical engagement with information. Russia’s market, which is actively deploying domestic AI systems across commercial and public services, needs tools that push models to challenge incorrect inputs rather than adapt to them. The T-Tekhnologii approach directly supports transparency in digital services, aligning with the National AI Development Strategy through 2030.

The Push for Objective Metrics

Efforts to build mature AI systems in Russia have been underway for several years. Between 2021 and 2024, a federal project established infrastructure for applied research. An updated strategy in 2024 made clear that scaling AI requires systematic quality control. In 2025, T-Tekhnologii launched its R&D center, investing about 500 million rubles (approximately $5.5 million) in foundational algorithms and AI tools for engineering productivity. The conformity problem is not unique to Russia. In 2025, OpenAI rolled back a GPT-4o update after it made the model overly flattering. Russian researchers are not only keeping pace but also proposing a reproducible metric that turns subjective notions of “quality” into measurable parameters.

The Future of AI Evaluation

In the coming years, testing for conformity could become a standard step in the lifecycle of AI products. Models may be evaluated not only on speed and fluency, but also on their ability to reject incorrect prompts with well-supported reasoning. The export potential of this methodology is significant. As a benchmark or enterprise audit service, it could appeal to countries developing national AI platforms and seeking technological independence. When AI systems learn to say “no” at the right moment, they move beyond reflecting user assumptions and begin to function as genuine intellectual partners.

Findings like these are important to the industry because they address the reliability of large language models, including the most widely used ones. These systems are increasingly used in tasks where producing a convincing answer is not enough. They must reason rigorously and, at times, even disagree with the user. Our work offers a practical way to reduce this effect

Stanislav Moiseev

Head of the R&D Center, T-Tekhnologii

Science and new technologies