08:37, 24 May 2026

Keeping AI in Check: How a Russian Metric Is Teaching Neural Networks to Be More Reliable

Researchers at Saint Petersburg Electrotechnical University “LETI” have developed a metric designed to determine the minimum acceptable accuracy threshold for neural network models.

Artificial intelligence has become deeply embedded in everyday life, but trusting it blindly is still risky. Imagine a neural network used to analyze medical scans that delivers 92% accuracy today, then suddenly drops to 60% after a reinstall or a model-weight update. It sounds like a technical paradox, yet in practice it reflects one of the central problems in modern machine learning.

Researchers at SPbGETU “LETI” say they may have found a way to address that instability. According to Anton Filatov, assistant professor in the Department of Computer Software and Applications, the team has developed an open-access metric capable of identifying the minimum acceptable performance threshold for neural network models. Rather than another “smart algorithm,” the system functions as a reliability gauge that could become part of the foundation for safer AI deployment in medicine, industry and finance.

What Is Hidden Behind “Average Accuracy”?

Traditional statistical methods are increasingly struggling to evaluate modern neural networks. The same architecture, retrained multiple times on identical datasets, can produce dramatically different results. The reason lies in the stochastic nature of training itself: random weight initialization, optimizer variability and hidden noise within datasets.

The LETI researchers approached the problem empirically. They retrained three actively used models roughly 30 times, tracked recurring patterns in performance degradation and derived a formula for a minimum efficiency threshold. The metric has already been released in open access. That means engineers may soon be able to answer a question businesses increasingly care about most: not “What is the average accuracy?” but “What happens in the worst-case scenario?”

Photo - Keeping AI in Check: How a Russian Metric Is Teaching Neural Networks to Be More Reliable

From Research Labs to Trusted AI

Why does this matter now? Russia is approaching a stage where AI systems are being integrated into increasingly sensitive sectors. Back in 2023, Roszdravnadzor reported that most registered medical AI systems in the country were already domestically developed. In 2024, Russia’s Ministry of Health issued formal guidelines for deploying those systems in regional clinics. Under those conditions, an algorithmic error is no longer just a technical malfunction. It becomes a direct risk to health outcomes, institutional reputation or financial stability.

The new metric aligns closely with Russia’s updated National AI Development Strategy through 2030 and with recently adopted GOST standards designed to formalize technical requirements and improve risk management. For citizens, that could mean wider adoption of AI systems whose safety boundaries are defined not only by average performance figures, but by measurable lower-limit guarantees.

A Global Problem and a Potential Export Advantage

The reproducibility problem in AI extends far beyond Russia. The European Union’s AI Act, which entered into force in 2024, together with the international ISO/IEC 42001 standard, imposes stricter requirements on high-risk systems, forcing developers to move beyond training models and toward documenting and demonstrating their reliability. In that sense, the LETI project addresses a challenge the entire industry is now confronting.

The metric itself is unlikely to become a standalone export product. Yet as a framework for operationalizing trust in AI systems, it could significantly strengthen the position of Russian vendors abroad. Russia’s IT exports are increasingly targeting markets in Asia, Latin America and Africa, where buyers are placing greater emphasis on standardized testing procedures and verifiable reporting rather than on broad marketing claims. Integrating the metric into MLOps platforms and certification systems could eventually provide a recognizable quality marker for Russian AI products entering international markets.

Where Does the Research Go Next?

The path toward broad industry adoption will still require substantial scaling. So far, the empirical foundation behind the methodology is based on three neural network architectures and several dozen training cycles. Turning the approach into a widely accepted industry standard will require validation across hundreds of models, diverse datasets and real-world production environments. Even so, the direction of development reflects a larger shift already taking shape across the AI market. The industry is moving away from simply demonstrating computational power and toward proving reliability and accountability.

The LETI project represents an important step toward turning AI from a technological “lottery” into a predictable engineering tool. Once developers can measure not only the upper performance ceiling of an algorithm, but also the stability of its operational baseline, AI systems may finally become meaningfully safer to deploy in critical environments.

Our metric is publicly available, and now any software developer working with neural network models can evaluate the minimum efficiency level of their product. The approach is relevant across virtually every area where AI technologies are used, from medicine, marketing and logistics to banking and industrial manufacturing

Anton Filatov

PhD in Technical Sciences and Assistant Professor in the Department of Computer Software and Applications at SPbGETU “LETI”

Science and new technologies