ECB on Approach to Discover Plausibility Checks for Reporting Data
The European Central Bank (ECB) published a paper that presents a data-driven approach, based on machine learning, for discovering new plausibility checks on reporting data. The paper aims to provide a further tool to support the development of new checks that extend the validation rules to automatically detect potential quality issues requiring further investigation. The presented approach uses large amounts of historical data to identify patterns of interest in past observations. These patterns are suitable for discovering data anomalies and can serve as a basis for defining new checks. The paper illustrates how such patterns are used by business experts to refine their data quality framework. The paper also provides suggestions for further work that could be done to improve technical performance and prediction quality.
The approach serves business experts in two ways: one of the ways is by inspiring business experts to formulate new plausibility checks, which can be implemented in reporting systems and thus have permanent, positive effects on data quality; the second way is by making it possible to conduct ad hoc investigations into specific observations of anomalous or suspicious values. Overall, the approach has shown its benefits for the use cases considered and will be used for investigative data quality management in the future. The evaluation also provided some insights into potential extended applications. ECB has already started to investigate other sets of templates for identifying new plausibility checks. Preliminary results showed that in the context of the COREP framework, too, it was able to identify latent relationships that may serve as basis for new rules. An interesting next step will be to extend the investigations to relationships that span data from both the FINREP and COREP templates. At present, such relationships are not subject to any checks, so they represent a promising line of investigation. Another extended application regarding data is to cover more reporting periods and to look for seasonal patterns.
Finally, there are some technical improvements that might be worth investigating. One main area for improving the performance of the predictive model is to make a better distinction in the data between missing values and semantic zero values. For instance, the analysis considered differentiating between models for predicting the presence of a data point and those for predicting its actual values. This would make it easier to identify implausible values that actually represent missing or non-reported data. Another area of work on future extensions is a broader evaluation of other models for the regression analysis, also taking into consideration ensemble methods. This might lead to further improvements in prediction quality and needs to be accompanied by explainable artificial intelligence methods.
Related Link: Paper (PDF)
Keywords: Europe, EU, Banking, Plausibility Checks, Reporting, FINREP, COREP, SUBA, Data Quality Checks, Machine Learning, Artificial Intelligence, Regtech, ECB
Featured Experts

María Cañamero
Skilled market researcher; growth strategist; successful go-to-market campaign developer

Nicolas Degruson
Works with financial institutions, regulatory experts, business analysts, product managers, and software engineers to drive regulatory solutions across the globe.

David Fihrer
Skilled life insurance actuary; subject matter expert on IFRS 17 and source of earnings
Related Articles
EBA Launches Stress Tests for Banks, Issues Other Updates
The European Banking Authority (EBA) launched the 2023 European Union (EU)-wide stress test, published annual reports on minimum requirement for own funds and eligible liabilities (MREL) and high earners with data as of December 2021.
EBA Proposes Standards for IRRBB Reporting Under Basel Framework
The European Banking Authority (EBA) proposed implementing technical standards on the interest rate risk in the banking book (IRRBB) reporting requirements, with the comment period ending on May 02, 2023.
FED Issues Further Details on Pilot Climate Scenario Analysis Exercise
The U.S. Federal Reserve Board (FED) set out details of the pilot climate scenario analysis exercise to be conducted among the six largest U.S. bank holding companies.
US Agencies Issue Several Regulatory and Reporting Updates
The Board of Governors of the Federal Reserve System (FED) adopted the final rule on Adjustable Interest Rate (LIBOR) Act.
ECB Issues Multiple Reports and Regulatory Updates for Banks
The European Central Bank (ECB) published an updated list of supervised entities, a report on the supervision of less significant institutions (LSIs), a statement on macro-prudential policy.
HKMA Keeps List of D-SIBs Unchanged, Makes Other Announcements
The Hong Kong Monetary Authority (HKMA) published a circular on the prudential treatment of crypto-asset exposures, an update on the status of transition to new interest rate benchmarks.
EU Issues FAQs on Taxonomy Regulation, Rules Under CRD, FICOD and SFDR
The European Commission (EC) adopted the standards addressing supervisory reporting of risk concentrations and intra-group transactions, benchmarking of internal approaches, and authorization of credit institutions.
CBIRC Revises Measures on Corporate Governance Supervision
The China Banking and Insurance Regulatory Commission (CBIRC) issued rules to manage the risk of off-balance sheet business of commercial banks and rules on corporate governance of financial institutions.
HKMA Publications Address Sustainability Issues in Financial Sector
The Hong Kong Monetary Authority (HKMA) made announcements to address sustainability issues in the financial sector.
EBA Updates Address Basel and NPL Requirements for Banks
The European Banking Authority (EBA) published regulatory standards on identification of a group of connected clients (GCC) as well as updated the lists of identified financial conglomerates.