Data Sharing Community
Welcome to the Portal of the CDQ Data Sharing CommunityThe CDQ Data Sharing Community is a trusted network of user companies to manage business partner data collaboratively. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
What's new? (RSS)Improved matching with transliteration support in Business Partner Lookup (4 June 2025)OverviewWe have enhanced the Business Partner Lookup service by introducing support for transliteration-based matching. This new feature improves accuracy when matching business partner names, cities, or streets written in different alphabets — such as Latin and Cyrillic — by applying a transliteration step before similarity scoring is calculated. DetailsWhen the feature toggle ENABLE_TRANSLITERATION_MATCHING is enabled, the lookup service will:
Example Use Case
→ Now correctly matched with a high score thanks to transliteration. AvailabilityThis feature is available behind a feature toggle (ENABLE_TRANSLITERATION_MATCHING) and can be activated upon request. New Augmentation Monitoring View in Data Clinic App (30 May 2025)We’re excited to introduce a brand-new Augmentation Monitoring view in the Data Clinic APP. Built on the foundation of the Business Partner Update Browser, it preserves nearly every element you know and love while giving you full visibility into how each record was enriched.
Expanded coverage of US.SEC data source in Business Partner Lookup (28 May 2025)SummaryWe have significantly improved our integration with the United States Securities and Exchange Commission (US.SEC) data source. As a result, CDQ’s Business Partner Lookup service now provides access to over 900,000 company records collected from the EDGAR (Electronic Data Gathering, Analysis, and Retrieval) system. DetailsPreviously, only a fraction of available filings was accessible. With this enhancement, our customers can now search and retrieve comprehensive business partner data for virtually all companies registered in the EDGAR database. This includes detailed information on U.S.-based legal entities such as corporations, limited liability companies, and other registered businesses.
Each record includes detailed attributes such as:
The data is updated daily, ensuring high data freshness and reliability. This improvement boosts the value of the US.SEC source for due diligence, enrichment, and compliance use cases. ... further resultsData modelAn important prerequisite for collaborative data management is a common understanding of the shared data. For the CDQ Data Sharing Community, this common understanding is specified by the CDQ Data Model. The concepts of this model are defined and documented in this wiki which can be used as a business vocabulary. Moreover, the wiki provides a machine-readable interface to reuse this metadata by using semantic annotations. ![]() Data maintenance proceduresA procedure is a common standard or "how-to" for a specific data management task. Within the CDQ Data Sharing Community, companies agree on such procedures to ensure similar rules and guidelines for similar tasks. For several countries, the CDQ Wiki provides such information, e.g. data quality rules, trusted information sources, legal forms, or tax numbers. Try
or select another country from the list. |
Data sources
Metadata and Standards: Metadata-driven Data QualityData quality plays a pivotal role in ensuring compliance with legal, regulatory, and industry standards. One of the core challenges in achieving high data quality is adhering to dynamic data requirements that evolve due to changes in national regulations. These requirements vary by country, making it essential for businesses to track and update compliance criteria continuously. In many countries, official company information is available as Open Data, but the lack of a standardized data model or provision method complicates the process of integrating this data. The Data Sharing Community actively collaborates to identify global data requirements and reference data sources, whether Open Data or commercial.
Data Quality RulesTransformation of human-documented data requirements into executable data quality rules is mostly a manual IT effort. Changing requirements cause IT efforts again and again. Some checks, e.g. tax number validity (not just format!), require external services. Other checks, e.g. validity of legal forms, require managed reference data (e.g. legal forms by country, plus abbreviations). Continuous data quality assurance (i.e. batch analyses) and real-time checks in workflows often use different rule sets. Data requirements and related reference data are collected and updated collaboratively by the Data Sharing Community. Data quality rules are derived from these requirements automatically. All data quality rules are executed behind 1 interface, in real-time. Batch jobs and single-record checks use the same rule set and can be integrated by APIs. For proving that a data quality rule is content-wise correct we maintain supporting document(s) per data quality rule which share the rule's source. This could be:
We manage the URL (if any), a screenshot of the relevant parts (if any) and the source's name (e.g. Community member data standard, European Commission, National ....) See Identifier format invalid (SIREN (France)) as an exemplary rule that was specified and implemented based on information provided by the OECD. |