Identifier schema

From CDQ Wiki
Public:Rule Category/IDENTIFIER SCHEMA
Jump to navigation Jump to search
Property Value
A technical identifier, unique in a certain context. Technical key IDENTIFIER_SCHEMA
The human-friendly name shown for this concept in the user interface, used instead of the technical name to improve readability and understanding. Display name Identifier schema
An informal and short human-readable definition of a concept, in terms of a 'one-liner'. Short description Validates that identifiers follow official formatting schemas including separators, spacing, and presentation.
Informal and comprehensive human-readable definition of a concept. Description This category encompasses data quality rules that verify business identifiers are formatted according to their official reference schemas, including proper use of separators, spaces, and presentation conventions. Unlike basic format validation that checks syntax on cleaned identifiers, schema validation examines the complete formatted representation. For example, a Swiss UID must be formatted as 'CHE-123.456.789' with specific placement of hyphens and dots, not just as '123456789'. Different countries and identifier types have specific schema requirements for how identifiers should be presented in official documents and systems. Schema validation ensures identifiers match these official presentation standards, which is important for document generation, official reporting, and system integration with external authorities. These rules support the Representational_consistency dimension by ensuring identifiers adhere to official formatting schemas.
Hierachical parent concept of a concept. Parent Reference schema

Data quality rules

CDQ manages 97 data quality rules in this category.

Data quality rule The country a linked concept is active or generally relevant for. Country scope Informal and comprehensive human-readable definition of a concept. Description Criticality<br/>Specifies how critical of the violation of a data quality rule.<br/>ERROR: Indicates a critical data quality rule violation that requires correction.<br/>WARNING: Indicates a potential data quality issue that should be reviewed.<br/>INFO: Indicates an informational finding with no immediate impact on data quality. Criticality Rule release status<br/>The release status in terms of development progress or maturity of a data quality rule.<br/>IDEA: Initial rule definition that documents a business requirement but is not yet active in services.<br/>DRAFT: Rule concept is being prepared or refined but is not yet finalized for implementation or execution.<br/>HYPERCARE: Rule is newly released and under increased observation to ensure stable behaviour and correct results.<br/>RELEASED: Rule has passed verification and is actively executed in productive CDQ services.<br/>DEACTIVATED: Rule is temporarily removed from the active rule set because it needs correction or clarification before re release.<br/>ARCHIVED: Rule is permanently retired and no longer maintained or intended for future activation. Rule release status
Identifier format inaccurate (AFM number (Greece))
REPRESENTATIONAL_CONSISTENCY
GR (Greece) The AFM number (Greece) for legal entities consists of 9 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

This rule checks the syntax, i.e. format of the AFM number (Greece) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Australian Company Number (Australia))
REPRESENTATIONAL_CONSISTENCY
AU (Australia) The Australian Company Number (Australia) consists of 9 digits. There is a convention to display the SCN in the format XXX XXX XXX, three blocks of three characters, each block separated by a blank. This is to assist readability and the inserted blanks do not form part of the ACN. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value in the places where they are not supposed to be. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the Australian Company Number (Australia) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (BTW number (Belgium))
REPRESENTATIONAL_CONSISTENCY
BE (Belgium) VAT number (Belgium) consists of 10 digits where the first digit is always zero, but the second digit can not be zero. This rule checks the syntax, i.e. format of the VAT number (Belgium) with respect to its format. Any deviation (i.e. white spaces where they are not specified) result in a violation. INFO ARCHIVED(2020-01-01)
Identifier format inaccurate (Business Number (Australia))
REPRESENTATIONAL_CONSISTENCY
AU (Australia) This rule checks the syntax, i.e. format of Australian Business Number (ABN) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

Australian Business Number (ABN) consists of 11 digits in the format: "99 999 999 999". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces in a places where no whitespace is expected to be, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Business Registration Number (Egypt))
REPRESENTATIONAL_CONSISTENCY
EG (Egypt) This rule checks the schema of the Business Registration Number (Egypt). The rule also checks if all applicable hyphens, dots and spaces are in place. INFO IDEA
Identifier format inaccurate (Business Registration Number (Turkey))
REPRESENTATIONAL_CONSISTENCY
TR (Turkey) This rule checks the schema of the Business Registration Number (Turkey). The rule also checks if all applicable hyphens, dots and spaces are in place. INFO IDEA
Identifier format inaccurate (Business number (Canada))
REPRESENTATIONAL_CONSISTENCY
CA (Canada) This rule checks the syntax, i.e. format of the Business number in Canada with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The Canadian business number consists of exactly 9 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (CIF number (Spain))
REPRESENTATIONAL_CONSISTENCY
ES (Spain) This rule checks the syntax, i.e. format of the CIF number in Spain with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

CIF number (Spain) consists of a letter followed by 8 digits or by 7 digits and a letter. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (CNPJ number (Brazil))
REPRESENTATIONAL_CONSISTENCY
BR (Brazil) This rule checks the syntax, i.e. format of the CNPJ number (Brazil) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The CNPJ consists of a 14-digit number formatted as 00.000.000/0001-00. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots situated in the places where they are not expected to be then the rule is violated.

INFO IDEA
Identifier format inaccurate (CPR number (Greenland))
REPRESENTATIONAL_CONSISTENCY
GL (Greenland) This rule checks the schema of the CPR number (Greenland): DDDDDD-DDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (CUIT number (Argentina))
REPRESENTATIONAL_CONSISTENCY
AR (Argentina) This rule checks the syntax, i.e. format of CUIT number (Argentina) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The CUIT number in Argentina consists of 11 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (CURP number (Mexico))
REPRESENTATIONAL_CONSISTENCY
MX (Mexico) CURP number (Mexico) consists of 18 characters. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the CURP number (Mexico) with respect to the CURP number (Mexico)

INFO IDEA(2020-01-01)
Identifier format inaccurate (Chamber of Commerce Number (Egypt))
REPRESENTATIONAL_CONSISTENCY
EG (Egypt) This rule checks the schema of the Chamber of Commerce Number (Egypt): LL-DD-DDDDDDDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Chamber of Commerce Number (Morocco))
REPRESENTATIONAL_CONSISTENCY
MA (Morocco) This rule checks the schema of the Chamber of Commerce Number (Morocco): L.L. DDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Chamber of Commerce Number (Turkey))
REPRESENTATIONAL_CONSISTENCY
TR (Turkey) This rule checks the schema of the Chamber of Commerce Number (Turkey). The rule also checks if all applicable hyphens, dots and spaces are in place. INFO IDEA
Identifier format inaccurate (Common business identifier (Morocco))
REPRESENTATIONAL_CONSISTENCY
MA (Morocco) This rule checks the schema of the Common business identifier (Morocco): DDDDDDDDDDDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Company identification number (Switzerland))
REPRESENTATIONAL_CONSISTENCY
CH (Switzerland) This rule checks the syntax, i.e. format of the Company identification number (Switzerland) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

For better readability, a hyphen is put between the prefix and the digital part of the UID-number. Analogous to the prefix, the numerical part is split into three blocks of three numbers, each block being separated from the other by a dot. The structure of a UID-number can be modelled as follows: CHE-999.999.999 HR or RC or HR/MWST or RC/TVA or RC/IVA

Depending on which part of Switzerland this extension will change as follows: German part: MWST French part: TVA Italian part: IVA

INFO IDEA(2020-01-01)
Identifier format inaccurate (Company registration number (United Kingdom))
REPRESENTATIONAL_CONSISTENCY
GB (United Kingdom of Great Britain and Northern Ireland) This rule checks the syntax, i.e. Company registration number (United Kingdom) The first 2 characters can be:
  • Companies formed in England and Wales have CRNs beginning with 0 (zero),
  • AC - Assurance Company England and Wales

,

  • FC - Foreign Company England and Wales

,

  • GE - European Economic Interest Grouping (EEIG) England and Wales

,

  • GN - EEIG Northern Ireland

,

  • GS - EEIG Scotland

,

  • IC - Investment Company with Variable Capital (ICVC) England and Wales

,

  • IP - Industrial and Provident England and Wales

,

  • LP - Limited Partnership England and Wales

,

  • NA - Assurance Company Northern Ireland

,

  • NF - Foreign Company Northern Ireland

,

  • NI - Northern Ireland Company

,

  • NL - Limited Partnership Northern Ireland (This prefix is not applicable to CT and should not be used)

,

  • NO - Other Northern Ireland

,

  • NP - Industrial and Provident Northern Ireland

,

  • NR - Royal Charter Northern Ireland

,

  • NZ - Not Companies Act Northern Ireland

,

  • OC - Other England and Wales (This prefix is only used for LLP cases in liquidation)

,

  • R - Northern Ireland Company registered before the partition of Ireland in 1922

,

  • RC - Royal Charter England and Wales
  • SA - Assurance Company Scotland

,

  • SC - Scottish Company

,

  • SF - Foreign Company Scotland

,

  • SI - Investment Company with Variable Capital (ICVC) Scotland

,

  • SL - Limited Partnership Scotland (Companies registered under this prefix are not liable to Corporation Tax and must not be set up on COTAX)

,

  • SO - Other Scotland (This prefix must only be used for LLP cases in liquidation)

,

  • SP - Industrial / Provident Scotland

,

  • SR - Royal Charter Scotland

,

  • SZ - Not Companies Act Scotland

,

  • ZC - Not Companies Act England and Wales.

This rule checks presence of exactly 8 digits or prefix and 6 or 7 digits.

INFO ARCHIVED
Identifier format inaccurate (Corporation ID (South Korea))
REPRESENTATIONAL_CONSISTENCY
KR (South Korea) Corporation ID (South Korea) consists of 10 characters. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the Corporation ID (South Korea) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (DIC number (Czech Republic))
REPRESENTATIONAL_CONSISTENCY
CZ (Czechia) The DIC number (Czech Republic) for legal entities consists of 8 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots situated in the places where they are not expected to be then the rule is violated.

This rule checks the syntax, i.e. format of the DIC number (Czech Republic) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (DIC number (Slovakia))
REPRESENTATIONAL_CONSISTENCY
SK (the Slovak Republic) DIC number (Slovakia) consists of 10 digits. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the DIC number (Slovakia) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (EIN (United States))
REPRESENTATIONAL_CONSISTENCY
US (United States of America) This rule checks the syntax, i.e. format of the Employer identification number (United States) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The EIN (USA) consists of exactly 9 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Enterprise number (Belgium))
REPRESENTATIONAL_CONSISTENCY
BE (Belgium) This rule checks the syntax, i.e. format of the Enterprise number in Belgium with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The Enterprise number (Belgium) consists of 10 digits without any whitespaces. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Austria))
REPRESENTATIONAL_CONSISTENCY
AT (Austria) This rule checks the syntax, i.e. format of the European value added tax identifier in Austria with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier (Austria) consists of the prefix "AT" followed by the character "U" and 8 numerical digits. The rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any white spaces, dots or hyphens then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Belgium))
REPRESENTATIONAL_CONSISTENCY
BE (Belgium) This rule checks the syntax, i.e. format of the European value added tax identifier in Belgium with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier (Belgium) consists of exact 10 numerical digits prefixed by "BE". The first digit following the prefix is always 0 or 1. The rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any white spaces, dots or hyphens then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Bulgaria))
REPRESENTATIONAL_CONSISTENCY
BG (Bulgaria) This rule checks the syntax, i.e. format of the European value added tax identifier in Bulgaria with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Bulgaria consists of 9 or 10 numerical digits prefixed by "BG". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Croatia))
REPRESENTATIONAL_CONSISTENCY
HR (Croatia) This rule checks the syntax, i.e. format of the European value added tax identifier in Croatia with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Croatia consists of the prefix "HR" followed by 11 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Cyprus))
REPRESENTATIONAL_CONSISTENCY
CY (Cyprus) This rule checks the syntax, i.e. format of the European value added tax identifier in Cyprus with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Cyprus consists of 9 characters (8 numerical digits + 1 letter) prefixed by "CY". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Czech Republic))
REPRESENTATIONAL_CONSISTENCY
CZ (Czechia) This rule checks the syntax, i.e. format of the European value added tax identifier in Czech Republic with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

European value added tax identifier (Czech Republic) consists of 8-10 digits prefixed by "CZ". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Denmark))
REPRESENTATIONAL_CONSISTENCY
DK (Denmark) This rule checks the syntax, i.e. format of the European value added tax identifier in Denmark with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Denmark consists of exact 8 numerical digits prefixed by "DK". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Estonia))
REPRESENTATIONAL_CONSISTENCY
EE (Estonia) This rule checks the syntax, i.e. format of the European value added tax identifier in Estonia with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Estonia consists of exact 9 numerical digits prefixed by "EE". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Finland))
REPRESENTATIONAL_CONSISTENCY
FI (Finland) This rule checks the syntax, i.e. format of the European value added tax identifier in Finland with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Denmark consists of exact 8 numerical digits prefixed by "FL". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (France))
REPRESENTATIONAL_CONSISTENCY
FR (France) This rule checks the syntax, i.e. format of the European value added tax identifier in France with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in France consists of the prefix "FR" followed by two numerical or non-numerical digits followed by 9 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Germany))
REPRESENTATIONAL_CONSISTENCY
DE (Germany) This rule checks the syntax, i.e. format of the European value added tax identifier in Germany with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Germany consists of exact 9 numerical digits prefixed by "DE". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Greece))
REPRESENTATIONAL_CONSISTENCY
GR (Greece) This rule checks the syntax, i.e. format of the European value added tax identifier in Greece with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Greece consists of prefix "EL" followed by 9 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Hungary))
REPRESENTATIONAL_CONSISTENCY
HU (Hungary) This rule checks the syntax, i.e. format of the European value added tax identifier in Hungary with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Hungary consists of exact 8 numerical digits prefixed by "HU". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Ireland))
REPRESENTATIONAL_CONSISTENCY
IE (Ireland) This rule checks the syntax, i.e. format of the European value added tax identifier (Ireland) INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Italy))
REPRESENTATIONAL_CONSISTENCY
IT (Italy) This rule checks the syntax, i.e. format of the European value added tax identifier in Italy with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Italy consists of exact 11 numerical digits prefixed by "IT". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Latvia))
REPRESENTATIONAL_CONSISTENCY
LV (Latvia) This rule checks the syntax, i.e. format of the European value added tax identifier in Latvia with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Latvia consists of exact 11 numerical digits prefixed by "LV". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Lithuania))
REPRESENTATIONAL_CONSISTENCY
LT (Lithuania) This rule checks the syntax, i.e. format of the European value added tax identifier (Lithuania) INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Malta))
REPRESENTATIONAL_CONSISTENCY
MT (Malta) This rule checks the syntax, i.e. format of the European value added tax identifier in Malta with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Malta consists of exact 8 numerical digits prefixed by "MT". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Poland))
REPRESENTATIONAL_CONSISTENCY
PL (Poland) This rule checks the syntax, i.e. format of the European value added tax identifier in Poland with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Poland consists of exact 10 numerical digits prefixed by "PL". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Portugal))
REPRESENTATIONAL_CONSISTENCY
PT (Portugal) This rule checks the syntax, i.e. format of the European value added tax identifier inPortugal with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Portugal consists of exact 9 numerical digits prefixed by "PT". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Romania))
REPRESENTATIONAL_CONSISTENCY
RO (Romania) This rule checks the syntax, i.e. format of the European value added tax identifier in Romania with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Romania consists of 2-10 numerical digits prefixed by "RO". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Slovakia))
REPRESENTATIONAL_CONSISTENCY
SK (the Slovak Republic) This rule checks the syntax, i.e. format of the European value added tax identifier (Slovakia) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Slovakia consists of exact 10 numerical digits prefixed by "SK". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Slovenia))
REPRESENTATIONAL_CONSISTENCY
SI (Slovenia) This rule checks the syntax, i.e. format of the European value added tax identifier in Slovenia with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Slovenia consists of exact 8 numerical digits prefixed by "SI". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Spain))
REPRESENTATIONAL_CONSISTENCY
ES (Spain) This rule checks the syntax, i.e. format of the European value added tax identifier (Spain) INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (Sweden))
REPRESENTATIONAL_CONSISTENCY
SE (Sweden) This rule checks the syntax, i.e. format of the European value added tax identifier in Slovakia with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Sweden consists of exact 12 numerical digits prefixed by "SE". This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (The Netherlands))
REPRESENTATIONAL_CONSISTENCY
NL (Netherlands) This rule checks the syntax, i.e. format of the European value added tax identifier in the Netherlands with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The European value added tax identifier in Netherlands consists of the prefix "NL", followed by 9 numerical digits + the character "B" + 2 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (European value added tax identifier (United Kingdom))
REPRESENTATIONAL_CONSISTENCY
GB (United Kingdom of Great Britain and Northern Ireland) This rule checks the syntax, i.e. format of the European value added tax identifier (United Kingdom) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

European value added tax identifier (United Kingdom) consists of code GB followed by either:

  • A) standard: 9 digits (block of 3, block of 4, block of 2 – e.g. GB999 9999 73),
  • B) branch traders: 12 digits (as for 9 digits, followed by a block of 3 digits),
  • C) government departments: the letters GD then 3 digits from 000 to 499 (e.g. GBGD001)
  • D) health authorities: the letters HA then 3 digits from 500 to 999 (e.g. GBHA599)

This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO ARCHIVED(2020-01-01)
Identifier format inaccurate (Faroese Identification Number (Faroe Islands))
REPRESENTATIONAL_CONSISTENCY
FO (Faroe Islands) This rule checks the schema of the Faroese Identification Number (Faroe Islands): DDDDDD-DDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Faroese P Number (Faroe Islands))
REPRESENTATIONAL_CONSISTENCY
FO (Faroe Islands) This rule checks the schema of the Faroese P Number (Faroe Islands): DDDDDD-DDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Faroese V Number (Faroe Islands))
REPRESENTATIONAL_CONSISTENCY
FO (Faroe Islands) This rule checks the schema of the Faroese V Number (Faroe Islands): range from D to DDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Fiscal code (Italy))
REPRESENTATIONAL_CONSISTENCY
IT (Italy) STCD1 in Italy can consist either of 11 digits (equal to EU_VAT_ID_IT without prefix "IT") for legal entities or 16 alphanumeric digits for freelancers and natural persons.

Fiscal code (Italy) consists of 16 characters.This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the Fiscal code (Italy) with respect to the Fiscal code (Italy)

INFO IDEA
Identifier format inaccurate (GER number (Greenland))
REPRESENTATIONAL_CONSISTENCY
GL (Greenland) This rule checks the schema of the GER number (Greenland): DDDDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (GST number (Canada))
REPRESENTATIONAL_CONSISTENCY
CA (Canada) The Goods and Services Tax number (Canada) consists of 15 characters. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

This rule checks the syntax, i.e. format of the Goods and Services Tax number (Canada) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (GST number (India))
REPRESENTATIONAL_CONSISTENCY
IN (India) GST number (India) consists of 15 characters. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the GST number (India) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (GUI registration number)
REPRESENTATIONAL_CONSISTENCY
TW (Taiwan) GUI registration number consists of two l3etters followed by 8 digits. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the GUI registration number with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Government Gazette Number (Turkey))
REPRESENTATIONAL_CONSISTENCY
TR (Turkey) This rule checks the schema of the Government Gazette Number (Turkey). The rule also checks if all applicable hyphens, dots and spaces are in place. INFO IDEA
Identifier format inaccurate (ICO number (Czech Republic))
REPRESENTATIONAL_CONSISTENCY
CZ (Czechia) The ICO number (Czech Republic) for legal entities consists of 8 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

This rule checks the syntax, i.e. format of the ICO number (Czech Republic) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (ICO number (Slovakia))
REPRESENTATIONAL_CONSISTENCY
SK (the Slovak Republic) ICO number (Slovakia) consists of 8 digits. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the DIC number (Slovakia) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (INN (Russia))
REPRESENTATIONAL_CONSISTENCY
RU (Russian Federation) INN (Russia) consists of 10 digits for legal entities or 12 digits for individuals. This rule checks possible whitespaces, hyphens where they are not specified in a reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation. INFO IDEA(2020-01-01)
Identifier format inaccurate (Icelandic Identification Number (Iceland))
REPRESENTATIONAL_CONSISTENCY
IS (Iceland) This rule checks the schema of the Icelandic Identification Number (Iceland): DDDDDD-DDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (KPP number (Russia))
REPRESENTATIONAL_CONSISTENCY
RU (Russian Federation) KPP number (Russia) consists of 9 digits. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the KPP number (Russia) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (KRS number (Poland))
REPRESENTATIONAL_CONSISTENCY
PL (Poland) KRS number (Poland) consists of 10 digits. No any dots/hyphens/white spaces are expected. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the KRS number (Poland) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA
Identifier format inaccurate (NIF number (Spain))
REPRESENTATIONAL_CONSISTENCY
ES (Spain) NIF number (Spain) consists of 8 digits and one checksum letter. This rule checks possible whitespaces, hyphens where they are not specified in a reference \w)\d\w$ INFO IDEA(2020-01-01)
Identifier format inaccurate (NIP number (Poland))
REPRESENTATIONAL_CONSISTENCY
PL (Poland) NIP number (Poland) consists of 10 digits. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the NIP number (Poland) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (NIT number (Bolivia))
REPRESENTATIONAL_CONSISTENCY
BO (Bolivia) This rule checks the schema of NIT number (Bolivia): range from DDDDDDD to DDDDDDDDDD. INFO RELEASED(2024-04-16)
Identifier format inaccurate (NIT number (Columbia))
REPRESENTATIONAL_CONSISTENCY
CO (Colombia) The NIT number consists of : 9 digits, one dash, 1 check digit (0-9). This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots situated in the places where they are not expected to be then the rule is violated.

This rule checks the syntax, i.e. format of the NIT number (Columbia) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (National business register identifier (Austria))
REPRESENTATIONAL_CONSISTENCY
AT (Austria) National business register identifier (Austria) consists of 7 characters: max 6 letters and 1 checksum letter. No any dots/hyphens/white spaces are expected. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the National business register identifier (Austria) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Numero de Identificacion de Extranjero (Spain))
REPRESENTATIONAL_CONSISTENCY
ES (Spain) Numero de Identificacion de Extranjero (Spain) consists of a letter followed by 7 digits and one checksum letter. This rule checks possible whitespaces, hyphens where they are not specified in a reference Numero de Identificacion de Extranjero (Spain) INFO IDEA(2020-01-01)
Identifier format inaccurate (OKPO code (Russia))
REPRESENTATIONAL_CONSISTENCY
RU (Russian Federation) OKPO code (Russia) consists of 8 digits. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the OKPO code (Russia) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Organization Registration Number (Sweden))
REPRESENTATIONAL_CONSISTENCY
SE (Sweden) Organization registration number (Sweden) consists of 10 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the Organization registration number (Sweden) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (PAN code (India))
REPRESENTATIONAL_CONSISTENCY
IN (India) PAN code (India) consists of 10 characters: 5 letters, 4 digits and 1 letter. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the PAN code (India) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Polish Tax Identifier)
REPRESENTATIONAL_CONSISTENCY
PL (Poland) Polish Tax Identifier consists of 11 digits. This rule checks possible whitespaces, hyphens where they are not specified in a reference Polish Tax Identifier INFO IDEA(2020-01-01)
Identifier format inaccurate (Provincial Sales Tax number (Canada))
REPRESENTATIONAL_CONSISTENCY
CA (Canada) Provincial Sales Tax number (Canada) consists of 11 characters with 2 delimeters: "PST" abbreviation followed by "-" and a delimeter after the first 4 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens situated in wrong places then the rule is violated.

This rule checks the syntax, i.e. format of the Provincial Sales Tax number (Canada) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Quebec Sales Tax number (Canada))
REPRESENTATIONAL_CONSISTENCY
CA (Canada) Quebec Sales Tax number (Canada) consists of 16 characters: 9 digits, 1 check digit, abbreviation "QT" followed by 4 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the Quebec Sales Tax number (Canada) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (REGON (Poland))
REPRESENTATIONAL_CONSISTENCY
PL (Poland) REGON (Poland) consists of 9 or 14 digits. This rule checks possible whitespaces, hyphens where they are not specified in a reference format or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the REGON (Poland) with respect to the format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (RIF number (Venezuela))
REPRESENTATIONAL_CONSISTENCY
VE (Venezuela) This rule checks the schema of the RIF number (Venezuela): L-DDDDDDDD-D. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (RUT number (Chile))
REPRESENTATIONAL_CONSISTENCY
CL (Chile) This rule checks the syntax, i.e. format of the RUT number (Chile) with respect to the RUT number (Chile) INFO IDEA(2020-01-01)
Identifier format inaccurate (SIREN number (France))
REPRESENTATIONAL_CONSISTENCY
FR (France) This rule checks the syntax, i.e. format of the SIREN number in France with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The SIREN number consists of exact 9 numerical digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Social Security Number (Jersey))
REPRESENTATIONAL_CONSISTENCY
JE (Jersey) This rule checks the schema of the Social Security Number (Jersey): LLDDDDDDL. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Social Security Number (San Marino))
REPRESENTATIONAL_CONSISTENCY
SM (San Marino) This rule checks the schema of the Social Security Number (San Marino): range from DD to DDDDDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (State tax number (Brazil))
REPRESENTATIONAL_CONSISTENCY
BR (Brazil) This rule checks the syntax, i.e. format of the State tax number (Brazil) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

The State tax number (Brazil) consists of 12 digits (where 9th and 12th digits are checksum digits). For the farmers it follows a different model: starts with a letter “P” and has 13 digits, where the 10th digits is a check digit.

INFO ARCHIVED
Identifier format inaccurate (TVA number (Tunisia))
REPRESENTATIONAL_CONSISTENCY
TN (Tunisia) This rule checks the schema of TAV number (Tunisia): DDDDDDDDD. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Tax Reference Number (Jersey))
REPRESENTATIONAL_CONSISTENCY
JE (Jersey) This rule checks the schema of the Tax Reference Number (Jersey): range from LLD to LLDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Tax Registration Number (Egypt))
REPRESENTATIONAL_CONSISTENCY
EG (Egypt) This rule checks the schema of the Tax Registration Number (Egypt): DDD-DDD-DDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Tax Registration Number (Morocco))
REPRESENTATIONAL_CONSISTENCY
MA (Morocco) This rule checks the schema of the Tax Registration Number (Morocco): DDDDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Tax Registration Number (San Marino))
REPRESENTATIONAL_CONSISTENCY
SM (San Marino) This rule checks the schema of the Tax Registration Number (San Marino): LLDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Tax identification number (Egypt))
REPRESENTATIONAL_CONSISTENCY
EG (Egypt) This rule checks the schema of the Tax identification number (Egypt): DDDDDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Tax number (Romania))
REPRESENTATIONAL_CONSISTENCY
RO (Romania) Tax number in Romania consists of 13 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, dots or hyphens then the rule is violated.

This rule checks the syntax, i.e. format of the Tax number (Romania) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Tax number (Turkey))
REPRESENTATIONAL_CONSISTENCY
TR (Turkey) This rule checks the schema of the Tax number (Turkey): DDDDDDDDDD . The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)
Identifier format inaccurate (Tax office name (Turkey))
REPRESENTATIONAL_CONSISTENCY
TR (Turkey) This rule checks the schema of the Tax office name (Turkey). The rule also checks if all applicable hyphens, dots and spaces are in place. INFO IDEA
Identifier format inaccurate (UEN number (Singapore))
REPRESENTATIONAL_CONSISTENCY
SG (Singapore) Unique Entity Number (UEN) is a 9- or 10-digit alphanumeric code which can have one of the following formats:

1) 8 numbers + 1 checksum letter 2) 9 numbers + 1 checksum letter 3) "LnnLLnnnnL", where n - digit, L - letter

This rule checks possible whitespaces, hyphens where they are not specified in a reference UEN number (Singapore)

INFO IDEA(2020-01-01)
Identifier format inaccurate (Unified Business Identifier Number (United States - Washington))
REPRESENTATIONAL_CONSISTENCY
US (United States of America) This rule checks the syntax, i.e. format of the Unified Business Identifier Number (United States - Washington) with respect to the reference format. Any deviation (i.e. white spaces where they are not specified) result in a violation.

Unified Business Identifier Number (United States - Washington) consists of 9 digits. This rule checks possible whitespaces, hyphens or dots that might be comprised in the identifier value. If there are any whitespaces, hyphens or dots then the rule is violated.

INFO IDEA(2020-01-01)
Identifier format inaccurate (Unified identification code (Bulgaria))
REPRESENTATIONAL_CONSISTENCY
BG (Bulgaria) This rule checks the syntax, i.e. format of the Unified identification code (Bulgaria) with respect to the Unified identification code (Bulgaria). The Identifier consists of 9 digits or (for a branch) 13 digits. INFO IDEA(2020-01-01)
Identifier format inaccurate (Value Added Tax Number (Iceland))
REPRESENTATIONAL_CONSISTENCY
IS (Iceland) This rule checks the schema of the Value Added Tax Number (Iceland): LLDDDDD or LLDDDDDD. The rule also checks if all applicable hyphens, dots and spaces are in place. INFO RELEASED(2024-04-16)