Precuration

From CDQ
API/Data Curation API/Profile/PRECURATION
Jump to navigation Jump to search


Name Name of a concept, e.g. a data model concept. In contrast to terms, the name does not depend on a given context, e.g. a country-specific language. Precuration
Description Informal and comprehensive human-readable definition of a concept. Parses, preprocess and harmonize business partner data without additional enrichment.
Technical key Defines a unique key by which e.g. data model concepts can be referenced in a technical integration context. These keys are unique in the CDL context. PRECURATION
API  API/Data Curation API

Activated features

 NameDescription
Extract address contextExtract address context
  • Identifies and enriches address context
  • Moves name part after legal form to address context
Extract care ofExtract care of
  • Extracts care of from name and sets it in address.
  • Moves care of information from name local to careOf
  • Removes care of information from name international.
Extract doing business asExtract doing business as
  • Extracts doing business as from name local and international and adds to names list.
  • Removes duplicates of DBAs
  • Trims spaces
Normalize addressNormalize address
  • Capitalize locality, thoroughfare and premise
  • Normalize first level of the locality to CDQ standards
Normalize business partnerNormalize business partner
  • Normalizes hyphens to `-` in identifiers
  • Removes redundant identifires
  • Removes all identifiers from the given set that do not have a valid identifier value
Parse addressParse address
  • Harmonize address data by parsing thoroughfare numbers, etc.
  • Tries to detect country if data invalid
  • Ensure that country shortname and value are set and existent
  • Extracts Postal Delivery Point value and number from PostCode
  • Removes or changes local elements in thoroughfare like esquina (spanish), mieszkania (polish), etc.
  • Extracts building number from thoroughfare
  • Identify and extract premise information, see also Premise Enrichment and Harmonization for details
  • Identifiy special patterns and extract accordingly such as
    • Ensure that kilometre patterns are identified and extracted
    • Detects and moves industrial zone from address fields to a premise of type industrial zone.
Parse legal formParse legal form
  • Tries to detect a valid legalform from a businesspartner's localName with respect to its countrycode and writes the detected legalform to the 'legalForm' field in the businesspartner. If no valid legalform is found, the legalform field is untouched
Parse namesParse names
  • Removes parts from a given entity's name that are not part of the name.
  • Copies name international to local if local is empty
  • Normalizes name international to look like name local without accents and specific characters
  • Abbreviates names and sets calculated values to short names.
  • Detects and sets categories.
  • Removes or changes double spaces, special or local characters, etc.
Preprocess addressPreprocess address
  • Trims each field of the address
  • Removes prefix like (D, D-, DE, DE-) from post code
  • Removes hyphens which are between words and number in thoroughfare value
Preprocess business partnerPreprocess business partner
  • Prepare provided business partner data for further processing (i.e. trimming values).
Remove contact informationRemove contact information
  • Removes contact information from names