admin_functionality:create_session:step_2_set_attribute_groups

Differences

This shows you the differences between two versions of the page.

admin_functionality:create_session:step_2_set_attribute_groups [2021/03/04 20:16]
conor.doyle
admin_functionality:create_session:step_2_set_attribute_groups [2021/03/04 20:17]
conor.doyle
Line 53: Line 53:
 E.g. Email could be used for matching, where sales@dqglobal.com returns multiple duplicate records. However, when the data is scored with a score group 'Name' that includes the attributes 'First Name' and 'Last Name', these records will not score as duplicates. This setup would look like this:
  
-{{ :admin_functionality:create_session:scoring_vs_matching_example.png?800 |}} +{{ :admin_functionality:create_session:scoring_vs_matching_example.png?800 |}} 
- +
----- +
-===== Match Key ===== +
- +
-You can set the ‘Match Key’ for each group using a drop-down selection. The 'Match Key' selects the algorithm used to generate phonetic match tokens. The drop-down offers six choices: +
- +
-==== Soundex ==== +
- +
-Soundex retains the first letter of the input string as the start of its match token. It then removes the vowels (a, e, i, o, u) and the letters h and w from the rest of the string, and assigns numbers to the remaining letters using a lookup table to produce a token of 4 characters. +
- +
-This means ‘Cathy’ and ‘Kathy’ will not match, because their match tokens begin with different letters: ‘C’ for Cathy and ‘K’ for Kathy. As such, Soundex performs poorly where the starts of two words sound the same but are spelled differently. +
-Also, because of the numeric substitution, Soundex can return false positives: records that share a match token but are not true matches. +
- +
-{{ :admin_functionality:settings:kathy.png?nolink&600 |}} +
- +
-==== DQSoundex ==== +
- +
-DQSoundex extends Soundex with the advanced capabilities of DQFonetix™. This improves the start-of-word logic and modifies the first letter(s) of an input string. DQSoundex will de-pluralise and pre-process the start of words to manage variances such as ‘C’ and ‘K’ in 'Cathy' and 'Kathy', or ‘Ph’ in 'Phonetix' and ‘F’ in 'Fonetix'. +
- +
-==== Metaphone ==== +
- +
-Metaphone improves the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation, to produce a more accurate encoding. +
-  +
-This allows you to find more precise matches than the simple Soundex algorithm. Metaphone considers a larger set of character transformations than Soundex and therefore analyses a string phonetically with far more accuracy.  +
-  +
- +
-==== DQMetaphone ==== +
- +
-DQMetaphone, like DQSoundex, enhances its base algorithm (here Metaphone) with the advanced capabilities of DQFonetix™. This improves the start-of-word logic and modifies the first letter(s) of an input string. DQMetaphone will de-pluralise and pre-process the start of words to manage variances and improve matching. +
- +
-In the case shown below (Christopher), Metaphone would have matched only three of the five names. After applying DQ’s advanced algorithms and logic, DQMetaphone allows the ‘Kh’ in 'Khristopher' to match the ‘Ch’ in 'Christopher', so both generate the same match key token. +
- +
-{{ :admin_functionality:settings:christopher.png?nolink&600 |}} +
- +
-==== DQFonetix™ ==== +
- +
-DQFonetix™ contains our advanced phonetic algorithms, developed by DQ Global over the last 25 years. The algorithm is proprietary to DQ Global, so we do not share its specification. However, DQFonetix™ has four key features: +
- +
-  * Five spoken languages – English, Spanish, French, Italian and German +
-  * Avoids false matches +
-  * Overcomes character variances  +
-  * Deals with diacritics +
- +
-DQFonetix™ provides your CRM system with the most varied matching window, highlighting duplicates that Soundex and Metaphone may miss or match falsely. +
- +
-==== No Match Key ==== +
- +
-Selecting 'No Match Key' does not generate a phonetic token, so no match token is produced. Records in the group are instead matched on identical strings. +
- +
-**Include for Matching:** Use the ‘Include for Matching’ checkbox to select the attribute groups to be used for matching. By default, a group is included for matching; uncheck the box to exclude it. Groups that are not included for matching are still used for percentage scoring. +
  
 ---- ----
Line 165: Line 114:
 {{ :admin_functionality:create_session:scoring_vs_matching_example.png?800 |}}
  
 +----
 +===== Match Key =====
 +
 +You can set the ‘Match Key’ for each group using a drop-down selection. The 'Match Key' selects the algorithm used to generate phonetic match tokens. The drop-down offers six choices:
 +
 +==== Soundex ====
 +
 +Soundex retains the first letter of the input string as the start of its match token. It then removes the vowels (a, e, i, o, u) and the letters h and w from the rest of the string, and assigns numbers to the remaining letters using a lookup table to produce a token of 4 characters.
 +
 +This means ‘Cathy’ and ‘Kathy’ will not match, because their match tokens begin with different letters: ‘C’ for Cathy and ‘K’ for Kathy. As such, Soundex performs poorly where the starts of two words sound the same but are spelled differently.
 +Also, because of the numeric substitution, Soundex can return false positives: records that share a match token but are not true matches.
 +
 +{{ :admin_functionality:settings:kathy.png?nolink&600 |}}
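
The Soundex behaviour described above can be illustrated with a minimal Python sketch. It assumes the widely published American Soundex rules (which also drop the letter y and treat adjacent letters with the same code as one); it is not DQ Global's implementation, and the function name `soundex` is chosen here purely for illustration.

```python
# Lookup table of the standard (American) Soundex: consonant class -> digit.
# Assumption: this mirrors the public algorithm, not DQ Global's own code.
_CODES = {}
for _letters, _digit in (
    ("BFPV", "1"),
    ("CGJKQSXZ", "2"),
    ("DT", "3"),
    ("L", "4"),
    ("MN", "5"),
    ("R", "6"),
):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(name: str) -> str:
    """Return a 4-character Soundex token, or "" if name has no A-Z letters."""
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    first = letters[0]
    token = first                   # the first letter is kept as-is
    prev = _CODES.get(first, "")    # its code still suppresses a repeat
    for ch in letters[1:]:
        if ch in "HW":
            continue                # h and w are dropped entirely
        code = _CODES.get(ch, "")   # vowels (and y) have no code
        if code and code != prev:
            token += code
        prev = code
    return (token + "000")[:4]      # pad or truncate to 4 characters


print(soundex("Cathy"), soundex("Kathy"))    # different first letters
print(soundex("Robert"), soundex("Rupert"))  # same token for different names
```

With these rules 'Cathy' and 'Kathy' receive different tokens because the first letter is retained, while 'Robert' and 'Rupert' share a token, which is the kind of false positive that numeric substitution can produce.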
 +
 +==== DQSoundex ====
 +
 +DQSoundex extends Soundex with the advanced capabilities of DQFonetix™. This improves the start-of-word logic and modifies the first letter(s) of an input string. DQSoundex will de-pluralise and pre-process the start of words to manage variances such as ‘C’ and ‘K’ in 'Cathy' and 'Kathy', or ‘Ph’ in 'Phonetix' and ‘F’ in 'Fonetix'.
 +
 +==== Metaphone ====
 +
 +Metaphone improves the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation, to produce a more accurate encoding.
 + 
 +This allows you to find more precise matches than the simple Soundex algorithm. Metaphone considers a larger set of character transformations than Soundex and therefore analyses a string phonetically with far more accuracy. 
 + 
 +
 +==== DQMetaphone ====
 +
 +DQMetaphone, like DQSoundex, enhances its base algorithm (here Metaphone) with the advanced capabilities of DQFonetix™. This improves the start-of-word logic and modifies the first letter(s) of an input string. DQMetaphone will de-pluralise and pre-process the start of words to manage variances and improve matching.
 +
 +In the case shown below (Christopher), Metaphone would have matched only three of the five names. After applying DQ’s advanced algorithms and logic, DQMetaphone allows the ‘Kh’ in 'Khristopher' to match the ‘Ch’ in 'Christopher', so both generate the same match key token.
 +
 +{{ :admin_functionality:settings:christopher.png?nolink&600 |}}
 +
 +==== DQFonetix™ ====
 +
 +DQFonetix™ contains our advanced phonetic algorithms, developed by DQ Global over the last 25 years. The algorithm is proprietary to DQ Global, so we do not share its specification. However, DQFonetix™ has four key features:
 +
 +  * Five spoken languages – English, Spanish, French, Italian and German
 +  * Avoids false matches
 +  * Overcomes character variances 
 +  * Deals with diacritics
 +
 +DQFonetix™ provides your CRM system with the most varied matching window, highlighting duplicates that Soundex and Metaphone may miss or match falsely.
 +
 +==== No Match Key ====
  
 +Selecting 'No Match Key' does not generate a phonetic token, so no match token is produced. Records in the group are instead matched on identical strings.
  
  
admin_functionality/create_session/step_2_set_attribute_groups.txt · Last modified: 2021/03/04 20:17 by conor.doyle