Hi,
We want to apply 'fuzzy logic' (approximate string matching) to name columns in two tables to find the closest match or a match percentage.
Ex: table_1, col_name_1 and table_2, col_name_2
table_1, col_name_1 - values:
Dr. John Smith
Dr. John Smith Jr.
Dr. J. R. Smith III
Dr. John Smith, D.D.S.
Dr. John L. Smith, M.D.
John Q. Smith, M.D.
John Q. Van Morrisson, M.D
table_2, col_name_2 - value:
John Smith
RapidMiner has no fuzzy logic operator, but you can try the Levenshtein distance operator in the new Operator Toolbox extension: http://community.rapidminer.com/t5/Community-Blog/Introducing-the-Operator-Toolbox-Extension/ba-p/36748
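
To show what a Levenshtein-based match percentage might look like on the names above, here is a rough Python sketch. It is not the Operator Toolbox operator and does not use any RapidMiner API. The helper names (levenshtein, match_percentage), the lowercasing step, and the "1 - distance / longer length" scoring formula are all assumptions made for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of single-character inserts,
    deletes, or substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string on the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete from a
                current[j - 1] + 1,      # insert into a
                previous[j - 1] + cost,  # substitute (or keep)
            ))
        previous = current
    return previous[-1]


def match_percentage(a: str, b: str) -> float:
    """Hypothetical score: distance normalized by the longer string's length."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 100.0
    return 100.0 * (1 - levenshtein(a, b) / longest)


# Values taken from the question (table_1.col_name_1 and table_2.col_name_2).
table_1_names = [
    "Dr. John Smith",
    "Dr. John Smith Jr.",
    "Dr. J. R. Smith III",
    "Dr. John Smith, D.D.S.",
    "Dr. John L. Smith, M.D.",
    "John Q. Smith, M.D.",
    "John Q. Van Morrisson, M.D",
]
target = "John Smith"

# Compare case-insensitively and list the best matches first.
ranked = sorted(
    ((match_percentage(name.lower(), target.lower()), name) for name in table_1_names),
    reverse=True,
)
for score, name in ranked:
    print(f"{score:5.1f}%  {name}")
```

Titles and suffixes such as "Dr." or "M.D." count as edits, so stripping them before comparing will likely give scores that better reflect the actual name.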