LI Jiao1, LIU Quan1, FU Qi-ming1, et al. Record matching method based on local CON model in distributed database[J]. 2011, 32(7): 196-202.DOI:
分布式数据库中基于局部CON模型的记录匹配方法
摘要
针对现有记录匹配方法需要相关领域专家大量的人工参与或严重依赖于启发式规则
且无法处理大规模数据的问题
提出一种基于局部CON模型的记录匹配方法。该方法利用关联规则发现算法挖掘匹配依赖
将匹配依赖和数据实例同时作为改进型tableau的输入
检测匹配得出结果。实验结果和理论分析表明
该方法能快速识别出分布式记录匹配情况
且不需要人工参与
效率有非常明显的提高。
Abstract
For existing record matching methods needed much artificial participation of experts or depend on heuristic rules heavily
and they could not handle the problem of large-scale data
A record matching method based on the local CON model was proposed.The approach used algorithm of association rules to get the match dependence
then took both match dependence and data instances as the input of the improved tableau
finally got the result by match detection.Ex-perimental results and theoretical analysis show that the method can quickly identify whether the distributed records matched