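I would like to know whether there is a simple way to compare columns before doing a join in dplyr. Below are two simple data frames. I want to join on first name and last name, but there are some typos and formatting differences, such as "Elizabeth Ray" versus "Elizabeth".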

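I want to compare these columns before joining. Is there a way to use a matching function, or a set operation such as intersect, to find the names that have no match across the two columns? I only want a list of the names that differ, so that I can correct them by hand before joining.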

I would like a solution based on dplyr, tidyr, and stringr.

FirstNames <- c("Chris", "Doug", "Shintaro", "Bubbles", "Elsa")
LastNames <- c("MacDougall", "Shapiro", "Yamazaki", "Murphy", "Elizabeth Ray")
Pets <- c("Cat", "Dog", "Cat", "Dog", "Cat")
Names1 <- data.frame(FirstNames, LastNames, Pets)

FirstNames2 <- c("Chris", "Doug", "Shintaro", "Bubbles", "Elsa")
LastNames2 <- c("MacDougal", "Shapiro", "Yamazaku", "Murphy", "Elizabeth")
Dwelling <- c("House", "House", "Apartment", "Condo", "House")
Names2 <- data.frame(FirstNames2, LastNames2, Dwelling)
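
For reference, here is a minimal sketch of the kind of check I have in mind. It assumes that an exact string comparison is enough and that dplyr's anti_join() is an acceptable "matching function". The names by_cols, only_in_1, and only_in_2 are made up for illustration, and I set stringsAsFactors = FALSE as an assumption so the name columns are plain character vectors on older versions of R.

```r
library(dplyr)

# Same data as in the question, rebuilt so this snippet runs on its own.
# stringsAsFactors = FALSE is an assumption: it keeps names as character.
Names1 <- data.frame(
  FirstNames = c("Chris", "Doug", "Shintaro", "Bubbles", "Elsa"),
  LastNames  = c("MacDougall", "Shapiro", "Yamazaki", "Murphy", "Elizabeth Ray"),
  Pets       = c("Cat", "Dog", "Cat", "Dog", "Cat"),
  stringsAsFactors = FALSE
)
Names2 <- data.frame(
  FirstNames2 = c("Chris", "Doug", "Shintaro", "Bubbles", "Elsa"),
  LastNames2  = c("MacDougal", "Shapiro", "Yamazaku", "Murphy", "Elizabeth"),
  Dwelling    = c("House", "House", "Apartment", "Condo", "House"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: maps key columns in Names1 to those in Names2.
by_cols <- c("FirstNames" = "FirstNames2", "LastNames" = "LastNames2")

# Rows of Names1 whose first/last name pair has no exact match in Names2.
only_in_1 <- anti_join(Names1, Names2, by = by_cols)

# The reverse direction: rows of Names2 with no exact match in Names1.
only_in_2 <- anti_join(
  Names2, Names1,
  by = c("FirstNames2" = "FirstNames", "LastNames2" = "LastNames")
)

# Set-operation alternative on a single column: last names that
# appear in Names1 but not in Names2.
setdiff(Names1$LastNames, Names2$LastNames2)
```

With the sample data, only_in_1 and only_in_2 each contain three rows (the MacDougall/MacDougal, Yamazaki/Yamazaku, and Elizabeth Ray/Elizabeth pairs), which is the list I would then fix by hand. This only catches exact mismatches; it does not suggest which misspelled name corresponds to which.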