Question: what is the best command for comparing 2 datasets - something like proc compare in SAS ?
I could not edit this Comparison 2 data sets , So I decide to open this new topic.
I have the following example
old=Dataset(Insurance_Id=[1,2,3,5],Business_Id=[10,20,30,50],Amount=[100,200,300,missing],Account_Id=["x1","x10","x5","x5"])
new=Dataset(Ins_Id=[1,3,2,4,3,2],B_Id=[10,40,30,40,30,20],AMT=[100,200,missing,-500,350,700],Ac_Id=["x1","x1","x10","x10","x7","x5"])
The combination of insurance_id*Business_Id gives a unique id.
The delta is on the amount (absolute difference greater than 50) and the Account_Id.
The only matches here are (insurance_id,Business_Id) in {(1,10)} so I expect to have all other combinations.