cross_matches

powerplantmatching.matching.cross_matches(sets_of_pairs, labels=None)

Combines multiple sets of pairs and returns one consistent dataframe. Identifiers of two datasets can appear in one row even though they did not match directly but indirectly through a connecting identifier of another database.

Parameters:
  • sets_of_pairs (list) – list of pd.Dataframe’s containing only the matches (without scores), obtained from the linkfile (duke() and best_matches())

  • labels (list of strings) – list of names of the databases, used for specifying the order of the output