combine_multiple_datasets

combine_multiple_datasets#

powerplantmatching.matching.combine_multiple_datasets(datasets, labels=None, config=None, **dukeargs)#

Duke-based horizontal match of multiple databases. Returns the matched dataframe including only the matched entries in a multi-indexed pandas.Dataframe. Compares all properties of the given columns [‘Name’,’Fueltype’, ‘Technology’, ‘Country’, ‘Capacity’,’lat’, ‘lon’] in order to determine the same powerplant in different datasets. The match is in one-to-one mode, that is every entry of the initial databases has maximally one link to the other database. This leads to unique entries in the resulting dataframe.

Parameters:
  • datasets (list of pandas.Dataframe or strings) – dataframes or csv-files to use for the matching

  • labels (list of strings) – Names of the databases in alphabetical order and corresponding order to the datasets